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PAIRED-ASSOCIATE LEARNING AND THE 
TIMING OF AROUSAL * 


D. E. BERLYNE, DONNA M. BORSA, JANE H. HAMACHER, 
AND ISOLDE D. V. KOENIG 


University of Toronto 


In a paired-associate learning experiment, 75-db. white noise during 
presentation of stimulus and response terms in training trials sig- 
nificantly increased recall in a test trial held 1 day later. White noise 
after the response made no significant difference, and there was no 
significant interaction. White noise produced no effects on anticipa- 
tions during training or on a test held immediately after training trials. 
The results are discussed with reference to 4 kinds of hypotheses 
regarding the relations between arousal and reinforcement and to the 
possible outcomes of interaction between short-term detrimental effects 
of arousal on performance and durable facilitatory effects of arousal on 


learning. 


A previous paired-associate-learning 
experiment (Berlyne, Borsa, Craw, 
Gelman, & Mandell, 1965), in which 
white noise was used as a means of 
inducing arousal, showed items learned 
under white noise to be recalled less 
often than others during training but 
more often during a test given 1 day 
later. These findings tally with those 
reported by the Michigan group 
(Kleinsmith & Kaplan, 1963, 1964; 
Walker & Tarte, 1963), according to 
Which more arousing stimulus terms 
make. for worse immediate recall and 
better long-term recall than less arous- 
ing stimulus terms. 

1This investigation was supported by 
Research Grant APT 73 from the National 
Research Council of Canada and Research 
Grant MH-06324 from the National Institute 
of Mental Health, United States Public 
Health Service. 


The assumption that white noise 
raises arousal is supported by (a) 
neuroanatomical and neurophysiologi- 
cal evidence that all exteroceptive 
stimulation activates the reticular 
arousal system, (b) the finding that 
continuous white noise causes skin re- 
sistance to drop significantly over a 
period of 15-20 min. in conditions 
that would otherwise leave skin resist- 
ance virtually unchanged (Berlyne & 
Lewis, 1963), (c) the finding that 
sounds increase muscular tension 
(Davis, 1948), and (d) the concord- 
ance between the effects of white noise 
on paired-associate learning and the 
results of the Michigan experiments. 

The experiments to be reported here 
were undertaken in the hope of pin- 
pointing further the effects of white 
noise, and thus presumably of arousal, 
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by having white noise begin and termi- 
nate at different points in the learning 
process. Four conditions, represented 
in Fig. 1, were subjected to within-Ss 
comparisons, 

It was hoped that the findings might 
have some bearing on the problem of 
how arousal and reinforcement are re- 
lated. In contemporary English-lan- 
guage psychology, “reinforcement” is 
often treated as a synonym of “re- 
ward,” i.e., to denote a factor that 
promotes instrumental learning by 
strengthening a response that it closely 
follows. The original usage, exempli- 
fied by Pavlov’s writings, was broader 
(cf. Kimble, 1961, pp. 5-6). In the 
spirit of this Usage, we may regard 
as à source of "reinforcement" any 
factor other than the elements to be 
associated that strengthens the stimu- 
lus-response association. Since con- 
tiguity alone may weaken an associa- 
tion or leave its strength unchanged, 
Some source of reinforcement, in this 
broad sense, must be at work in all 
learning. How closely related verbal 
learning is to classical conditioning and 
instrumental learning, and how far, if 
at all, it is susceptible to the kinds of 
reinforcement that govern them, is, of 
course, still unsettled. Verbal learn- 
ing, must, however, be susceptible to 
reinforcing factors of some sort, since 
it is not ensured by contiguity alone, 
Moreover, there is plenty of evidence, 
in addition to the experiments already 
mentioned, that verbal learning is 
strongly influenced by arousal (eg., 
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Fic. 1. Experimental conditions. (Cross- 


hatching represents white noise.) 


Maltzman & Raskin, 1965; P. A, 
Obrist, 1962; W. D. Obrist, 1950; 
Schónpflug, 1963). 

Different expectations regarding the 
effects on learning of the four con- 
ditions depicted in Fig. 1 could be 
derived from current speculations re- 
garding motivational aspects of learn- 
ing. The Michigan group (Kaplan, 
1963; Kleinsmith & Kaplan, 1963; 
Walker & Tarte, 1963) holds the prin- 
cipal role of arousal to be in the con- 
solidation of memory traces during a 
period of perseveration, involving rev- 
erberating neural circuits, following the 
response. This would lead one to 
expect that white noise affer the re- 
sponse has been completed would bene- 
fit long-term recall, so that learning 
under Cond. II and III would surpass 
learning under the other conditions. 
An arousal-reduction or drive-reduc- 
tion hypothesis would predict that 
Cond, I, in which the white noise goes 
off just after the response has been 
made, would be most effective. If 
the degree of learning is believed to 
depend principally on the prevailing 
level of arousal, then Cond. IV should 
be least effective, and Cond. I and II 
should be equally effective, Condition 
III should surpass I and II or fall 
short of I and IT, according to whether 
the prevailing level of arousal depends 
on the total duration of white noise or 
on the number of shifts between white 
noise and silence, A view that has 
been widely favored among Soviet 
physiologists (e.g, Anokhin, 1958: . 
Vinogradova, 1959) asserts that the 
orientation reaction (a phasic rise in 
arousal) supplies the cerebral cortex 
with excitation from the brain stem, 
which facilitates, and in most cases 
is necessary for; the establishment of 
"temporary Connections." This view, 
like somewhat related views offered by 
American writers (eg., Miller, 1963; 
Razran, 1930), implies that arousal 
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does its work during the performance 
of the response, in which case Cond. I 
and III would be superior to the 
others. 


EXPERIMENT I 
Method 


Subjects, —The Ss were 64 female under- 
graduates taking elementary psychology 

*courses. 

'-. Material and design—A sequence of 40 
items, each with a disyllabic adjective as 
stimulus term and a disyllabic, familiar male 

k first name as response term, was constructed. 

It consisted of the 36 items used in the 

previous experiment (Berlyne et al, 1965, 

Exp. III) and four additional similar items. 

This sequence was divided into four sub- 

sequences of 10 items. 

Half of the Ss had the subsequences in the 
order 1, 2, 3, 4, and the remaining Ss had 
them in the order 2, 1, 4, 3, on training 
trials. Both groups had the order of sub- 
sequences reversed on test trials (see Table 
1). Each of these two groups of Ss was 
further split into four subgroups, having the 
four experimental conditions in four different 
orders during training trials in accordance 
with a counterbalanced Latin square, as 
shown in Table 1. 

Procedure.—Every S underwent three 
training trials in immediate succession on 
1 day and was brought back for a single 
test trial about 24-hr. later. The whole 
sequence of 40 items was presented three 
times running. White-noise conditions were 


TABLE 1 
Exp: I AND I] EXPERIMENTAL DESIGN 


Three Training Trials ZEN 
A Order of Experimental 
$ quences 
i 8 Ss.| 8 Ss | 8 Ss | 8 Ss 
First Group* 
O1 I|IV| II | III 4 
2 IL|II | IV |. I S 
à s a FIT I|IV 2 
I| II 

Second Group* i " 
TO IT} IV] II | III 3 
1 II} HI | IV I 4 
4 Ill} II I| IV 1 
3 IV I|III| II 2 


a32 Ss in each group. 


TABLE 2 


MEAN No. or ConnECTLY RECALLED 
RESPONSE TERMS ON Test TRIALS 


After Response 
During Stimuli 
and Response 
WN No WN Total 
Exp. I* 
WN 4.0 (III) | 41 (D| 81* 
No WN | 3.6 (II) | 3.6 (1V)| 7.2* 
Total 7.6 7.7 
Exp. II^ 
WN 5.9 (III) | 63. (1)| 12.2 
NoWN | 6.1 (II) | 62 (IV) | 12.3 
Total 12.0 12.5 


a Test trial after 24 hr. 
b Immediate test trial. 
* Significantly different from each other, p < .025. 


changed from one subsequence to the next. 
The test trial was conducted without any 
white noise. When white noise was present, 
its intensity was 75 db. 

During training trials, each stimulus term 
appeared alone for 4 sec. in the aperture of 
a Stowe memory drum, and then for an 
additional 2 sec. with the response term 
to the right of it. Following this there was 
an interval of 6 sec., with a blank appearing 
in the aperture, after which the next stimulus 
term came into view. The S was instructed 
to anticipate the response term whenever 
possible during the period when the stimulus 
term was on alone, but the response term had 
to be pronounced as soon as it appeared, 
whether or not it had been anticipated. 
During test trials, stimulus terms alone were 
presented for 4 sec. with intervals of 4 sec., 
and S was instructed to pronounce the 
corresponding response terms if possible. 


Results 

Test trials.—The mean number of 
response terms learned under the dif- 
ferent experimental conditions that 
were correctly recalled in the test trials 
are shown in Table 2. Analysis of 
variance showed that recall was sig- 
nificantly better when items had had 
white noise during the presentation of 
stimulus and response terms (ie. in 
Cond. I and III): F (1, 168) = 5.16, 
p «.025. Whether white noise was 
present (Cond. II and III) or absent 
(Cond. I and IV) after the response 
made no significant difference, and 
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there was no significant interaction: 
F <1 in both cases. None of the 
interactions involving order of experi- 
mental conditions and of subsequences 
reached significance: F «1.6 in all 
cases. 

Training trials.—The mean number 
of correct anticipations on Training 
Trial 3 (12.1) was significantly greater 
than the corresponding mean of Train- 
ing Trial 2 (5.1) : F (1, 392) — 393.8, 
p «001. There were, however, no 
significant effects and no significant 
interactions due to white-noise condi- 
tions: F «1 in all cases. 


ExPERIMENT II 


Experiment II differed from Exp. I 
solely in the timing of the test trial, 
which was presented immediately after 
the third training trial. Otherwise, the 
experimental material, design, and 
procedure were the same. 

Experiment II had two purposes. 
First, it was hoped to investigate pos- 
sible effects of the various white-noise 
conditions on immediate recall, with a 
view to finding out more about the 
short-term detrimental effect of arousal 
on recall observed in the Michigan 
experiments and in the previous ex- 
periment by Berlyne et al. (1965, Exp. 
IIT). Secondly, it was hoped to meas- 
ure how much of the material learned 
under different white-noise conditions 
is retained immediately after training, 
So as to test the hypothesis that the 
significant effect observed in Exp. I 
represents differential forgetting. 


Method 


Subjects.—Once again, Ss were 64 female 
undergraduates, taking elementary psychol- 
ogy courses, none of whom had participated 
in Exp. I. Since Exp. II was conducted 
later in the academic year, a high proportion 
of its Ss had to be taken from Honors 
courses. Furthermore, by the time they 
took part in the experiment, they, unlike the 
Ss of Exp. I, had learned about paired- 
associate learning in their course work, 


Results 


Neither the test trial nor Training 
Trials 2 and 3 produced any signifi- 
cant differences or interactions involv- 
ing white-noise conditions. Response 
terms learned under Cond. I and III 
were actually recalled slightly less fre- 
quently than those learned under Cond. 
II and IV (see Table 2). 

The mean recall scores on Training 
Trial 2 and Training Trial 3 were 
7.4 and 12.2, respectively. The in- 
crease from the one to the other is 
significant at the .001 level: F (1, 392) 
= 284.59, p < .001. 

Comparison between Exp. I and II. 
—The mean recall scores over Train- 
ing Trials 2 and 3 were significantly 
higher in Exp. II than in Exp. I: 
F (1, 112) 29.05, p « 005. This 
was presumably due to the differences 
mentioned above under "Subjects." 

When the two experiments were 
compared with respect to the difference 
in recall scores on test trials between 
items learned under Cond. I and III 
and items learned under Cond. II and 
IV (ie, differences due to presence 
or absence of white noise during pres- 
entation of stimuli on training trials), 
the interaction yielded an F (1, 336) 
of 3.52, for which p~.06. This 
means that the one-sided hypothesis of 
differential forgetting is confirmed at 
the .03 confidence level. 

The difference in recall during train- 
ing trials between Ss of the two ex- 
periments suggests the alternative hy- 
pothesis that the difference failed to 
appear in Exp. II because Ss were 
abler rather than because they were 
tested immediately after training. It 
can, however, be rejected. Correla- 
tion coefficients between the total num- 
ber of correct anticipations during 
Training Trials 2 and 3 and the dif- 
ference between numbers of items re- 
called on the test trial that had been 
learned under Cond, I and III and 
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under Cond. II and IV were computed. 
These coefficients came to —.03 for 
Exp. I and +.07 for Exp. II. 


Discussion 


The significant difference found in Exp. 
I on the test trial taking place on the day 
following training favors the kind of 
hypothesis that ascribes reinforcing ef- 
fects to arousal during performance of 
the response to be learned or during 
presentation of the stimulus with which 
it is to be associated. Further work will 
be needed before we can ascertain the 
relative importance of arousal accom- 
panying the stimulus and arousal accom- 
panying the response. The lack of an 
interaction with the presence or absence 
of arousal after the response speaks 
against an arousal-reduction or drive- 
reduction hypothesis. Perseveration hy- 
potheses are disfavored by the fact that 
white noise after the response made no 
significant difference. Since it evidently 
matters when white noise is applied, the 
claims of hypotheses stressing the overall 
level of arousal during learning are like- 
wise weakened, 

The comparison between Exp. I and 
II indicates that, immediately after train- 
ing, items learned with white noise ac- 
companying the stimuli and the response 
are recalled about as well as the other 
items, but that fewer of the former are 
forgotten over the succeeding 24 hr. 

Experiment II failed to produce the 
detrimental effect of white noise on an- 
ticipation of responses during training 
that was found in the previous experi- 
ment (Berlyne et al., 1965, Exp. III). 
Consequently, it did not achieve its pur- 
pose of throwing further light on this 
effect. Several differences could have 
made the present Exp. IT too insensitive 
to show up such an effect, including the 
smaller number of Ss, the different com- 
bination of experimental treatments, and 
the fact that lists of 40 items, rather 
than of 18 items, were used on training 
trials. 

There are reasons, both experimental 
and theoretical for believing that the 


effects of arousal immediately after learn- 
ing are variable and complicated. 

On the one hand, the Michigan experi- 
ments and two Toronto experiments ( Ber- 
lyne et al., 1965, Exp. II and III) found 
recall to be worse immediately after train- 
ing with items producing high GSRs and 
items learned under white noise, respec- 
tively. Other experiments indicate, how- 
ever, that increases in arousal do not 
always have detrimental short-term ef- 
fects on recall, as the Michigan group 
maintain, but may, in some conditions, 
improve recall. Moderate auditory stimu- 
lation has been found to favor immediate 
recall in paired-associate learning (Ber- 
lyne et al, 1965, Exp. Il; Hörmann & 
Todt, 1960) and serial learning (Schón- 
pflug & Schafer, 1962). Another ex- 
periment has shown immediate recall of 
emotional words, productive of intense 
GSRs, to be superior in serial learning 
(Schónpflug & Beike, 1964).? It seems 
likely, in fact, that there is an optimum 
degree of arousal for immediate recall, 
the location of the optimum varying 
widely with circumstances. 

Since the reported detrimental effects 
are confined to a short period after train- 
ing, they must be accounted effects on 
performance rather than effects on learn- 
ing (Berlyne et al, 1965; Berlyne, 
Salapatek, Gelman, & Zener, 1964), The 
generalization that measures of perform- 
ance are curvilinearly related to level of 
arousal, with variations in the location 
of the optimum depending on several fac- 
tors, has been put forward by several 
writers with substantial experimental 
backing (e.g, Bélanger & Tétreau, 1961; 
Duffy, 1962; Malmo, 1959; Schlosberg, 
1954). 

The favorable long-term effect of 
arousal on recall qualifies as a. reinforce- 
ment effect in the broad sense of the term. 
Reinforcing effects on learning can be 
demonstrated only if there is a sufficient 
delay between training trials and test 


? An experiment by Maltzman and Raskin 
(personal communication) has also shown 
that words evoking more intense orientation 
reactions are more likely to be recalled in an 
immediate test. 
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trials for transient cue effects and tran- 
sient effects on motivational condition 
to dissipate (Berlyne et al., 1965; Ber- 
lyne et al., 1964). Nevertheless, effects 
of all known reinforcing agents are also 
in evidence immediately after training. 
In the experiments mentioned, the supe- 
rior consolidation of items learned under 
conditions of high arousal must have been 
counteracted by a detrimental effect on 
performance. If we have two such ef- 
fects working in opposite directions, it 
is only to be expected that sometimes one 
will overshadow the other, sometimes the 
other will prevail, and sometimes they 
will cancel out. 
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INTENSITY-TIME RELATIONSHIP AND PERCEIVED SHAPE* 


H. W. LEIBOWITZ, SHARON E. TOFFEY, anv JOHN L. SEARLE 


Pennsylvania State University 


The effect of exposure duration on perceived shape was determined for 
intensity-time combinations which were adjusted to produce an equal 
amount of effective photolytic energy in accordance with the reciprocity 
relationship. Matched shape tends to remain constant for the shorter 
exposure durations, but increases with exposure duration, particularly 
above the critical duration of .1 sec. The results are interpreted as 
reflecting the importance of temporal summation within the visual system 
in the perception of shape, and the critical contribution of time, inde- 
pendent of intensity, in the manifestation of the tendency toward shape 


constancy. 


The intensity-time relationship has 
played a significant role in visual ex- 
perimentation. A number of measures 
of visual  discrimination—absolute 
threshold (Davy, 1952; Graham & 
Margaria, 1935; Karn, 1936; Long, 
1951; Sperling & Jolliffe, 1965), in- 
tensity discrimination (Graham & 
Kemp, 1938), visual acuity (Graham 
& Cook, 1937), perception of motion 
(Brown, 1955), digit identification 
(Kahneman & Norman, 1964), and 
perception of number (Hunter & Sig- 
ler, 1940) can be predicted from the rec- 
iprocity relationship J x T =C. For 
these measures, below a critical dura- 
tion of approximately .1 sec. the same 
visual effect is obtained behaviorally 
when the product of intensity and time 
is constant. For durations longer than 
4 sec, the law fails, presumably be- 
cause the initial events which influ- 
ence the response have already been 
determined (Hartline, 1934). 

It is of interest to determine the ex- 
tent to which the reciprocity law is ap- 
plicable to other visual functions which 
are influenced by the variables of time 
and intensity. The perception of shape 
has been shown in a previous study 


1 Supported in part by Research Grant 
MH08061 from the National Institute of 
Mental Health, Public Health Service. 


(Leibowitz & Bourne, 1956) to be in- 
fluenced by both of these variables, al- 
though the precise nature of this rela- 
tionship has not been determined. The 
purpose of the present study is to ob- 
tain shape-matching functions at vari- 
ous durations of exposure for which 
intensity has been compensated to pro- 
duce an equal J X t product. In addi- 
tion, we wish to determine the extent 
to which the phenomenon of shape 
constancy, the tendency to perceive the 
true shapes of objects despite variation 
in retinal shape resulting from oblique 
viewing, is manifested for stimuli 
which provide a constant amount of 
stimulus energy. 

The significance of these determina- 
tions is related to the importance of 
temporal summation in the visual sys- 
tem, If the critical events determining 
the perception of shape are completed 
within the critical-duration period, 
shape perception should be considered 
theoretically on the same basis as the 
other measures of discrimination which 
are in accordance with the reciprocity 
relationship. Whether or not the 
tendency towards shape constancy is 
also manifested within the critical-du- 
ration period will indicate the impor- 
tance of exposure duration with re- 
spect to this phenomenon. 
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TABLE 1 


Megan MATCHED Axis RATIO AS A FUNCTION 
or EXPOSURE DURATION 


Duration of Exposure in Msec. 


.0001 | .001 | .01 0.1 1.0 


Mean Matched 
Axis Ratio 53 | .54 | .54 | .55 | .59 
SD .03 | .03 | .03 | .04 | .05 


Note.—The retinal axis ratio of the obliquely viewed 
circular test object was .50. The luminance of the 
stimulus was adjusted so as to produce a constant 
amount of effective photolytic energy in accordance 
with the reciprocity relationship. 


METHOD 


The apparatus consisted of a Scientific 
Prototype three-channel tachistoscope in 
which high-intensity lamps were controlled 
by electronic circuitry. The test object, a cir- 
cular disk 3 cm. in diameter and 3 mm. thick, 
was cut from white plastic (reflectance 62%) 
and mounted on a turntable which was 
covered with black flock paper (reflectance 
85%). The turntable was oriented at a 
constant angle of 30° to S’s line of vision 
so that it projected an elliptical retinal image 
with an axis ratio (ratio of minor to major 
axis) of .50. The adapting field contained 
a series of 22 white ellipses viewed against 
a black background. The axis ratio of the 
ellipses ranged from 1.00 to .071 in approx- 
imately equal steps. The S's task was to 
choose the ellipse which "looked" the most 
like the test object. The Ss were told, 


On the left-hand side of your visual field 
you will see a geometrical figure which will 
be flashing on and off. On the right-side 
of your visual field you will see the same 
kind of figure which also will be flashing 
on and off. Your task will be to match the 
figure on the left with the one on the right 
that looks most like it. You can vary the 
size of the figure which will apear on the 
right by turning the set of knobs by your 
right hand. Turning the top knob will 
bring larger and larger figures into view 
whereas turning the lower one will bring 
smaller figures into view. There is a wide 
range of sizes from which you may choose, 
and on your first trial we would recom- 
mend that you test the limits of your range 
of choices. Are there any questions? We 
are going to turn out the lights for 5 
minutes before we begin the experiment. 
This is to allow your eyes to dark adapt. 


Exposure durations of .1, 1.0, 10.0, 100.0, 
1,000.0 msec. were used. Luminance was set 
at the maximum available (75 footlamberts 
[ftl.]) for the shortest exposure duration and 
attenuated by a factor of 10 for each of the 
succeeding exposure durations up to 100 
msec.—keeping the intensity-time product 
constant for what has been previously de- 
termined to be the critical period. The lumi- 
nance was not reduced for the duration of 
1,000 msec. since this time is above the 
“critical duration” and the reciprocity func- 
tion would not be applicable. For visual 
measures for which the reciprocity relation- 
ship holds, all of these intensity-time com- 
binations would be equivalent, and approx- 
imately one log unit above the absolute foveal 
threshold. Each S was tested under all five 
conditions according to a Latin-square de- 
sign, The Ss were 50 students recruited 
from an introductory psychology course. All 
had 20/20 vision and were naive as to the 
purpose of the experiment. 


RESULTS AND DISCUSSION 


The data, presented as the mean 
matched axis ratio as a function of ex- 
posure duration, are presented in Table 
1 and plotted in Fig. 1. As a conse- 
quence of the experimental design, it 
is possible to compare these data with 
several theoretical predictions. An 
agreement with the reciprocity rela- 
tionship would be indicated by matches 
at a fixed axis ratio, ie, a constant 


MEAN MATCHED AXIS RATIO 
in 
E 


i 


LAW OF THE RETINAL IMAGE 


On 10 10 100 1000 


EXPOSURE DURATION (MILLISECONDS) 


Fic. 1. Mean matched axis ratio as a 
function of exposure duration for a circular 
test object inclined 30° to the direction of 
vision (retinal axis ratio .50). (The lumi- 
nance of the stimulus was adjusted so as to 
produce a constant amount of effective photo- 
lytic energy in accordance with the reciproc- 
ity relationship.) 


RECIPROCITY LAW AND SHAPE PERCEPTION 9. 


behavioral effect, since the stimuli 
were adjusted to produce, in all cases, 
a fixed amount of effective energy. 
With respect to perceptual constancy, 
the “law” of the retinal image, a condi- 
tion in which matched shape is predict- 
able from the shape of the retinal im- 
age, is also indicated by a horizontal 
Jine representing a matched axis ratio 
of .50. At the other theoretical ex- 
treme, the “law” of shape constancy, 
which predicts that matched shape 
should be in accord with the true shape 
of the test object, would be here repre- 
sented by a horizontal line at a 
matched axis ratio of 1.00. 

The empirical results are in close 
agreement with both the reciprocity re- 
lationship and retinal image predic- 
tions for the shorter exposure dura- 
tions. At the longest duration, these 
theoretical predictions are no longer 
in accord with the data obtained. The 
matched axis ratios increase, implying 
both the inapplicability of the reciproc- 
ity prediction and a tendency toward 
shape constancy. 

With respect to the reciprocity pre- 
diction, it will be noted that up to 10 
msec. the curve is flat, indicating good 
agreement with theory. There is a 
slight rise in the curve between 10 and 
100 msec., implying an incipient tend- 
ency to deviate from strict reciprocity. 
At 1,000 msec., the deviation is marked. 
Although Hartline (1934) obtained a 
sharp cut-off for the single optic nerve 
fiber of Limulus, variability in the re- 
gion of the critical duration is typical 
of data obtained in the human eye. 
This variability is attributable to retinal 
interaction and increases with large 
test fields (Graham & Margaria, 1936). 

The close approximation to the reci- 
procity prediction for the shorter du- 
rations and the marked deviation from 
it at the longest duration, indicate that 
this relationship is indeed applicable 


to the perception of shape. It should 
be noted that Ss were able to make 
consistent shape judgments for all in- 
tensity-time combinations even though 
the shortest duration was only .0001 
sec., and the overall energy level ap- 
proximately one log unit above absolute 
foveal threshold. By inference, sum- 
mation within the visual system is as- 
sumed to be critical to the perception 
of shape. Thus, shape discrimination, 
within the limits of the critical dura- 
tion, may be considered to be mediated 
by the same processes which have been 
shown to be involved in a variety of 
measures of visual discrimination. 

Although temporal summation is 
critical to the perception of shape per 
se, summation within the critical dura- 
tion period is not sufficient for the oc- 
currence of the phenomenon of shape 
constancy. For the shorter durations, 
the matched axis ratios lie fairly close 
to the theoretical prediction based on 
the “law” of the retinal image, indi- 
cating a very slight tendency, of the 
order of 7-8%, toward shape con- 
stancy. The rapid rise in the curve, 
in the direction of the constancy effect, 
does not occur until after the critical 
duration period. Comparing the data 
for exposures below critical duration 
with those obtained at 1,000 msec., it 
is apparent that the manifestation of 
the tendency toward shape constancy 
does not result solely from the total 
excitation produced by the stimulus, 
but rather is strongly dependent on ex- 
posure duration. Assuming that ade- 
quate energy is available to permit 
shape discrimination, the tendency to- 
ward shape constancy is seen to be 
dependent upon mechanisms in addi- 
tion to those which are activated within 
the critical duration period. These in- 
volve, at the very least, durations of 
exposure beyond the critical duration 
for their manifestation, 
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EFFECTS OF A READY SIGNAL UPON 
EYELID CONDITIONING 


BARBARA B. TURNER 
Duke University 


4 groups of 20 Ss participated in a factorially designed eyelid-condi- 
tioning experiment where a ready signal (RS) was present or absent 
on Trials 1-60 or 61-90. On RS trials the signal preceded the CS by 


5 different temporal intervals. 


without an RS than with one. After 30 trials, 
performances were obtained at the different RS-CS intervals, 
with best conditioning. Introducing or omitting 


sec. being associated 


After 4-6 trials, conditioning was better 


consistently different 
with 2.0 


the RS after 60 trials had no effect. 


In experiments on eyelid condition- 
ing it is often convenient to present a 
ready signal (RS) before each trial to 
announce that the trial is about to be- 
gin. Several studies of the effect of 
this procedure (Dufort & Kimble, 
1958; King, Kimble, Gorman, & King, 
1961; McAllister & McAllister, 1960a, 
1960b; Price, Vandament & Abbott, 
1964; Prokasy & Whaley, 1962) have 
all shown that conditioning is poorer 
with an RS than without one. Beyond 
this fact, however, little is known about 
the effect of a ready signal and no 
generally accepted theoretical interpre- 
tation exists. The primary aim of this 
study was to gather data from which 
testable hypotheses may be generated. 

The most important maneuvers in 
this investigation were: (a) There was 
a systematic study of the effect of the 
length of the temporal interval by 
which the RS preceded the CS. While 
previous eyelid-conditioning studies 
have varied the RS-CS interval, none 


1 This paper is based on a dissertation sub- 
mitted to Duke University in partial fulfill- 
ment of the requirements for the degree of 
Doctor of Philosophy. The author is grate- 
ful to Gregory A. Kimble for his assistance 
in directing the project and preparing the 
paper. The research received financial sup- 
port from two grants, G-7079 from the Na- 
tional Science Foundation and MH08090 
from the National Institutes of Health. 


1i 


of them has presented an analysis of 
the level of conditioning obtained with 
the different intervals. (b) For half 
the Ss, the RS was introduced or dis- 
continued after 60 conditioning trials 
in an attempt to determine whether its 
effect is upon learning or performance. 


METHOD 


Apparatus.—The equipment used has been 
described elsewhere (Nicholls & Kimble, 
1964). In brief, it consists of a Grass poly- 
graph which records the deflections of the 
arms of a microtorque potentiometer linked 
to S’s right eyelid by a false eyelash and 
nylon thread. 

The CS was the onset of a circle of red 
light, 1.25 in. in diameter, provided by a 
Grason-Stadler Model E4580 multiple-stim- 
ulus projector located about 24 in. in front of 
S and about 25 in. above eye leve. The CS 
terminated at the onset of the UCS, which 
was a 50-msec. puff of air delivered about 
3 in. from the temporal corner of $'s right 
eye through a dr-in. nozzle at a pressure 
capable of supporting a 180-mm. column of 
mercury. The interstimulus interval was 
500 msec. The RS was a white noise of 
100-msec. duration and approximately 70-db. 
SPL intensity produced by a Grason-Stadler 
noise generator. Tntertrial intervals varied 
haphazardly from 10 to 20 sec. around an 
average of 15 sec. All temporal intervals 
were controlled by Grason-Stadler equipment. 

Subjects.—The Ss were 107 women drawn 
from a course in introductory psychology. 
Twenty-seven Ss were discarded: 13 because 
of apparatus difficulties, 7 because the records 
were illegible, 3 for failure to obey instruc- 
tions, 2 as voluntary responders, and 2 be- 
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Fic. 1. Mean percentage of CRs for all 
groups in blocks of 15. (The curves are 
broken at the point where an RS was in- 
troduced or omitted for two. groups.) 


cause of a decrease in the amplitude of the 
UCR to one fourth its value on two test 
trials before conditioning. 

Design and procedure.—Twenty Ss were 
assigned serially to each cell of a 2 X 2 fac- 
torial design in which the two factors were 
(a) the presence or absence of an RS on (b) 
the first 60 and/or final 30 trials, One group 
(NRS-NRS) received no ready signal dur- 
ing either series of trials. Another group 
(RS-RS) received the signal during both 
series. For a third group (NRS-RS), the 
RS was introduced on Trial 6l. The final 
group (RS-NRS) received the signal only 
on Trials 1-60. 

For the groups receiving a ready signal, 
RS preceded CS on equal numbers of trials 
by one of five intervals: 1, 2, 3, 5, and 7.5 
sec, Each block of 15 trials contained three 
presentations in a haphazard order of the 
stimuli at each of the five intervals. 

Instructions similar to those used by Du- 
fort and Kimble (1958) were read to Ss after 
they were seated in the experimental cubicle, 
Then there were, in order, 2 trials with the 
CS alone, 2 trials with the UCS alone, and 
the 90 conditioning trials A 3-min. rest 
period followed the sixtieth trial. 


RESULTS 


All responses with latencies of 200- 
500 msec. and an amplitude of at least 


2 mm. on the polygraph record were 
counted as CRs. Figure 1 presents 
the percentages of CRs for all groups 
in blocks of 15 trials. The difference 
between the performance of the NRS 
and RS groups was significant at the 
.005 level, F (1, 78) = 11.10, for all 
trials. A factorial analysis of variance 
performed on the measures obtained on 
Trials 61-90 reveals that the only sig- 
nificant source of variation is the RS 
condition on Trials 1-60, F (1, 76) = 
4.41; 05> p > .025. (The difference 
between the two NRS groups on Trials 
31-60 appearing in Fig. 1 must be the 
result of sampling error since these 
groups were treated the same and Ss 
were assigned to conditions without 
bias.) 

Figure 2 presents percentages of 
CRs on Trials 31-60 for the two RS 
groups combined plotted as a func- 
tion of the length of the RS-CS inter- 
val A Treatment X Ss analysis of 
variance reveals that the differences 
depicted in Fig. 2 are significant, F (4, 
156) = 2.69; p = .05. 

The data for Trials 61-90 provided 
another opportunity to examine the 
function relating level of conditioning 
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Fic. 2. Mean percentage of CRs for Trials 
31-60 as a function of RS-CS interval. 


PREPARATORY INTERVAL IN EYELID CONDITIONING 


of CRs on these trials for the five in- 
tervals (from shortest to longest) were 
61.25, 72.50, 72.08, 68.33, and 69.58. 
The pattern of these values is similar 
to that shown in Fig. 2 and an analysis 
of variance showed the differences 
among these points to be significant, 
F (4, 156) 23.14; p= .025. Both 
sets of data suggest that conditioning 
with an RS is best at an RS-CS in- 
terval of about 2.0 sec. and that longer 
and shorter intervals result in poorer 
conditioning. 

A. question related to the analyses 
just presented is whether conditioning 
in the NRS groups was better than at 
the optimal RS-CS interval of 2.0 sec. 
The mean percentage of CRs for the 
NRS groups on Trials 31-60 and for 
the RS groups at the 2.0-sec. interval 
were 72 and 63, respectively, but the 
indicated difference is not significant 
(t = 123). 


Discussion 


Obviously the manipulation of the 
RS-CS interval in conditioning is opera- 
tionally analogous to manipulating the 
length of the foreperiod in a reaction-time 
(RT) study. There are also certain 
similarities in the effects of the two pro- 
cedures. It has been known since the 
very early investigations (Woodrow, 
1914) that length of the foreperiod is an 
important determinant of RT. In studies 
with a constant foreperiod, Woodrow 
found an optimal interval at 2-4 sec. a 
fact that has been interpreted in terms 
of a "getting ready" on the part of S. 
There is some evidence that the process 
of becoming prepared is partially a mat- 
ter of developing muscular tension. Thus, 
Fink (1954) found that Ss in an RT 
task responded to a ready signal with 
increasing muscular action potential 
which reached a maximum in about 2.0 
sec. Up to a point, an interpretation 
based on these RT results fits our con- 
ditioning data quite well The function 
relating conditioning to the RS-CS inter- 
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val is like that obtained in RT studies 
and the highest level of conditioning 
occurred at the interval which Fink found 
to be associated with the maximal state 
of muscular tension. 

Although these parallels suggest the 
posssibility of explaining the RS effect 
in terms of preparedness, such an ex- 
planation encounters difficulty with the 
data on conditioning without an RS. 
Under these circumstances, there is no 
signal to initiate the postulated state of 
preparedness but conditioning is better 
than with the RS. There is even some 
indication that conditioning without an 
RS may be better than when the state 
of preparedness is optimal. 

This contradiction is one of the reasons 
for concluding that it is necessary to con- 
sider other possible interpretations proba- 
bly as supplements to rather than alterna- 
tives for the RT explanation. A close 
inspection of the data suggests that the 
situation created by the use of an RS 
involves at least two separable processes: 
(a) the process of preparedness discussed 
above that is associated with the particu- 
lar RS-CS interval on a given trial, and 
(b) a more general process associated 
with the presence or absence of the RS, 
without respect to the length of the inter- 
val. The desirability of making a dis- 
tinction of this type may be demonstrated 
by considering the effect of the RS in 
relation to the number of conditioning 
trials. Although 30 or more trials were 
required for the effect of the specific RS- 
CS intervals to appear, the difference 
between the curves for the RS and NRS 
groups was already well established in 
15 trials. An analysis of variance per- 
formed on the data from the RS and NRS 
groups for Trials 1-60 reveals that there 
is not a significant interaction. between 
successive blocks of 15 trials and experi- 
mental conditions, F (3, 24) 2228. The 
curves are parallel after some point in 
the first trial block. 

In order to determine how quickly the 
RS-NRS difference is established, per- 
formance on Trials 1-15 was analyzed in 
blocks of 3 trials. The results of two 
analyses of variance are informative. The 
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first of these analyses was performed on 
the entire set of 15 trials. The important 
outcome was a significant interaction be- 
tween experimental condition and trials, 
F (4, 312) = 11.99; p= .001, indicating 
that the curves are diverging. A similar 
analysis for Trials 4-15, however, did 
not yield a significant interaction. Thus, 
the RS and NRS functions may be con- 
sidered parallel from Trials 4-6. 
Although established early in condi- 
tioning, we have seen that the effect of 
the RS is an enduring one, surviving the 
discontinuing of the RS. As the results 
of factorial experiments are commonly 
interpreted, this means that the influence 
of the RS is upon learning rather than 
performance, Although we accept this 
interpretation, the analysis of the data 
from Trials 1-15 raises a question as to 
the aspect of learning which has been 
manipulated by the use of the RS. Spe- 
cifically, although use of the RS produced 
a decrement in the percentage of CRs 
during the very early trials, the overall 
percentage continued to increase for 
many additional trials. This fact appears 
to require the conclusion that at least two 
forms of learning were present in this 
conditioning situation: One of these in- 
volved an increase in the tendency for 
the CS to elicit a conditioned blink; the 
other was the acquisition of some general 
reaction (“attitude ?" “set?”) closely as- 
sociated with the RS. The influence of 
the RS, although learned, seems to have 


been on the latter process and not directly 
on the conditioned eyeblink. 
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SIMPLE REACTION TIME AS A FUNCTION OF THE RELA- 
TIVE FREQUENCY OF THE PREPARATORY INTERVAL 


THEODORE P. ZAHN Au» DAVID ROSENTHAL? 
National Institute of Mental Health 


Simple auditory reaction 


time (RT) was investigated in relation to the 


length and relative frequency of each member of 2 pairs (1 and 3 sec. 


regular sequence. 
2 PIs was a decreasing 


and 3 and 10 sec.) of preparatory intervals (PI) presented in an ir- 
For each pair, RT on trials with the shorter of the 
function of the relative frequency of that PI, 


even when the length of the PI on the preceding trial (PPI) was con- 
trolled.” The detrimental effects of 
longer than for the shorter pair of 


quency are attributed to “expectancy,” 


long PPIs were greater for the 
PIs. The effects of relative fre- 
' and the effects of the PPI are 


attributed to its influence on time estimation. 


In experiments on simple reaction 
time (RT) in which the preparatory 
interval (PI—interval between warn- 
ing signal and stimulus) is varied from 
trial to trial in an unpredictable man- 
ner, it has usually been found that RT 
is a negatively accelerated decreasing 
function of the length of the immediate 
PI, at least up to some intermediate 
value of the PI. This is due in part to 
the fact that RT on a given trial is im- 
paired if the PI on the preceding trial 
(preceding PI or PPI) is longer than 
the immediate PI (Drazin, 1961; Kar- 
lin, 1959; Klemmer, 1956; Woodrow, 
1914). The latter finding has some- 
times been attributed to a decrease in 
the expectancy of a short PI produced 
by a preceding long one—S is said to 
be “caught napping” (Karlin, 1959). 

In two studies which used a small 
number (three to five) of different PI 
lengths, it was found that marked and 
consistent effects of the immediate PI 
occurred only when the PPI was 
longer than the PI (Karlin, 1959; 
Zahn, Rosenthal, & Shakow, 1963). 


1 The authors gratefully acknowledge the 
assistance of John A. Cooley who designed 
and constructed the apparatus and tested 
most of the Ss, and Daniel Primac who 
helped to analyze the data, 
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Drazin (1961), however, who used a 
continuous distribution of PI lengths, 
found greater PI effects when the PRI 
was short. The results of Karlin and 
of Zahn et al. suggest the possibility, 
at least when a small number of dis- 
crete PI lengths are used, that the im- 
mediate PI will affect RT only to the 
extent that S is unprepared for the oc- 
currence of a PI of that length. 

The purpose of the present experi- 
ment is to determine the effect of the 
immediate PI on RT as a function of 
the probabilities of occurrence (rela- 
tive frequencies) of the different PIs 
in the series. Drazin (1961) investi- 
gated the effects of reducing the prob- 
ability of stimulus occurrence by means 
of introducing a large number (up to 
50%) of “check” trials in which no 
stimulus occurred. He found that de- 
creases in stimulus probability pro- 
duced increased RT, especially on the 
longer PIs in the series. In effect, 
however, this procedure increases the 
number of long PPIs. Thus, what is 
not clear from his data is the extent 
to which the increase in RT was due 
to the increase in the proportion of 
times the PPI was longer than the PI. 
In the present study particular atten- 
tion will be paid to the effects of the 
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relative frequency of the PI when the 
length of the PPI is controlled. 


METHOD 


Subjects —The S’s were 10 college stu- 
dents 19-23 yr. of age. Nine were male and 
1 was female. 

Apparatus.—The apparatus consisted of a 
telegraph key, a Standard Electric timer 
which was accurate to 5 msec., an audio os- 
cillator, an amplifier and speaker which pro- 
duced a 300-cycle tone at 70 db. (re .0002 
dynes/cm?), a 15-w. incandescent lamp en- 
closed in a box with a translucent face, and 
an automatic programing device. The latter 
consisted essentially of a motor-driven 
sprocket which pulled 35-mm. film leader 
under a set of five brush contacts. These 
controlled the peripheral apparatus by means 
of slots of varying lengths which were cut 
in the film. When a brush “hit” a slot it 
closed a circuit which activated one of the 
relays inside the unit. There was a separate 
program for each of the 10 conditions of 
the experiment. 

Procedure.—'The reaction-time task con- 
sisted of the presentation of a .3-sec. light 
flash as the warning signal, followed by an 
unfilled delay (the PI) and then the tone 
stimulus to which S' responded by releasing 
the telegraph key. The tone ended when the 
key was released. The intertrial interval was 
normally 7 sec. The S was required to de- 
press the key again within 4 sec, after his re- 
sponse, Failure to do this, which was infre- 


REACTION TIME IN MILLISECONDS. 


L 1 1 ny 
3 7 EJ 


RELATIVE FREQUENCY 


Fic. 1, Reaction time as a function of the 
relative frequency of the PI for each PI 
length—all trials combined. (The subscripts 
refer to the PI with which the 3-sec, PI was 
paired.) 


quent, increased the intertrial interval pro- 
portionately. Reaction time was recorded 
by hand. 

There were 10 conditions, 2 conditions of 
PI length and 5 of relative frequency. Half 
the conditions involved PIs of 1 and 3 sec. 
and half involved PIs of 3 and 10 sec. The 
relative frequencies of the members of the 
pairs were 9 —.1, .7 —.3, .5 — .5, .3 — 7, and 
-1—.9 in different conditions. 

Each session consisted of 100 trials on one 
condition given in two identical blocks of 50 
trials. There was a 3-min. rest period be- 
tween blocks. Trials with each of the two 
PIs were presented in an irregular random 
sequence, with two restrictions. One of these 
was that each successive series of 10 trials 
included trials on the two PIs with the same 
relative frequency that was specified for the 
entire series. That is, on a condition where 
the l-sec. PI was presented .3 of the time 
and the 3-sec. PI .7 of the time, the 1-sec. 
PI occurred 3 times in the first 10 trials, 3 
in the second 10, and so on. The other con- 
straint was that the number of runs of dif- 
ferent lengths of each PI matched as closely 
as possible the statistical expectancy for that 
particular relative frequency. 

Because of factors beyond control not all 
the Ss were tested in all the sessions. Three 
Ss were given 8 sessions, four were given 9, 
and three were given 10. The Ss were gen- 
erally given one session per day, excluding 
weekends. The sequence of conditions was 
different for each S$ and was balanced by 
means of a modified Latin-square design. 

Each S had a practice session of over 100 
trials of RT prior to the start of the main 
experiment. Before each session $ was 
given 6-10 warm-up trials. 


RESULTS AND DISCUSSION 


„Since only minor and nonsignificant 
differences in the results for the first 
and second blocks of 50 trials occurred, 
they are combined for the presentation. 

Figure 1 shows the relation between 
RT and relative frequency of occur- 
rence for each PI over all trials, For 
the shorter member of each pair there 
Is a negative relationship between rela- 
tive frequency and RT, but for the 
longer PIs of the pairs no marked re- 
lationship exists. The data in this fig- 
ure and in subsequent ones were ana- 
lyzed by computing the least-squares 
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slopes of the curves for each S and 
comparing the mean slopes to zero or 
to one another by £ tests. 

Analyses of the data plotted in Fig. 
1 show that, for the shorter PI in each 
pair (1 and 3,, sec.)?, the mean slopes 
of the functions relating RT to the 
relative frequency of the PI are sig- 

„nificantly different from zero (p< 
001 and p< .02, respectively). The 
mean slopes for the longer PI in each 
pair (3, and 10 sec.) do not differ 
significantly from zero (p> .20). 
Also, for each pair, the slopes for the 
longer and shorter PIs differ signifi- 
cantly from one another (p < .01 and 
< .025 for the 1-3 and 3-10 pairs, re- 
spectively). Further, the difference 
between the slopes of the two 3-sec. PI 
curves is also significant (p < .05). 
The latter result supports the findings 
of many investigators that in situa- 
tions of this type the absolute length 
of the PI is a less important variable, 
in this regard, than is the context in 
which the PI occurs. 

These data show that on trials with 
the shorter of the two PIs RT is closely 
related to the proportion of times that 
PI occurs in the series. However, this 
result could be due entirely to the effect 
of the PPI rather than to proportion 
per se, since the lower the frequency of 
a shorter PI, the greater the proportion 
of times it is preceded by a longer PPI. 
Thus it is necessary to reexamine this 
relationship with the value of the PPI 
held constant. In Figure 2 the rela- 
tionship between RT and relative fre- 
quency for each PI is plotted separately 
for those trials on which the PPI was 
the same (s) value as the PI and for 
those trials on which it was the other 
(o) value. For example, the curve 
labeled 1, is based on those trials on 


? Where ambiguity might otherwise occur, 
subscripts will be used on the 3-sec. PI to 
denote the PI with which it was paired. 
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Fic. 2, Reaction time as a function of the 
relative frequency of the PI plotted separately 
for each combination of PI and PPI. (The 
numerals refer to the duration of the PI. 
The letters denote whether the PPI was the 
same [s] value as the PI or the other [o] 
value.) 


which the PI was 1 sec, and the PPI 
was 3 sec. For the 1-sec. PI, the slope 
of the function relating RT with the 
relative frequency of the PI is signif- 
icantly different from zero both when 
the PPI is the same (1,) and when 
PPI is the other (1,) interval (p < .02 
and p<.01, respectively). These 
slopes also differed from slopes of the 
comparable 3,-sec. PI curves (P < .01 
and p < .02, respectively). Thus, in 
this case Ss' readiness to respond at the 
appropriate time on trials with the 
shorter of the two PIs was clearly a 
function of the probability of occur- 
rence of that PI irrespective of the 
value of the PI on the preceding trial. 
Although the effect of the PPI was 
small, it was significant. The RT for 
the 1-sec. PI averaged over relative 
frequencies of .3—.9 when the PPI 
was the other (3-sec.) interval is 6.4 
msec. slower than when the PPI was 
the same (l-sec.) interval (p < .05). 
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The slopes of the curves for the 310- 
sec. PI with the PPI held constant 
(lower part of Fig. 2) are somewhat 
reduced from the overall curve of Fig. 
1, and they are not significantly differ- 
ent from zero (p < .10 when the PPI 
was the same and p < .20 when the 
PPI was the other interval). How- 
ever, in the case where the PPI is the 
same interval the lack of significance 
is accounted for by one quite aberrant 
S, 9 of the 10 Ss showing a negative 
relationship between RT and relative 
frequency. By this nonparametric cri- 
terion p < .05. The slopes of the 310- 
sec. PI did not differ significantly from 
the slopes of the comparable 10-sec. 
PI curves although the differences were 
in the expected direction (p < .20 and 
p> 20 for 3, and 3,, respectively). 

The effect of the PPI on RT on 
trials with the 3;9-sec. PI was much 
more marked than in the case of the 
l-sec. PI. The difference between 3, 
and 3, averaged over relative frequen- 
cies of ,3—-9 was 199 msec. (p< 
01). This was significantly greater 
than the difference found between the 
two l-sec. PI curves (p < .01). 


To summarize these findings, it has 
been shown that both the relative fre- 
quency of the shorter PI and the length 
of the PPI influence the effect of the 
immediate PI on RT. The effect of 
relative frequency was more pronounced 
in the case of the shorter pair of intervals 
while the effect of the PPI was more pro- 
nounced in the case of the longer pair. 
There appears to be no marked inter- 
action between the effects of the PPI and 
relative frequency. Since these two vari- 
ables had differential effects on the two 
pairs of PIs, it is likely that they influence 
RT by different mechanisms. 

The relationship between RT and rela- 
tive frequency for the shorter PI in each 
pair may be reasonably attributed to 
variations in the “expectancy” that a 
short PI will occur on the assumption 


that with a higher degree of expectancy 
S will be more likely to adopt a maximum 
readiness to respond relatively soon after 
the warning signal. The evidence here 
indicates greater effects of expectancy 
the shorter the PI. 

Tf this is true then the effects of the 
PPI would seem not to be attributable 
to expectancy, since there is no reason 
to believe that a longer PPI would influ- 
ence expectancy per se any more for one 
pair of PIs than another. An alternative 
hypothesis based on a suggestion of Bot- 
winick (1959), is that a long PPI leads 
to an overestimation of the duration of 
a shorter PI, even when the short PI is 
highly probable. Thus a long PPI would 
have an effect similar to an "anchor" in 
time-estimation studies (Goldstone, Lha- 
mon, & Boardman, 1957). This hypothe- 
sis is consistent with the present finding 
of a greater effect of the PPI in the 
longer pair of intervals since it would 
be expected that the longer PPI would 
produce greater "anchor" effects. This 
hypothesis could be tested further by re- 
ducing the uncertainty of the occurrence 
of a short PI to zero and testing for a 
PPI effect. 

Another bit of relevant data concerns 
the effects of PIs antedating the PPI. 
Here, RT for each value of the PI and 
for each relative frequency was examined 
in relation to the number of consecutive 
same or other PIs which preceded the 
PI (ie. the length of run of the PPI). 
The only significant effect was found with 
the 1-sec. PI where RT was a decreasing 
function of the length of run of the same 
(1-sec.) PPI. Although this relationship 
was significant for all relative frequency 
conditions combined (5 « .001), the most 
marked and only significant relationship 
for an individual relative frequency oc- 
curred when it was .9 (p «.001). For 
the 3,9-sec. PI the most marked trend 
also occurred in the case of the .9 relative 
frequency, but this was not significant. 

In terms of the time-estimation hy- 
pothesis offered above, the data for the 
l-sec. PI suggest that successive trials 
with a short PI increasingly overcome 
the anchor effects of the long PI which 
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preceded the run. The smaller influence 
of the length of run for relative frequen- 
cies below .9 may be due to the fact that 
the average length of run is shorter at 
these frequencies. Under these condi- 
tions there may be a tendency for repeti- 
tions of PIs of the same value to increase 
S’s expectancy for the other PI. The 
failure to find any effects of the length 
of run of the PPI when the latter was the 
other interval makes it necessary to as- 
sume that one long PPI impairs time 
estimation to the same extent as several 
long PPIs. 

Drazin (1961), on the basis of earlier 
suggestions of Woodrow and Karlin, has 
formulated an “adaptation hypothesis” to 
account for the effects of the PPI. This 
states that on the average readiness be- 
comes maximum at about the mean inter- 
val, but this maximum tends to be reached 
earlier or later according to the length 
of the PPI. The present data are not 
inconsistent with this hypothesis but ex- 
tend it in two ways. First, they show 
that the point of maximum readiness is 
influenced by the relative frequency of 
the shorter PIs in the series as well as 
by the PPI. Second, they suggest two 
different mechanisms by which this point 
of maximum readiness may be influenced : 
by errors in time estimation and by the 
degree to which a particular PI is “ex- 
pected." 

The present data are pertinent also to a 
methodological issue regarding the use 
of check trials in a constant PI procedure. 
It is to be noted from Fig. 2 and from 
the data on the effects of length of run 
that even a small proportion of relatively 
long PIs in the series lengthens RT on 
short PI trials. Some improvement may 
be gained if the PI on the check trial is 
only slightly longer than the constant PI, 
or if the short PI trial just following the 
check trial is excluded from any calcu- 
lations, but the present data show that 
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the effects of the long PI persist even 
beyond this just subsequent trial and 
occur when the less frequent (check) 
PI is relatively short. This suggests 
that the use of check trials may influence 
any attempt to determine a minimum RT 
for a given PI, the extent of the influence 
depending on their frequency and on the 
length of the check PI with respect to 
the constant PI. Therefore, some other 
method may be more appropriate in pre- 
venting or minimizing “anticipatory” re- 
sponses, such as the one used by Klemmer 
(1956, p. 180) of beginning a series of 
trials over again when a clearly antici- 
patory response occurred. 
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IDENTIFICATION VERSUS SAME-DIFFERENT JUDGMENT: 
AN INTERPRETATION IN TERMS OF UNCORRELATED 
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2 studies are reported concerning the relation between identification of 
a single tachistoscopically presented stimulus and the discrimination of 
pairs of stimuli presented at comparable exposure durations. The re- 
sults of both studies show that accuracy of identification of a single 
item is higher than accuracy of discrimination of 2 items as same or 
different. A simple model is proposed which assumes errors of identifi- 
cation of 2 simultaneous forms are independent. The predictions based 
upon the model fit the obtained accuracy scores and permit the predic- 
tion of simultaneous discrimination accuracy on the basis of single 


identification thresholds. 


Pierce (1963) studied the effects of 
discrimination vs. identification tasks 
using words as stimuli and found that 
identification thresholds for singly pre- 
sented words were consistently lower 
than were same-different judgments of 
simultaneous word pairs. The main 
purpose of the present study was to 
replicate these findings using a differ- 
ent sample of stimulus materials and 
to propose an interpretation of identifi- 
cation vs, same-different judgments in 
terms of uncorrelated perceptual error. 

An assumption of independence of 
sensitivity of different foveal areas at 
a given moment in time can account 
quite well for the finding that identifi- 
cation of forms is better than same- 
different discriminations. If percep- 
tual error is uncorrelated for separated 
retinal areas, then the accuracy of per- 
ception of two forms at a given moment 
should reflect this independence. One 
of the simultaneously presented forms 
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may be perceived while the other is a 
blur, both may be seen, or neither form 
may be clearly perceived. Also, a 
single form, if it covers a large retinal 
area, may not be equally clear over its 
total extent. Consequently, when S 
must make a same-different judgment 
of two forms presented simultaneously, 
his basis of judgment may vary from a 
perception of two clear forms through 
several permutations of clarity of one 
or the other form, to the perception of 
two blurred images. At low stimulus 
energies, when perceptual error is pres- 
ent, we are proposing that the proba- 
bility of perceiving both of two simul- 
taneous forms is the same as the 
probability of seeing each of two forms 
presented successively and further that 
both these probabilities are independ- 
ent. We also assume that a same- 
different judgment requires S to per- 
ceive both stimuli at some minimal 
level of clarity to allow an accurate 
judgment, 

The concept of independent error 
permits a prediction of the difference 
between identification accuracy and 
same-different discriminations with a 
simple probability model. The S's hit 
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rate (HR) on form identification un- 
der a two-choice condition can be 
broken into two components, one due 
to correct perception of the form and 
the other to correct guessing. In the 
situation where the a priori probabili- 
ties of a correct guess are .5, the best 
estimate of the number of times he 
guessed correctly is the same as the 
"number of times he guessed incorrectly. 
For a simple model we can assume 
that S was in either of two states 
during an experimental session: State 
A when the stimulus was perceived 
and State B where S failed to perceive 
the stimulus but guessed. Stated sym- 
bolically, the probability of S being in 
State A on a given trial is: Pca) = 
HR — Py, where HR is the hit rate 
proportion and Pw) the proportion of 
wrong responses when the probability 
of a correct guess is .5. 

If the assumption is made that S 
must perceive both forms in order to 
make a correct same-different discrimi- 
nation, a prediction of same-different 
discrimination accuracy can be made 
from S's observed accuracy on identi- 
fication in the following way: The 
proportion of time both forms are per- 
ceived is obtained by Pca)’. This value 
tells us the proportion of times both 
stimuli are perceived assuming that the 
errors on the two foveal areas are 
uncorrelated. Since on the remaining 
trials his probability of a correct guess 
of same or different is .5, a guessing 
component needs to be added to Pos: 
Stated symbolically, HRs» = Pa? + 
ill = Pua]?. 


EXPERIMENT I 
Method 


Subjects —Six undergraduates at the Uni- 
versity of Illinois, four female, served as 
paid volunteers. 

Apparatus and stimulus materials—Two 
sets of random shapes, one containing three 
five-turn figures and the other three 20-turn 


figures, were used (Attneave, 1957). The 
figures were photographed on white glossy 
paper, ¢ in. maximum size, and subtended 
.33 of visual angle. A three-field Scientific 
Prototype Model GA tachistoscope, its lumi- 
nances measured with a MacBeth illuminom- 
eter and timing periodically checked for 
calibration by a dual-beam oscilloscope and 
fast acting photocells, was used in the 
experiment. 

Procedure. —The adapting field of the 
tachistoscope was dark except for a faintly 
glowing x fixation point subtending .33° of 
visual angle. The stimuli were presented in 
Fields I and II, the exposure fields, at a 
luminance of .3 apparent footcandles (ftc.) 
under two conditions. In the identification 
condition the stimulus form was presented in 
Field I or II and the other field was blank 
but lighted. The blank field was turned on 
and terminated simultaneously with the 
stimulus presentation and termination in the 
other field in order to insure adequate con- 
trast ratio control for the two conditions, 
identification and discrimination. The side 
of the fixation point, right or left, on which 
the form appeared was randomized between 
trials and within Ss. The form occurred .25° 
of angle above and .25° of angle to the right 
or left of the fixation point. In the dis- 
crimination condition two forms were pre- 
sented simultaneously. The form in Field I 
occurred to the left of the fixation point and 
the form in Field II to the right of it. The 
forms were always .25° of angle above the 
fixation point but the distance between them 
was systematically varied. Three degrees of 
separation were studied: 65°, 1.0%) and 1.5*. 

Prior to beginning the experimental ses- 
sions all Ss were given four 1-hr. practice 
sessions. All Ss practiced identification of 
all six forms during the first three sessions. 
During the fourth practice session half the 
Ss practiced identification of five-turn (sim- 
ple) forms and half of complex (20-turn) 
forms. The Ss had numbered photographs 
of the forms available throughout all sessions 
and made their identification response by call- 
ing the number. During this fourth practice 
session an exposure duration was chosen for 
each S that resulted in aproximately 75-80% 
identification accuracy. Each S was then 
given 48 identification trials for the three 
forms (simple or complex) at the selected 
exposure duration and his obtained HR was 
used as the identification probabilities. The 
exposure duration used during the identifica- 
tion task was also employed for the stimulus 
presentations of same-different judgments on 
the following three experimental sessions 


22 C. W. ERIKSEN, H. L. MUNSINGER, AND T. S. GREENSPON 


Upon completion of the sessions, during 
which they had been making discrimination 
judgments of same-difference, all Ss were 
given two additional practice sessions to de- 
termine a threshold duration for the counter- 
balanced set of forms. The following three 
experimental sessions were again devoted to 
same-different judgments on simultaneously 
presented pairs of the forms exposed at the 
identification threshold duration. 

During the discrimination trial Ss made 
same-different judgments of two simulta- 
neously presented forms. On one-third of 
the trials the two forms were separated by 
.5? of visual angle, on a second third by 1° 
of angle, and on the remaining trials by 1.5° 
of angle. Only one of these degrees of 
separation was used in a given experimental 
session with the order of different separations 
counterbalanced across Ss. During each ses- 
sion $ was presented with all possible pair- 
ings of the six forms. 

In summary, at each angle of stimulus 
separation S was presented 108 pairings 
where the stimuli were the same and 108 
where they were different. Fifty-four of 
each of these pairings were presented at the 
exposure duration used in obtaining his iden- 
tification threshold for simple forms and 54 
at the exposure duration under which his 
identification for complex forms had been 
obtained. Of the 108 pairings where the 
same forms were paired, half were of sim- 
ple forms and half of complex. Of the 108 
different pairings, 36 were different simple 
forms, 36 different complex forms, and 36 
were pairings of a complex form with a 
simple form. 


EXPERIMENT II 


This experiment was carried out 
after the data from Exp. I had been 
analyzed. Since certain of the findings 
from Exp. I were somewhat surpris- 
ing, it was thought desirable to attempt 
to replicate the experiment with ex- 
perienced Ss. 


Method 


Subjects —T wo experienced male Os served 
as Ss in this study. 

Method and procedure—They were the 
same as in Exp. I with the following excep- 
tions, Only two retinal separations of the 
stimuli were employed, .5° and 1.5? of angle. 
In addition to five practice sessions each S 
sat for four experimental sessions during 


each of which he made 72 same-different 
judgments and 48 identification judgments 
under a given degree of visual angle separa- 
tion of the stimuli. During an experimental 
session the identification trials were inter- 
mixed with the discrimination or same-differ- 
ent trials. To make the identification judg- 
ment a two-category decision, as was the 
same-different judgment, on randomly se- 
lected trials only a single form was presented 
and E gave S two alternatives from the set 
of three forms from which he was to choose. 
For one of the Ss the first two experimental 
sessions required identification only of simple 
forms and the last two sessions, separated by 
two practice sessions, of complex forms. 
The reverse procedure was followed for the 
other S. This change in procedure equated 
identification with same-different discrimina- 
tion in terms of chance level of success and 
eliminated any possible effects of practice or 
order on the differential accuracy of the two 
types of judgments. It also made possible 
the computation of an identification threshold 
for the different degrees of retinal separation 
of the stimuli, .5? and 1.5°, since on the 
identification trials the single form occurred 
at the same locus with reference to fixation 
point as did the forms presented for same- 
different judgments. 


Results 


It had been anticipated that a longer 
exposure duration would be required 
for the complex forms to achieve iden- 
tification accuracy equal to that ob- 
tained for simple forms. However, no 
reliable difference in exposure dura- 
tions was obtained for identification of 
complex and simple forms. Since there 
were no significant nor appreciable 
order effects, these classifications were 
disregarded in the analyses. 

Figure 1 shows the percentage of 
correct judgments under the identifi- 
cation and the discrimination condi- 
tions at the three different degrees of 
retinal separation for the six Ss in 
Exp. I. Since the identification-condi- 
tion values are based upon a three- 
choice situation, whereas the same-dif- 
ferent judgments are based upon two, 
it was necessary to correct the identi- 
fication judgments by the following 
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equation: Identification = 75X + 25, 
where X is the percentage of correct 
identifications obtained with three 
choices. This formula corrects for 
differences in chance success between 
three-category and two-category judg- 
ments. 

Figure 1 shows that identification of 
a single form is much more accurate 
than detection of same or different of 
two simultaneously presented forms 
F (3,15) = 1485, € 001. All main- 
effect differences were accounted for 
by the difference between identification 
and the three same-different judg- 
ments. There was no significant dif- 
ference between the three degrees of 
retinal separation in terms of same- 
different accuracy, although the trend 
is for judgments at wider separations 
to be less accurate. 

Comparable results were obtained in 
the replication. As is seen in Fig. 2 
both Ss show appreciably higher accu- 
racy on the identification trials and 
again a trend for them to be less accu- 
rate at the wider retinal separations 

3 All six Ss had uncorrected percentages 
of identifications that were higher than any 
of their same-different judgments. 


on their same-different judgments. In 
addition there is a decrease in identifi- 
cation accuracy when the forms are 
presented further from the fixation 
point. 

An analysis of the accuracy in de- 
tecting differences between simple form 
pairs, complex pairs, and simple-com- 
plex pairs showed no significant effects. 
The mean percentage of error in judg- 
ing differences was 28% for each of 
the three kinds of pairings. These 
values have been averaged through the 
three retinal separations. The corre- 
sponding percentages of error on differ- 
ent judgments averaged for the two Ss 
in Exp. II. were 38% for simple form 
differences, 24% for complex forms, 
and 30% for simple-complex form 
pairings. 

Figure 3 shows the results of an 
analysis of the percentage of incorrect 
different judgments as a function of 
the degree of retinal separation of the 
paired stimuli, As can be seen, the 
number of incorrect “different” judg- 
ments increases rather markedly with 
increasing retinal separation of the two 
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forms for Exp. I Ss, F (2, 10) = 5.0, 
p «05. The data for the two Ss in 
Exp. II are also shown in Fig. 3 and 
conform quite well with the average 
data of the Exp. I Ss. 


Discussion 


These results show that Ss can identify 
a single form, selected from a small set 
of forms and presented at a brief constant 
exposure duration, appreciably more ac- 
curately than they can judge whether two 
simultaneously presented forms are the 
same or different. This finding confirms 
and extends that of Pierce (1963). 

One possible explanation for these data 
is in terms of an attentional process. 
Attention, if defined as eye movement, 
is an inapplicable explanation since the 
exposure durations used in this study of 
9-15 msec. make it impossible for S 
to begin eye movements. However, if 
attention is defined as an internal scan- 
ning mechanism and the input is thought 
to undergo immediate decay, then S may 
scan one item and not get to the second 
before it has decayed beyond his recogni- 
tion threshold. This is an intriguing, but 
not necessarily parsimonious explanation 
of these data. 
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A simpler explanation of these results 
can be based on an assumption of varying 
sensitivity of different areal parts of the 
visual system at a given moment in time, 
That is, different areas of the visual 
system represented by different foveal 
locations produce independent errors. 
Closely adjacent areas sharing essentially 
the same microenvironment may be ex- 
pected to show some correlation, but at 
some minimum separation the errors be-, 
come independent. Evidence of such un-e 
correlated errors on different foveal areawy 
has been reported for form perception Ins. 
Collins (1965) and Eriksen and Lapptht 
(1965). Eriksen and Lappin presersted 
one, two, four, or six simultaneous PEE 
titions of the same form randomly Sfice or 
on an imaginary circle around the cetitwo 
fixation point. The gain in recognition. 
accuracy by having two, four, and six 
Occurrences of the stimuli over a single 
occurrence was predicted quite closely by 
an assumption of uncorrelated error for 
different foveal locations separated by a 
degree or more of angle. 

If the model advanced in the introduc- 
tion is applied to the data of the present 
study, the prediction of same-different 
accuracy for the six Ss in Exp. I is 72%. 
This agrees quite closely with the ob- 
served judgmental accuracy at .5? and 1? 
separation and is slightly larger than that 
observed for 1.5°.4 This suggests that at 
-5° retinal separation the perceptual error 
was essentially uncorrelated. The over- 
estimation obtained for 1.5° separation 
could be expected since the identification 
accuracy used in the predictions was 
based upon forms located .5° apart. The 
data from Exp. II suggest that identifica- 


4The model proposed here will tend to 
somewhat overestimate the judgmental accu- 
racy to be obtained under the condition of 
independence or uncorrelated noise. This is 
due to the simplifying assumption that there 
are only two perceptual states in $: where 
he perceives the stimulus and where he 
guesses. A more refined model can be ob- 
tained by requiring Ss to give confidence 
ratings to their judgments which allow a 
finer differentiation of the different percep- 
tual states that can be empirically determined 
in Ss from trial to trial, 
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tion accuracy of single forms is somewhat 
poorer at 1.5* separation. Consequently 
the use of identification accuracy data for 
forms closer together will tend to give 
an overestimate for same-different accu- 
racy when the forms are 1.5° apart. 

The data from the two Ss in Exp. II 
provide a somewhat more appropriate 
test of the model for wider retinal spac- 

. ings of the stimuli. Applied to the data 
from this experiment, the model predicts 
same-different accuracy of 69% for .5° 
pf angle spacing and of 63.5% for 1:5? 
eparation and agrees quite well with the 

served accuracy of 71% and 63.3% for 
! "ese respective spacings. 

‘AN ot only does the concept of uncorre- 
navų perceptual error provide a satis- 
dovory prediction of same-different judg- 
anental accuracy, but it also accounts for 
the increasing number of “different” 
judgments as the retinal spacing of the 
stimuli becomes greater. If the degree 
of correlation is less as the stimuli are 
further separated on the fovea, the in- 
formation received from both form stimu- 
lations would be more and more discrep- 
ant due to the increasingly independent 
error. Thus the two forms are more 
likely to appear different to S at wider 
separations even if they are the same 
form. The concept of independent er- 
ror, in addition, agrees quite well with 
the introspective reports of Ss. They 
report that on some simultaneous trials 


only one form is seen quite clearly and 
on others only part of a single form is 
seen with the remainder of the form hazy 
or indefinite. These types of percepts 
would be expected if there were less than 
perfect correlation of the error in the 
perceptual system for different foveal 
areas. 

A somewhat surprising result of the 
present studies was the lack of a system- 
atic difference between simple-simple, 
complex-complex, and  simple-complex 
judgments. It was anticipated that judg- 
ments of different for pairings of simple 
and complex forms would be easier than 
for complex-complex or simple-simple 
pairings. However, the results provide 
no support for this supposition. 
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INDEPENDENCE OF SUCCESSIVE INPUTS AND 


UNCORRELATED ERROR IN VISUAL 
FORM PERCEPTION ! 


CHARLES W. ERIKSEN 


University of Illinois 


A model is presented for determining perceptual independence, defined 
as when information received on 2 successive form stimulations does 
not interact and the internal perceptual system error present on the 
separate stimulations is uncorrelated. 2 experiments are reported where 
the same stimulus form is presented on different foveal areas with lags 


different forms on the 2 stimulations. 


Three basic problems in visual 
perception can be studied through the 
following experimental arrangement. 
Simple forms are presented at an 
energy level sufficient for above 
chance identification over a number of 
trials but insufficient for 100% identi- 
fication. At this energy level the same 
form is presented to S twice with the 
interval between the two stimulations 
systematically varied. The ability of 
S to identify the form following the 
two presentations as a function of the 
delay interval between them can then 
be compared with his performance 
level for a single presentation. The 
successive form presentations are pre- 
sented on different foveal areas so as 
to avoid the problem of simple energy 
summation on the retina. 

The outcome of such an experiment 
will depend to some extent upon 
whether successive form presentations 
on different foveal areas interact. At 
certain time delays between the 
stimulations the perceptual system 
may sum the information received by 
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between the stimulations of 0-1500 msec. A 3rd experiment presented 7 
All 3 experiments indicate that he 

the successive inputs are independent even when separated by lags of ed 
less than 1 msec. The results are interpreted in terms of uncorrelated na- 
error or sensitivity at any given moment in time for different elements and 
in the visual perceptual system represented by different foveal areas. 2 or 
vo 

the successive stimulations. Alterna- 


tively, the interaction might result in 
an interference between the successive 
inputs taking the form of forward and 
backward masking phenomena (Raab, 
1963). 

A second and related determinant 
of the outcome of such an experiment 
involves the rate at which the visual 
system is able to assimilate stimulus 
inputs that fall on nonidentical foveal 
areas. The psychological refractory 
period may involve a sensory or 
perceptual rate component as well as 
response factors (Reynolds, 1964). 

Both the interaction and the rate 
formulations are interwoven with a 
third problem, the time characteristics 
of error components or varying sensi- 
tivity in the visual perceptual system. 
This becomes particularly important 
in determining the interval between 
stimulations where the error present 
on the first presentation is uncor- 
related with the error at the second 
signal occurrence. 


EXPERIMENT | 


A previous study (Eriksen & Hoff- 
man, 1963) found that two brief 
stimulations of a form with delays 
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between stimulations from 0 to 450 
msec, resulted in approximately a 7% 
increase in recognition over that 
obtained for a single stimulation. 
There was a suggestion in these data 
that the gain from double stimulation 
at 0- and 5-msec. lags was not as great 
as at 450 msec. While the difference 
.did not approach significance it is in 
the range of previously reported back- 
ward masking effects (Raab, 1963). 
Mthough in subsequent work we have 
slated two bases of masking effects 
, triksen & Collins, 1965; Eriksen & 
' appin, 1964), neither of which would 
ave contributed to the results of 
double stimulation, nonetheless it 
appeared desirable to explore care- 
fully the 0-100 msec. delay range on 
the double stimulation phenomenon. 


Method 


Subjects.—Six volunteer undergraduates, 
two female, were paid for their services in the 
experiment. 

Procedure.— The stimulus materials, equip- 
ment, and experimental procedure were es- 
sentially the same as employed in the Eriksen 
and Hoffman (1963) study. In brief, a 
three-field | tachistoscope consisting of an 
adapting field and two stimulus exposure 
fields was employed. The adapting field was 
used to vary the pre- and postadaptation 
brightness and the forms, consisting of the 
capital letters A, T, and U, were presented in 
Fields I and II, the exposure fields. In 
addition to permitting precise control of the 
duration of exposure in Fields I and II, this 
apparatus also allowed precise control of the 
interval between termination of stimulation 
in Field I and onset of stimulation in Field II. 

During the experimental sessions three 
kinds of stimulus presentations were used: 
(a) a form presented in Field 1 followed by 
light alone in Field IT; (b) a form presented in 
Field II preceded by light alone in Field 1; 
(c) double stimulation where the same form 
was presented in Fields I and II. Delay 
intervals or lag times between termination of 
stimulation in Field 1 and onset of Field H 
of 0, 25, 50, 75, and 100 msec. were studied. 

The three methods of form presentation 
at the various lag times were investigated 
under two experimental conditions, Under 


Cond. D the adapting field was dark and the 
interval between exposures in Field I and II 
was dark also. Under Cond. L the adapting 
field was illuminated with .58 apparent foot- 
candles (ftc.) and remained on during stimula- 
tion from the two exposure fields and the 
interval between them. 

Each S sat for 10 40-min. experimental 
sessions plus four practice sessions, In an 
experimental session each of the three forms 
was presented six times under each of the 
three methods of stimulus presentation and 
at each of the fivelag times, Four successive 
sessions were devoted to judgments under 
Cond. D and four under Cond. L and within 
each session the three methods of presenta- 
tion and the five lag times were counter- 
balanced, Three of the Ss began the task on 
Cond. L and three on Cond. D. 

Prior to beginning the experimental judg- 
ments under a given condition each S had 
two practice sessions devoted to obtaining à 
function showing his identification accuracy 
as a function of duration of exposure for the 
particular condition of pre- and postadapting 
field obtaining for that particular condition. 
In the practice sessions the stimulation se- 
quence was: adaptation field (light or dark 
depending upon the condition), exposure of 
one of the three forms, and then return to the 
adapting field. The three forms were presented 
randomly and the duration of expsoure was 
varied systematically to obtain a range of 
recognition varying from chance (334%) to 
approximately 907%. The S was required to 
make a judgment of A, T, or U following each 
stimulation sequence and express his confi- 
dence in his judgment by the numbers 1, 2, or 
3 corresponding to “confident,” “think so,” 
and "guess." To insure comparability of the 
two exposure fields, half of the presentations 
occurred in Field I and half in Field II in a 
counterbalanced order. On the basis of the 
data obtained from these base-rate determina- 
tions, a level of duration was selected for 
stimulus exposure in Fields I and II for the 
condition and for the individual S that yielded 
recognition of approximately 50-55%. The 
duration levels remained constant for this 
particular S for all judgments made under that 
condition. Due to the difference in pre- and 
postadapting fields the duration of exposure 
of the stimulus fields necessary to obtain 50% 
recognition accuracy in each of the two con- 
ditions was quite different. For Cond. D the 
average base-rate duration level was 15.2 
msec. and for Cond. L 58 msec. Each ex- 
perimental session was preceded by a T-min. 
period of visual adaptation to the task during 
which time 5 made practice judgments, 
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The S was instructed that he was partici- 
pating in a study of form perception and that 
on each trial one of the three letters A, T, or 
U would be presented. Further, on some 
occasions the letter would occur in the flash 
of the first field, other times in the flash of 
the second field, and on some trials the same 
letter would occur in both fields. He was 
further informed that these methods of 
presentation would vary randomly and also 
that the interval between flashes would vary. 
It was stressed that on those trials where a 
letter appeared in both fields it would always 
be the same letter. 

The exposure fields under both conditions 
had a luminance of .20 apparent ftc. The 
cross used as a fixation point (black in Cond. 
L and luminous in Cond. D) and the three 
letters each subtended 12 min. of visual angle. 
The letter forms occurred on the points of an 
imaginary square of 1.25? of angle centered 
on the fixation point. Which point the 
stimulus occurred on varied randomly from 
trial to trial with the restriction that for the 
double stimulation the same position would 
not occur for both form occurrences. 


Results and. Discussion 


Percentage of correct form identi- 
fications were analyzed in a four-way 
classification analysis of variance 
(Conditions, Lag Intervals, Single vs. 
Double Stimulations, and Ss). The 
main effects due to Lags, Single vs. 
Double Stimulations, and to Ss were 
significant (p < .01). Also the inter- 
action between Conditions and Lags 
was significant. The significant S 
effects as well as a significant .S 
X Condition interaction are attribut- 


Condition D. 


© Field 1 
o Field Zo S x 

Ie 
x Double / 


Condition L 


60) Nd adsl 
i 


ae 
sg ges 


4 


bassi cape 


Bete 0 25 60 75 100 


Bose 0 25 50 75 100 
Level. 


Log Interval [n Milliseconds 


Fic. 1. Identification as a function of 
method of stimulus presentation and lag 
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able to our inability to match exactly 
each S at a 50% identification on the 
base-level series within and between 
the conditions. 

In Fig. 1 percentage of identifica- 
tions are shown as a function of single 
stimulation in Fields I and II and 
double stimulation for the two condi- 
tions as a function of lag time. The 
results for both conditions, D and L, 
show very good agreement with the 
results from corresponding conditior 
in the previous study (Eriksen JA 
Hoffman, 1963). In Cond. D thereg 
a marked drop in identification f- 
forms presented in either Field I or I1, 
relative to the base level, for lag 
intervals under 50 msec. These re- 
sults are in accord with the previously 
demonstrated temporal luminance 
summation—contrast reduction effect 
(Eriksen & Lappin, 1964; Eriksen & 
Steffy, 1964). Here a single form 
presentation is either preceded or 
followed by the flash of the luminance 
of the other exposure field. At short 
delay intervals the luminance of the 
empty exposure field summates with 
the luminance of the field containing 
the form, resulting in a reduction of 
the figure-ground contrast for the 
form with a consequent impairment in 
identification. 

Condition D was employed in the 
present study as a means of experi- 
mentally manipulating the identifica- 
tion level for single form stimulations. 
As can be seen, identification of the 
double stimulation follows closely the 
variation of identification for single 
stimulations in either exposure field 
although the double stimulation shows 
a consistent superiority. 

_ In Cond. L the illuminated adapt- 
ing field upon which the luminances 
of the exposure fields are superim- 
posed maintains the contrast of the 
forms at close to a constant level over 
alllag times. Thus, identification of 
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single form stimulations does not 
vary systematically with lag time. 
Again, however, there is a consistent 
superiority of the double stimulation 
over single stimulation in either field. 


To see more clearly the effects of 
double stimulations relative to single, the 
data from Cond. D and L have been 
averaged in Fig. 2. To facilitate com- 
parison across experiments, the data 
from Eriksen and Hoffman (1963) from 
the relevant two conditions are also 
shown in Fig. 2. The data from the two 
experiments are quite consistent. To- 
gether they suggest two conclusions. 
The first is that a double stimulation 
results in a small but consistent gain in 
identification relative to a single stimula- 
tion in either of the two exposure fields. 
The second and more interesting con- 
clusion is that the amount of gain result- 
ing from a double stimulation appears to 
remain constant whether the second of 
the stimulations occurs immediately 
upon termination of the first or is delayed 
as long as 450 msec. This latter finding 
indicates that there are no interfering 
effects between the two stimulations at 
short delay intervals that are different 
from those that occur at longer lag times. 
However, before interpretation in terms 
of either a facilitation or interference or 


even of correlated error extending 
throughout these lag times is advanced, 
it is first necessary to ascertain what the 
gain from the double stimulation would 
be if the information received on each 
stimulation was independent of the other. 

Equation 1 is the formula for pre- 
dicting the occurrence of one or the 
other or both of two events, given 
that they are independent. 


Py + Ps — (SP) [t] 


This formula has been found to con- 
sistently overestimate the level of per- 
formance on auditory detection studies 
using the “multiple look” method (Swets, 
Shipley, McKey, & Green, 1964). In 
the situation where S has two oppor- 
tunities to perceive the stimulus and 
makes a single judgment based upon 
these two observations, the formula has 
the effect of giving double weight to the 
guessing component in S's hit rate (HR) 
for a single presentation. 

When a correction for guessing is 
applied, the assumption is made that on 
any given trial S can be in either of two 
states. In State A he perceives the 
stimulus and his HR when in this state 
is 1.00. In State B he perceives nothing 
and his HR in this state is the a priori 
probability of a correct guess. If S is 
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given two independent observations or 
trials before making his judgment, the 
expected joint occurrences of States A 
and B for the two trials can be obtained 
from the multiplication of the following 
matrices: 


Trial 1 Trial2 


[jor [ofa] = PaayPaa 
Pay Paca Paso 


If the assumption is made that if S 
is in State A on either or both trials the 
probability of a correct judgment asso- 
ciated with State A will obtain, the 
predicted HR for two independent 
observations is computed by summing 
the percentage of the double trials (or) 
from the multiplied matrix where State 
A occurs on either trial or both and 
multiplying the sum by the State A HR. 


HRpr= 1.00[ Pc. A) PaB - Pci] 
1 
+ [Paw] [2] 


A two-category model, unfortunately, 
is inadequate. Practiced Ss on most 
detection and recognition tasks have no 
difficulty in differentiating at least three 
different subjective states of confidence 
in their judgments over a series of trials. 
If asked to rate their confidence on 
each trial using a 3-point scale (“con- 
fident," "think so," and "guess") most 
Ss find this task easy. That these 
subjective confidence ratings reflect dif- 
ferent perceptual states or differing com- 
pleteness of information resulting from 
a trial, is reflected in the differential HR 
associated with the different confidence 
ratings. The HR is quite high on con- 
fident judgments and decreases progres- 
sively with lower degrees of confidence. 

It can be shown that a two-category 
model based upon two states of the S, 
one state where he perceives with an 
HR of 1.00 and a second state where he 
guesses, will consistently overpredict the 
HR based upon two independent trials 
or opportunities to perceive if the actual 
perceptual states of S are greater than two 
and have associated HRs of less than 1.00. 

The following model is proposed to 


assess the accuracy level to be expected 
when S is given two trials or presenta- 
tions of a stimulus at a constant energy 
level and the trials are independent. 
Independence is achieved when the in- 
formation received on the two trials does 
not interact and the internal error factors 
or sensitivity present on one trial is un- 
correlated with those on the other trial. 

It is assumed that the presentation of 
the stimulus results in oné of three 
possible perceptual states, A, B, or C, 
occurring in S which in turn can be 
measured by the confidence rating S 
assigns to his judgment on the given trial 
State A corresponds to judgments rated 
as "very confident," State B as “think 
so,” and State C to "guess," The rela- 
tive frequency of occurrence of these 
three states during an experimental ses- 
sion are determined as well as the 
differential HR associated with each 
state. Stated symbolically: HR =aP (4) 
+bP(B)+cP(C) in which HR is the 
proportion of correct judgments during 
an experimental session and P(A), P(B), 
and P(C) are the proportion of times S 
is in State A, B, and C, respectively. 
The symbols a, b, and c are the respective 
HR or proportion of correct judgments 
associated with the respective perceptual 
states. 

If an S is to be given two trials (I and 
II) or stimulations upon which to base 
his judgment, the level of accuracy to be 
expected under the conditions of inde- 
pendence as above defined will be deter- 
mined by the joint occurrences of the 
three perceptual states and their com- 
binations. Thus the probability of S 
being in State A on Trial I [P(Ar)] and 
in State C on Trial II [P(Cr)] and the 
remaining eight combinations can be 
determined by multiplying the two 
matrices. 


Trial 1 Trial 2 

P(A) [P(Ar)P(Bi)P(Ci)] 
P(B1) 

P(C1) 


P(Ar- Arr) P(Bi- Ar) P(Cr- Art) 
= P(Ar- Bi)P(Bri- Br) P(Cr- Bi) 
P(Ar- Cr) P(Br- Cr) P(Cr- C11) 


^ 
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At this point one further assumption 
is necessary. If a double stimulation 
results in two different perceptual states 
such as Ay and Cri, S’s judgment will be 
determined by the state representing the 
highest HR or confidence. This assump- 
tion is consistent with the definition of 
independence where the information 
received on the two stimulations does 
not interact. 

In keeping with this assumption the 
proportion of correct judgments resulting 
from the double stimulation can be 
predicted by first summing the propor- 
tion of times that State A occurred on 
one or the other or both stimulations, 
‘and the proportions of double trials 
where State B, but not A, occurred on 
either or -both stimulations, Thus: 
P(Awmn) = P(ArAm) + P(Ar: Bii) 
+ P(As* Cir) + P(Br- Ar) + P(Cr: Ai) 
and P(BA1vir) =P (Br: Bri) +P (Br: Cr) 
+P (Cr: Bir) where P(Arvr1) is the pro- 
portion of double trials where an A state 
occurred on either the first or second 
stimulation or both, and P(BA1vi) the 
proportion of double trials where a B, 
but not an A state, occurred on the first 
or second stimulation or both. P(Cr.i) 
is of course the proportion of double 
stimulations on which State C occurred 
for both. 

The predicted correct judgments or 
HR under the double stimulation is then 
given by weighting the proportion for 
each of the three states by the respective 
HR associated with that state and 
summing.? 


HRiax = aP (Arv) 
+ bP(BAiwn) cP(Cra) [3] 


To determine the expected HR for 
double stimulations under the condition 


2]f due to experimental arrangement the 
signal strength is different on the second 
stimulation of a double trial, the joint propor- 
tions in the multiplied matrix must each be 
multiplied by the appropriate HR and then 
the double stimulation ZR is obtained by 
summing these multiplied values directly. 
The appropriate HR to multiply a given 
joint proportion is the highest HR associated 
with the two perceptual states in the joint 
Occurrence. 


of independence, the confidence ratings 
of all six Ss in the present experiment 
resulting from the presentation of single 
stimuli were pooled for Cond. L and for 
Cond. D at lags of 50 msec. and greater 
where single stimulation HR appears to 
be constant over lag intervals. The 
respective confidence categories of con- 
fident, think so, and guess, were used 
25, 29.5, and 45.5% of the time, re- 
spectively. The HRs associated with 
each of these confidence states were .76, 
.58, and .39, respectively. If these three 
confidence categories are used as esti- 
mates of the three perceptual states, A, 
B, and C, with their associated HRs, the 
predicted HR for double stimulation is 
61.8%. The average obtained double 
stimulation ZR over the lag intervals of 
50-100 msec., where the single stimula- 
tion HR is constant, is 61.1%. As is 
seen in Fig. 2, the predicted value of 
61.8% is also a very good fit for the double 
stimulation results of the Eriksen and 
Hoffman (1963) study. 

This close agreement between the 
observed accuracy for double stimula- 
tions and what would be predicted 
assuming that the two presentations 
were independent, suggests that even 
when the two presentations are separated 
by less than a millisecond theinformation 
does not interact and/or the error factors 
in the visual perceptual system are un- 
correlated for the two foveal areas. 


EXPERIMENT II 


An alternative explanation for the 
above data is that the model for 
independence is incorrect and that 
correlated error or sensitivity exists 
out to 450-msec. lag time. Also there 
may be interference between the suc- 
cessive inputs. (A facilitative inter- 
action between the two stimulations 
can be ruled out due to the small 
magnitude of gain achieved in identi- 
fication by this method of presenta- 
tion.) The present experiment was 
carried out to extend the lag intervals 
over 1.5 sec. asa check upon the model 
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of independence and the possibility 
that correlated error could exist for 
as long as $ sec. 


Method 


Subjects.—Four additional Ss, two female, 
were recruited and paid for participation in 
the experiment. 

Procedure.—Yhe experimental procedure, 
stimulus materials, and equipment were 
identical with that of Exp. I with the follow- 
ing exceptions. The data were collected 15 
mo, after Exp. I were completed, and .Ss 
did not make confidence judgments on the 
experimental task. A Scientific Prototype 
Model GA tachistoscope with slight modifica- 
tions was used in place of the previous equip- 
ment. The Ss were again run under Cond. 
D and L and the lag intervals chosen were 
50, 75, 500, 1,000, and 1,500 msec. 


Results and Discussion 
The percentage of correct identi- 


fications were again analyzed by a’ 


four-way classification analysis of 
variance (Conditions, Lag Intervals, 
Single Stimulation in Field I and I] 
and Double Stimulation, and Ss). 
Significance at the .01 level was ob- 
tained for the kind of stimulation and 
for Ss. Also the interaction of Lag 
Intervals X Single Stimulation in Field 
I, Field II and Double Stimulation 
was significant. Lack of significant 
effects involving the D and L condi- 
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Vic. 3. Identification for single and double 
stimulus presentations in Exp. II. 


tions was anticipated since no appreci- 
able luminance summation effects had 
been obtained in Exp. 1 at lag inter- 
vals greater than 50 msec., the 
shortest employed in the present 
experiment. 

In Fig. 3, percentage of correct 
identifications of the forms are shown 
as a function of Lag Intervals and 
whether the form appeared in Field I, 
Field II, or both. In keeping with the 
results from the analysis of variance 
indicating no condition differences, 
the data in Fig. 3 have been pooled 
over conditions. The interaction be- 
tween Lag and Kind of Stimulation 
appears to consist mainly of decreas- 
ing identification for forms presented 
in Field II at 1,000 and 1,500 msec. 
after the empty flash of Field I. In 
part at least, this may have resulted 
from inability of Ss to maintain fine- 
ness of fixation and accommodation 
over these intervals after triggering 
the beginning of a trial. 

Again, as in the previous experi- 
ment and in the Eriksen and Hoffman 
(1963) study, the double stimulation 
results in a consistent small gain in 
identification over a single presenta- 
tion in either exposure field irrespec- 
tive of lag interval. Applying the 
Studentized range test to the com- 
bined data for D and L, it is found 
that a single stimulation in either 
Field I or Field II differs significantly 
(p < .01) from the double stimulation. 

While the gain in identification from 
a double stimulation was expected, 
the major interest was whether the 
amount of gain increased with in- 
creasing lag interval. Figure 3 sug- 
gests that the increment from the 
double stimulation remains relatively 
constant over Lag Intervals from 50 
to 1,500 msec. Although the smallest 
gain occurs at 50 msec., this would 
appear to be a sampling fluctuation 
since it is not supported by the data 
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at 75-msec. lag nor by the data from 
lags of 0, 25, and 50 msec. in Exp. I. 
In fact the amount of increment 
in Identification resulting from the 
double stimulation throughout the 
0-1,500 msec. range is substantially 
the same as obtained for the 0- and 
25-msec. lags in Exp. I. 

. Unfortunately confidence ratings 
were not obtained from Ss in this 
experiment and those obtained in 
Exp. I are not precisely applicable due 
to a small difference in HR for single 
stimulations between the two experi- 
ments. The average single stimula- 
tion HR in Exp. I was 53.8% as 
opposed to 55.9% in the present 
experiment. Allowing for this some- 
what higher value, the obtained 
double stimulation ZR in the present 
experiment of 64.5% is reasonably 
close to what would be expected under 
the condition of independence. 


Lack of evidence for an increasing 
advantage of the double stimulation over 
lag intervals as long as 1500 msec. sup- 
ports the conclusion of Exp. I that 
successive form stimulations separated 
by an interval of less than 1 msec. are 
independent. It also adds indirect con- 
firmation to the appropriateness of the 
model for computing independence. If 
there was nonindependence (perceptual 
interference or correlation between er- 
rors) for successive form stimulations at 
very brief intervals, it would be expected 
that this interference and/or correlation 
would decrease with increasing separa- 
tion of the two stimulations. A decreas- 
ing interference would lead to increasing 
form identification with double stimula- 
tion. However the data in Fig. 3 
provide little support for an increasing 
gain with increasing lag intervals be- 
tween stimulations. 


EXPERIMENT III 


This experiment was carried out as 
a further check upon the conclusion 
that perceptual independence exists 


even at lag intervals of less than 1 
msec. between successive form stim- 
ulations. In the previous experiments 
the successive or double stimulations 
always consisted of a repetition of the 
same form in the second stimulation. 
If instead of presenting the same form 
twice, a different form is presented 
in each of the two exposure fields 
separated by varying lag times, per- 
ceptual independence would require: 
(a) that identification accuracy of a 
form presented in either the first or 
the second exposure field would be 
the same irrespective of whether the 
preceding or following exposure field 
contained a form or simply a blank 
illuminated field; and (b) that the 
probability of correct identifications 
of both forms presented in a double 
stimulation be the product of the 
probabilities of correct identification 
of a single stimulation in each of the 
two exposure fields where a form 
occurs in only the one field and in the 
other field the blank signal. 


Method 


Subjects.—Five additional Ss, three female, 
volunteered and were paid for their services. 
Procedure.—Procedure, stimuli, and equip- 
ment were the same as those employed in 
Exp. I with the following exceptions. Only 
two forms, A and U, were employed. During 
the practice sessions that preceded Cond. D 
and L an exposure duration necessary for 
10% forced-choice identification accuracy was 
determined for each S. This duration was 
then used for all subsequent stimulus pres- 
entations for that S under that condition. 
During the experimental sessions the four 
combinations of the two stimuli occurred 
equally often and on double stimulation trials 
were separated by lag intervals of 0, 25, 50, 
75, and 100 msec. Intermixed with these 
double stimulation trials were trials in which 
only a single stimulus occurred, followed or 
preceded by the empty flash of the other 
exposure field. Prior to each of these single 
stimulation trials S was told that only one 
stimulus would occur on that trial and thus 
he was to make only a single response. On 
the double stimulation trials S was required 
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to make a judgment for each of the stimula- 
tions. In addition to identification S was also 
required to specify which quadrant of the 
area surrounding the fixation point each form 
had appeared in. In making his judgments 
S used the scale of numbers from 1 to 6 with 
the responses defined as: 1— "confident it's 
an A," 2—"think it's an A," 3—"guess it's 
an A," 4— "guess it's a U,” 5—"think it’s a 
U,” and 6—“confident it's a U.” The 
quadrants in which the letters occurred were 
identified by S following his numerical re- 
sponse as upper left, lower right, etc. 


Each SS served a total of 10 experimental 
sessions, five under each condition, during 
Which he received 40 double presentation 
trials at each of the five lag times and 20 
trials for a single stimulation in Field I and 
20 for a single stimulation in Field II at each 
of the lag times. Three Ss began the session 
on Cond. D and two on Cond. L. Within a 
condition all five lag times and methods of 
stimulus presentation were counterbalanced 
within and across sessions, 


Results and Discussion 


It was originally planned to sepa- 
rately analyze the data from those 
trials where S correctly and incor- 
rectly judged the quadrants in which 
the forms had appeared. However 
location errors occurred so seldom 
that separate analyses were not 
merited. 

Percentages of correct identifications 
were analyzed by a five-way classi- 
fication analysis of variance (Condi- 
tions, Lag Intervals, Fields I and I il; 
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Single vs. Double Stimulation Trials, 
and Ss). The only effect significant 
at the .01 level was Ss X Condition 
interaction which again appears at- 
tributable to inability to equate 
exactly an S’s base-level performance 
across conditions. Significant at the 
-05 level were the main effects of Lags 
and of Fields. 

In Fig. 4 identification accuracy is 
shown as a function of lag interval, 
field in which the stimulus appeared, 
and whether the stimulus was fol- 
lowed or preceded by another stimulus 
(single vs. double stimulation trials). 
As is apparent in both conditions, 
recognition accuracy does not vary as 
a function of which field the stimulus 
occurs in and more importantly, 
recognition accuracy is essentially the 
same for either field whether or not 
the presentation of the stimulus in 
that field was preceded or followed by 
a blank flash or a form in the other 
field, F(1,4) = 1.8, p> .10. This 
finding is of major interest since it 
indicates that there is no interaction, 
interfering or facilitating, between 
two successive stimulations at the lag 
times investigated. 

The interaction between lag times 
and condition was not significant but 
the trend of the data in Fig. 4 is 
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consistent with that of Exp. I in terms 
of the luminance summation effect. 

The frequency with which Ss used 
each of the three confidence categories, 
confident, think so, and guess, were 
25, 30, and 45%, respectively. The 
associated HRs for judgments in these 
confidence categories were 84%, 76%, 
and 60%, respectively. These con- 
fidence frequencies and associated 
HRs were computed only for Cond. L 
where the percentage of identification 
for single presentations remained es- 
sentially constant across lag times. 

These confidence ratings and their 
associated HRs can be used in con- 
junction with the model for computing 
independence between successive in- 
puts or stimulations. Whereas before, 
the model was used to compute the 
summative effects of a double presen- 
tation of the same stimulus, the 
problem now is to predict the accuracy 
of identification of both forms pre- 
sented in a double stimulation. The 
predicted HR for correct identification 
jointly in both stimulations is given 
by: 
HRyn = [uP (Ax) + bP (B1) 

+ P (C1) ]EanP (Aix) 
+ buP (Bu) + euP(Cx)]. [4] 


As is readily seen, this reduces to the 
multiplication of the HR for the first 
stimulation by the HR for the second 
as obtained from single stimulation 
trials. 

In Fig. 5 the percentage of correct 
identifications of both forms presented 
in a double stimulation is shown as a 
function of lag interval and condition. 
Also shown are the predicted HRs for 
identification of both stimuli com- 
puted as indicated above. For both 
Cond. D and L the obtained HR 


identification accuracy is almost iden- 
tical with the level predicted under 
the assumption of perceptual in- 
dependence. 


In summary, the present experiment 
adds substantial support to the con- 
clusion from the previous two experi- 
ments that perceptual independence 
exists for successive form stimulations 
even when the interval between the 
stimulations is less than 1 msec. The 
results from this experiment taken in 
conjunction with the results from the 
other two experiments, seem to be most 
parsimoniously explained in terms of a 
concept of noncorrelated internal per- 
ceptual error at a given moment in time 
for form stimuli falling on foveal areas 
separated by 1.25° of angle or greater. 
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EFFECTS OF CS OMISSION FOLLOWING 
AVOIDANCE LEARNING* 


J. M. BLOOM? ann BYRON A. CAMPBELL 


Princeton University 


3 experiments investigated the effect of CS omission on avoidance 
learning in rats. Experiment I varied the number of avoidance trials 
preceding a constant number of CS-omission trials, and Experiment II 
varied the number of CS-omission trials following a constant number 
of avoidance trials. The CS-omission effect was not found in either 
experiment. Instead, a decrement in avoidance responding following 
the CS-omission trials was observed in all experimental groups. Ex- 
periment III investigated the effect of different periods of delay fol- 
lowing CS-omission trials to determine whether the response decrement 
was caused by an overall increment in anxiety level. The decrement 
in avoidance responding following the CS-omission trials did not dimin- 


ish with the passage of time suggesting it was not due to a heightened 


anxiety level. 


In a series of experiments, Kimble 
and Dufort (1956) and Kimble, Mann, 
and Dufort (1955) omitted the CS on 
a block of trials during classical eyelid 
conditioning, with the rather startling 
result that when the CS was reinstated 
the level of responding was not differ- 
ent from that of a control group which 
received the CS on every trial (CS- 
omission effect). In most cases, the 
CS-omission group performed in a 
manner indistinguishable from or even 
superior to the control group. The 
implication is that learning in this situ- 
ation proceeds in much the same man- 
ner when the US is presented alone 
as when the CS and US are paired. 
A subsequent attempt to replicate these 
findings, however, met with failure 
(Goodrich, Ross, & Wagner, 1957), 
and further research has indicated that 
experimental procedure plays an im- 
portant role in determining whether or 
not the CS-omission effect is obtained 
(Dufort & Kimble, 1958). 
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In view of these discrepant results, 
and because of the theoretical signifi- 
cance inherent in such findings (Estes, 
1960) it is essential to determine the 
generality of the CS-omission effect 
in learning situations other than classi- 
cal conditioning in human Ss. Ac- 
cordingly, three experiments were per- 
formed to investigate the effect of CS 
omission on avoidance learning in rats. 


EXPERIMENT I 


Experiment I varied the number of 
avoidance trials preceding a constant 
number of CS-omission trials. 


Method 


Subjects—The Ss were 80 male albino 
tats of the Wistar strain ranging in weight 
from 250 to 300 gm., obtained from Dierolf 
Farms, Boyertown, Pennsylvania. 

Apparatus—The apparatus has been de- 
scribed in detail elsewhere (Brush & Knaff, 
1959). In brief, it consisted of a shuttle 
box 16 in. long, 53 in. wide, and 9 in. high 
Which was divided into two similar com- 
partments by a partition containing a 3-in. 
Square top-hinged door. The rat's passage 
between the compartments was recorded by 
means of photocells located 21 in. from each 
side of the partition and 1 in. above the 
floor. The entire apparatus was housed in 
2 sound-shielded chamber. 
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The CS was a combination 6-w. light and 
85 (+1) db., 10-cps click. The US was a 
185-v. shock delivered to the grid floor by 
a 150 K ohm “matched impedance” shock 
source (Campbell & Teghtsoonian, 1958). 
The shock was scrambled through a 12- 
contact, 10-position stepping switch. 

Procedure.—There were six experimental 
groups and two control groups in the ex- 
periment; each group contained 10 Ss. All 
Ss received five pretraining trials during 
which only the CS was presented. The CS 
was terminated by Ss passage through the 
door to the opposite compartment. Follow- 
ing this phase, Group E-5 received five 
training trials consisting of CS and US with 
a 5-sec, interstimulus interval. The S could 
avoid the shock by crossing to the opposite 
compartment during the interval. This re- 
sponse also terminated the CS. On trials 
where S did not make an avoidance response, 
CS and US were simultaneously terminated 
by the animal’s interruption of the photocell 
beam in the opposite compartment. Fifteen 
trials were then given with the CS omitted, 
and the US was terminated by an escape 
response through the door into the other 
compartment. During this phase, S could 
not avoid the shock. Finally, each S re- 
ceived 15 additional avoidance trials, con- 
stituting the test phase of the experiment. 

Groups E-0, E-15a, E-15b, E-30, and 
E-45 were similarly treated, except that 0, 
15, 15, 30, and 45 avoidance trials, respec- 
tively, were given before the 15 escape (CS- 
omission) trials. Groups Ca and Cb received 
45 and 75 avoidance trials without inter- 
ruption. 

The experiment was run in two parts. 
In the first, Groups E-0, E-5, E-15a, and Ca 
were run, and in the second carried out 2 
wk. later, Groups E-15b, E-30, E-45, and Cb 
were run. Groups E-15a and E-15b were 
combined when statistical analysis indicated 
no significant difference between them. 
Groups Ca and Cb were also combined for 
comparison with the E-15 groups, but they 
were not combined at any other level of 
training. 


Results 


Figure 1 shows the mean percentage 
of avoidance responses in blocks of 
five trials for each of the experimental 
groups immediately before and after 
the CS-omission trials. Also shown 
is the percentage of responding for the 
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Fic. 1. Effect of 15 CS-omission trials 
on avoidance responding after 0, 5, 15, 30, 
and 45 prior avoidance conditioning trials. 
(The data shown are avoidance responding 
for the five-trial block preceding and for the 
three five-trial blocks following the CS-omis- 
sion trials.) 


control groups on these trials, Exami- 
nation of the figure clearly indicates 
that performance following the 15 es- 
cape trials drops substantially at all 
levels of training. Of equal interest is 
the fact that the initial drop in per- 
centage of avoidance responses is ap- 
proximately the same at all levels. 
With no prior avoidance training Ss 
which received 15 CS-omission trials 
avoided shock 10% of the time during 
the first block of test trials, while the 
control Ss responded 52% of the time 
—a difference of 42%. For Group 
E-5 the difference was 55% ; for E-15, 
55%; for E-30, 47%; and for E-45, 
44%. All p’s < .01 (Mann-Whitney 
U tests). Similar differences were ob- 
tained when within-group comparisons 
between pre- and post-CS-omission 
performance was compared. Here the 
percentage difference was 21, 22, 40, 
and 32 for the E-5, E-15, E-30, and 
E-45 groups, respectively. All ?'s € 
.05 (Wilcoxon paired-replicates tests). 
Since Group E-0 received no avoidance 
training before the escape trials, no 
such comparison is possible, In sum- 
mary, the interpolated escape trials 
consistently produced a decrement in 
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avoidance responding, a result opposite 
to that of the CS-omission effect. 


EXPERIMENT II 


This experiment varied the number 
of CS-omission trials while holding 
number of preceding avoidance trials 
constant. 


Method 


Subjects—The Ss were 50 male albino 
rats of the Wistar strain, ranging in weight 
from 250 to 300 gm., obtained from Dierolf 
Farms. 

Apparatus.—The apparatus was the same 
as in Exp. I. 

Procedure.—There were four experimental 
groups and one control group, each contain- 
ing 10 Ss. Following pretraining, which was 
identical to that in Exp. I, each S received 
15 avoidance trials, after which it was 
assigned to one of the five groups in such 
a manner that the number of avoidance 
responses during the 15 trials was approxi- 
mately equal among the groups. Group E-2 
then received 2 trials with the CS omitted, 
and groups E-5, E-10, and E-15 were given 
5, 10, and 15 CS-omission trials, respectively. 
There followed 15 additional avoidance trials 
for each group. The control group was 
given 45 avoidance trials after pretraining. 

In all other respects the procedure was 
exactly the same as in Exp. I. 


Results 


Figure 2 shows the mean percentage 
of avoidance responses for the block 
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Fic. 2, Effect of 2, 5, 10, and 15 CS- 
omission trials on avoidance responding fol- 
lowing 15 avoidance conditioning trials, 


of five trials immediately preceding the 
CS-omission trials and for three blocks 
of five trials after the CS-omission 
trials. As in Exp. I omitting the CS 
during avoidance conditioning produces 
a decrement in responding. Moreover, 
the decrement in performance tends to 
increase as the number of CS-omission 
trials increases. Differences between 
the E and C groups on the first five 
trials following the CS-omission trials 
were significant for the E-10 and E-15 
groups (p's < .01, Mann-Whitney U 
tests) and the within-group decrements 
for the pre- and post-CS-omission trial 
blocks were all significant (all p’s < 
05, Wilcoxon paired-replicates tests). 
The difference among the four experi- 
mental groups on the first block of 
test trials, when analyzed by means of 
the Kruskal-Wallis analysis of vari- 
ance by rank, approached significance 
H (3) 2742, with 7.82 required for 
significance at the .05 level. 
Discussion 

The CS-ommission effect, as described 
by Kimble, was not obtained in these 
experiments, A decrement in responding 
following CS-ommission was observed 
not only in relation to a control group, 
but also in within-group comparisons be- 
tween pre- and postomission levels of 
responding. These findings indicate that 
both prior and interpolated escape train- 
ing interfere with the acquisition and 
performance of an avoidance response. 


EXPERIMENT III 


This experiment investigated the ef- 
fects of varying periods of delay follow- 
ing the CS-omission trials in an effort 
to determine whether the interference 
effects found in Exp. I and II were 
the result of an overall increment in 
anxiety level produced by the unavoid- 
able shock during the CS-omission 
trials. Presumably, a heightened anx- 
iety level would dissipate or vary 
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systematically over time (cf. Kamin, 
1957). 


Method 


Subjects.—The Ss were 80 female albino 
rats of the Sprague-Dawley strain, ranging 
in weight from 180 to 250 gm., purchased 
from the Camm Research Institute, Wayne, 
New Jersey. 

A Apparatus—The apparatus was the same 
as in Exp. I and II. 

Procedure.—Four experimental and four 
control groups, each containing 10 Ss were 
run in this experiment. No pretraining was 
given. The Ss were placed in the apparatus 
and the avoidance-training procedure began 
approximately 1 min. later. The experi- 
mental groups were given 15 avoidance con- 
ditioning trials followed by 15 CS-omission 
trials, and the control groups, 30 avoidance 
conditioning trials. Following this, without 
interruption, one experimental group and one 
control group received 15 additional avoid- 
ance conditioning trials. Other experimental 
and control groups were removed from the 
apparatus for 1 min., 1 hr, or 24 hr. and 
then returned and given 15 additional avoid- 
ance trials. The 1 hr. and 24 hr. Ss had 
ad libitum access to food and water during 
the delay period. 


Results and Discussion 


Figure 3 shows the mean number of 
avoidance responses made by all groups 
during the 15 avoidance trials follow- 
ing the delay period. Omission of the 
CS for 15 trials clearly produces a 
marked decrement in avoidance re- 
sponding, F (1, 72) = 14.60, p < .001. 
Moreover, there is no indication that 
the disruption of the avoidance re- 
sponse produced by the CS-omission 
trials diminishes with the passage of 
time, F (3,72) = 1.96. 

The results of this experiment fur- 
ther confirm the previous findings that 
the CS-omission effect is not obtained 
in avoidance conditioning paradigms 
with rats. Moreover, the finding that 
the decrement in responding produced 
by the interpolated escape trials did 
not diminish over time suggests that it 


RETENTION INTERVAL 


Fic. 3. Effect of 15 CS-omission trials 
on avoidance performance 1 min, 1 hr., and 
24 hr. later. 


is the acquisition of competing instru- 
mental or learned fear responses rather 
than a simple heightening of anxiety 
level that interferes with subsequent 
avoidance performance. 
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TRANSFER AS A FUNCTION OF TIME TO MEDIATE? 


NORMAN E. SPEAR, PETER J. MIKULKA, anp MARVIN PODD 


Rutgers University 


Transfer processes employed in the AB’, AC, and CD transfer para- 
digms were compared when either a 1-, 2-, or 4-sec. duration of an- 
ticipation interval (AI) was permitted during presentation of the 
transfer list. Assuming that varying AI effectively varied the time 
available for utilizing mediators, it was expected that shorter AI 
would result in less usage of mediation in the AB’ condition, thus 
transforming it into a functional AC paradigm. Within the range 
of AI used in this experiment, there was no evidence that variation 
in AI differentially affected behavior in these 3 transfer paradigms. 
This was viewed as indirect support for the conclusions that (a) this 
kind of mediation occurs more rapidly than 1 sec., and/or (b) media- 
tion is not the primary transfer process employed in the AB’ paradigm. 


Barnes and Underwood (1959) have 
concluded that transfer in the A-B, 
A-B' paradigm (hereafter referred to 
as AB') proceeds with the use of a 
mediation chain, A-B-B', One major 
fact that influenced this conclusion was 
the great increase in performance under 
their free-recall technique (MMFR, in 
Which 4 min. were permitted for re- 
sponding) relative to that on a com- 
parable anticipation trial (on which 
only 2 sec. were permitted for respond- 
ing). Moreover, recall of the transfer 
list was nearly perfect after a single 
anticipation trial Additional sets of 
evidence lend Support to this easily 
reconcilable suggestion that mediators 
may be used more effectively the longer 
the anticipation interval (cf. Katz, 
1963; Schulz & Lovelace, 1964), 

The present experiment attempted 
to vary mediation opportunity via vari- 
ation of the anticipation interval (AI) 
in an effort to predetermine the proc- 
esses employed in transfer. For exam- 
ple, one might expect that when media- 
tion opportunity is reduced during the 
transfer test in the AB’ condition, Ss 


1This research was supported by Grant 
MH-10194 from the National Institute of 
Mental Health and by Grant No. 685 from 
Rutgers University Research Council. 
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would necessarily resort to the unlearn- 
ing process which Barnes and Under- 
wood (1959) have shown to be typi- 
cally employed in the A-B, A-C 
transfer paradigm (hereafter referred 
to as AC). If this were the case, then 
Ss in the AB’ condition should behave 
more like Ss in an AC condition the 
shorter the AI employed while the 
transfer list is being learned. Thus, 
the present factorial design compared 
performance in the AB' and AC trans- 
fer paradigms under either of three 
levels of AI given on the second or 
transfer list. The lists and procedures 
of Barnes and Underwood were em- 
ployed to insure recapture of their 
mediation phenomena, and a nonspe- 
cific-transfer base line was provided at 
each level of AI by employing an A-B, 
C-D paradigm (hereafter referred to 
as CD). 


METHOD 


Subjects—The Ss were 180 male under- 
graduates enrolled in the introductory psy- 
chology classes at Rutgers College of Rut- 
gers University. Ten random orders of the 
18 conditions were constructed (two different 
sets of lists were used for each of 9 experi- 
mental conditions). The Ss were assigned 
to conditions in these orders as they showed 
up at the lab, but with the restriction that 
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all 18 conditions were filled in a given block 
before a new block of 18 was begun. 

Apparatus.—A. Stowe memory drum was 
employed in this experiment to present the 
lists at a study interval of 2 sec., and an 
anticipation interval of either 1, 2, or 4 sec. 

Procedure and design—All Ss learned two 
consecutive lists. Following standard paired- 
associate instructions, List 1 was presented 
at a 2:2 rate to a criterion of one perfect 
«recitation or for 25 trials. Those Ss that 
did not attain the criterion (23 of the 180 
Ss) were about equally distributed among 
conditions. Following instructions, in which 
each S was fully informed about the length 
of the anticipation interval (AI) which 
would be used during the second list, the 
Ss were given List 2 for 10 trials. Only 
10 trials were given since Barnes and Under- 
wood’s (1959) results suggest that more 
List-2 trials add relatively little to the un- 
learning effects. On List 2 the AI was 
either shortened to 1 sec., remained the same, 
or lengthened to 4 sec. 

Three experimental paradigms were used: 
AB,AB'; AB,AC; and AB,CD. In order 
to control for possible list peculiarities, two 
lists appropriate to each paradigm were con- 
structed. The comparable lists were iden- 
tical to those of Barnes and Underwood, and 
the list for the CD condition had the same 
specifications and restrictions as the others. 
Briefly, the stimuli were CVC syllables 
(60-73% Glaze values) and the responses 
were two-syllable adjectives from Haagen’s 
(1949) scale of similarity. Four random 
orders of each list were used for each S to 
avoid serial learning of the responses. 

After the completion of the second or 
transfer list, Ss were presented with a sheet 
containing all the stimulus trigrams from the 
two lists, and MMFR was given as described 
by Barnes and Underwood (1959). 

Before Ss were dismissed, they were 
asked whether the learning of the first list 
had in any way facilitated the learning of 
the second list and, if so, to explain the 
nature of this help. 

Thus, the 9 independent groups of 20 Ss 
(10 Ss on each of the two alternative forms 
of the lists) formed a 3X 3 factorial design, 
with transfer paradigm (AB', AC, or CD) 
and AI on the transfer list (1, 2, or 4 sec.) 
as the independent variables. These condi- 
tions will be designated by combining the 
representative letters and numbers for para- 
digms and Al, eg. AB'-2 represents the 
AB' paradigm in which a 2-sec. AI was 
employed during the presentation of the 


transfer list. In general, the procedure 
(including the method of free recall, termed 
MMER) was held as similar as possible to 
that of Barnes and Underwood (1959). 


RESULTS 


Transfer—The mean number of 
trials to learn List 1 varied from 12.0 
to 14.6 among the 9 conditions. A 
3 x 3 analysis of variance showed that 
neither paradigm, the dummy variable 
of anticipation interval (AT), nor their 
interaction contributed significant vari- 
ance during first-list learning (F <1 
in each case). To the extent that the 
lists were of equal difficulty, this result 
reflects the equivalence of the groups. 

The course of performance over the 
10 trials on List 2 is shown for each 
condition in Fig. 1. It is notable that 
Ss in condition AB’ initially made nu- 
merically more correct responses than 
did Ss in either of the other conditions, 
regardless of AI. The function relat- 
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1234567890 
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Fic. 1. Mean number of correct antici- 
pations for each of the 10 trials given on 
the second or transfer list. 
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MEAN No. CORRECT on LIST 2 


Fig. 2, Mean number of correct antici- 
pations given on the second or transfer list 
durihg the first two transfer trials (bottom) 
and during the combined 10 trials (top). 


ing both early and overall transfer with 
AI may be seen more clearly in Fig. 2. 
Taking the first two transfer trials 
alone, statistically reliable (Pp < .01) 
effects of paradigm, F (2, 171) = 5.09, 
and AI, F (2, 171) = 40.38, were 
found with no interaction, F (4, 171) 
zu However, the numerically 
Superior performance of the AB’ con- 
ditions relative to comparable CD 
conditions did not attain significance, 
even at the .05 level, when evaluated 
by the Duncan multiple-range test 
(Edwards, 1960). When total per- 
formance was evaluated over the 10 
trials on List 2, the effects of paradigm, 
F (2,171) = 6.42, and AI, F (2, 171) 
= 60.02, remained significant, though 
the direction of the differences changed 
somewhat. With this measure, CD 
was numerically superior to AB’; but 
these differences also were not statis- 
tically reliable. Again there was no 
trace of a Paradigm x AI interaction, 
P (4, 171) =1.2, Thus it was deter- 


mined that AI did not differentially 
affect transfer in the “mediation” 
(AB’) and "unlearning" (AC) para- 
digms when early and/or overall trans- 
fer were examined. 

Overt intrusions of formerly correct 
List-1 responses during learning on the 
transfer list were more numerous in 
the AB' condition than in the AC con- 
dition (see also Barnes & Underwood, 
1959). Considering only the first two 
transfer trials, there were 12, 5, and 
10 such intrusions for Cond. AB'-1, 
AB’-2, and AB'-4, respectively, in con- 
trast to only 1, 2, and 3 such intrusions 
found in the corresponding AC condi- 
tions at this point. When all 10 trials 
were included, Cond. AB'-1, AB/-2, 
and AB'4 yielded 41, 34, and 29 in- 
trusions, respectively, while the corre- 
sponding AC conditions had 11, 8, and 
7. Furthermore, the greater numbers 
of intrusions in the AB’ conditions 
occurred in spite of fewer opportuni- 
ties, since there were also more correct 
responses in the AB’ than in the AC 
conditions. 

MMFR.—In view of the negligible 
differences which occurred among the 
alternative ways of scoring (see also 
Barnes & Underwood, 1959), results 
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Fic. 3. Mean number of responses from 
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list which were aired correctly during 
MMFR. k j 
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of this test are reported here only in 
terms of the number of responses which 
were both recalled and paired correctly. 
For example, 97% of the correctly re- 
called and paired responses in the AB’ 
condition and 98% in the AC condi- 
tion were also appropriately identified 
as to list membership. This varied 
somewhat irregularly from 95% to 
*100% among the different Als. The 
probability of a response being paired 
correctly once it was recalled was also 
near 1.00. This occurred in 99% and 
95% of the responses recalled in the 
AB’ and AC conditions, respectively. 
The most basic result of MMFR 
was the number of List-l responses 
which were given correctly. The spe- 
cific relationships may be seen in Fig. 
3. In decreasing order, the AB', CD, 
and AC paradigms differed in this re- 
spect, F (2, 171)— 25.58, p < .001, 
with the overall numerical difference 
between AB' and CD about the same 
as that between CD and AC. How- 
ever, AI had no effect on this produc- 
tion of correct List-l responses, F (2, 
171) = 1.16, and there was no inter- 
action between AI and paradigm, F (4, 
171) 2 75. Comparisons on this 
measure may be made between Cond. 
AB'2 and AC-2 from the present 
experiment, and the identical condi- 
tions in the Barnes and Underwood 
study. In the present experiment, 
Cond. AB'-2 and AC-2 gave 7.15 and 
5.00 correct response pairings. The 
corresponding values found by Barnes 
and Underwood were 7.12 and 5.00. 
It is interesting that AB' gave 
more correct List-l responses during 
MMFR than CD (by the Duncan 
multiple-range test). Summing over 
levels of AI (there was no interaction) 
this difference was statistically reliable 
(p < 01). Although all of these con- 
ditions were undoubtedly subject to 
retroactive interference effects, it can- 


not be determined whether the advan- 
tage of the AB’ conditions was due 
to a compensatory strengthening of 
List-l associations during the second- 
list learning of AB', or if it was be- 
cause of the unlearning of the associa- 
tion between stimulus context and 
List-l responses which presumedly oc- 
curred in the CD condition (see Mc- 
Govern, 1964). 

The number of correct pairings from 
List 2 were remarkably similar among 
conditions during MMFR in spite of 
the relatively powerful effect of AI on 
List-2 performance (see Fig. 3). Ex- 
cept for the AC-1 condition, in which 
a mean of 6.75 List-2 items was re- 
called, all other conditions recalled at 
least a mean of 7.50 of the 8.00 possi- 
ble, That is to say that the long re- 
sponding time available during MMFR 
revealed that the effect of AI was 
primarily upon performance, not learn- 
ing. In terms of the percentage in- 
crease in recall from the last trial of 
List 2 to MMFR, Cond. AB’-1, AC-1, 
and CD-1 increased by 27%, 35%, and 
14%, respectively. Comparing mean 
numbers of List-2 items recalled in 
Cond. AB’-2, and AC-2 with those ob- 
tained by Barnes and Underwood, the 
present experiment’s values of 7.75 and 
7.55, respectively, compare favorably 
with Barnes and Underwood's values 
of 7.83 and 7.33. 

Finally, it is important to consider 
the ordering of recall in those cases in 
which responses from both lists were 
paired correctly with their common 
stimulus. As Barnes and Underwood 
have pointed out, to the extent that 
List-1 responses are recalled first, 
the mediation hypothesis is supported. 
Note, however, that such support 
would not be inconsistent with an in- 
terpretation of AB’ transfer based upon 
Underwood’s (1951) concept of para- 
sitic reinforcement. In any case, this 
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measure also followed closely the rela- 
tionships obtained by Barnes and Un- 
derwood, with no interacting effects 
of AI. Thus, in the present experiment, 
List-1 responses were recalled first in 
50.0%, 58.9%, and 48.5% of the cor- 
rectly given pairs in Cond. AB’-1, 
AB'-2, and AB*'4, respectively. In 
contrast, List-1 items were given first 
in only 26.9%, 34.0%, and 26.7 96 of 
the instances in Cond. AC-1, AC-2, 
and AC-4, (Values obtained by Barnes 
& Underwood in conditions corre- 
sponding to the present AB'-2 and 
AC-2 were 58% and 40%, respec- 
tively.) These data reflect little influ- 
ence of AI and greater support for 
mediation in AB' than in AC. This 
was consistent with the postexperi- 
mental reports of Ss. In Cond. AB’-1, 
AB'2, and AB’-4, respectively, there 
were 20, 18, and 19 Ss who reported 
that they mediated through List-l re- 
sponses while acquiring the second list. 
In Cond. AC-1, AC-2, and AC-4, these 
figures were 2, 4, and 2. 


Discussion 


Although the expected relationships 
occurred clearly in terms of the main 
effects of anticipation interval (AI) and 
transfer paradigm, it will be maintained 
that the major finding of this experiment 
was the absence of interaction between 
these variables. Thus there was no sta- 
tistically reliable evidence that the three 
transfer paradigms were differentially 
affected by increased duration of AT, 
Now obviously the acceptance of the null 
hypothesis does not provide the most 
sound basis for conclusion. And it is true 
that the tendency toward an interaction 
between the AB’ and CD conditions was 
in the appropriate direction on the first 
trial with the transfer list (cf. Runquist 
& Marshall, 1963). But particularly con- 
cerning the AB’ and AC paradigms, there 
was not the slightest hint of interaction, 
Indeed, these two paradigms, with their 
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presumably opposing processes, showed 
remarkably parallel effects of AI. 

It will be recalled that an originating 
notion of this research expected that lim- 
iting the time to mediate in an AB’ con- 
dition would change it to a functional 
AC condition (Postman, 1963, has dis- 
cussed other instances of functional, as 
opposed to nominal, transfer paradigms). 
That is to say that shorter AI would 
presumedly force Ss to abandon the tech- 
nique of mediating through the first-list 
response during second-list learning, and 
to adopt the strategy of unlearning the 
first-list associations before acquiring 
those of the second list. In fact, this did 
not occur, With shorter AI, there was 
no indication in the AB’ paradigm that 
fewer List-l items were given correctly 
in MMFR, that fewer List-1 items were 
given first in MMFR, nor that fewer Ss 
reported the use of mediation while learn- 
ing List 2. Moreover, in each of these 
respects, the behavior found with the AB’ 
condition paralleled that obtained with 
the AC condition. 

While it is true that the effective dif- 
ferences in behavior shown between the 
AB’ and AC conditions in both the pres- 
ent and Barnes and Underwood’s experi- 
ments were statistically reliable, they 
were not very large in absolute terms. 
Thus, less than half of the items, on the 
average, gave evidence of unlearning in 
the AC conditions. Also, in terms of the 
number of List-1 items available in 
MMFR, AC and AB’ conditions were 
separated by only about two items. Put- 
ting this another way, whether Ss “un- 
learned" List-l items or whether they 
"mediated" (and thus strengthened the 
AB association further) made a differ- 


ence in recalling them on only about 25% . 


of the items invalved. In terms of the 
Percentage of List-1 items given first in 
MMER, these conditions also differed by 
no more than 25%, 

Thus, the similarities in the absolute 
values of performance in Cond. AB’ and 
AC, and the congruency between the be- 
havior of Ss in these conditions over the 
three Als, invites three Possibilities con- 
cerning transfer: (a) Mediation occurs 
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in AC as well as AB’. (b) Mediation 
occurs in neither AB’ nor AC. (c) The 
range of AI employed here does not affect 
utilization of the mediation process in- 
volved. This last possibility draws some 
support from the work of Katz (1963), 
which has suggested that at least under 
his training conditions, effective media- 
tion may take place in less time than the 
shortest AI duration (1 sec.) used in 
the present experiment. On the other 
hand, Schulz and Lovelace (1964) have 
demonstrated greater utilization of medi- 
ators with a 4-sec. AI than with a 2-sec. 
AI. Still, since the mediation chain of 
Schulz and Lovelace may have taken 
longer than the presumed A-B-B’ chain 
of the present experiment, this last alter- 
native, though excluded from the remain- 
der of this discussion, probably cannot 
be completely dismissed. ^ 
Consider alternative a from above. It 
seems that, in the present experiment's 
AC conditions, Ss did not unlearn about 
five of the eight items from List 1, Did 
they mediate with these items? The fact 
that there was no a priori connection 
between the B and C items does not dis- 
count the possibility that Ss could con- 
struct workable associations between 
them. Once B is firmly attached to A, 
it would seem intuitively easier to asso- 
ciate B with C than to both unlearn the 
association A-B and learn A-C. That 
there is some evidence of unlearning 
would simply reflect certain instances in 
which the association B-C is so difficult 
that the unlearning is necessary. How- 
ever, if it were conceded that functional 
mediation did occur in the AC condition, 
it would be difficult to maintain that this 
was the same process which occurred in 
AB‘. For example, if there is mediation 
in AC, the Ss themselves are not aware 
of it. The vast majority say they do 
not mediate, and even the 13% that re- 
ported doing it in the present experiment 
also reported that they were successful 
for only one or two of the items, This 
may be contrasted with the fact that 95% 
of the Ss in the AB’ conditions reported 
that they mediated, and without the re- 
strictions mentioned by AC Ss. More- 


over, Ss in the AB’ condition gave List-1 
responses first in MMFR more often than 
did Ss in the AC conditions, and the 
latter gave more List-2 responses first 
than List-l responses. In addition, a 
corollary of this reasoning is that, al- 
though AI affected AB’ and AC in the 
same way, the benefit of the longer AI 
should still have been greater in these 
conditions than in CD, where no common 
elements existed between the lists. As 
shown, the necessary interactions never 
approached statistical reliability. 

Thus, if the alternative conclusions 
were that mediation was employed in 
either both the AC and AB’ conditions 
or in neither, the latter would appear the 
more reasonable. Application of Under- 
wood's (1951) principle of parasitic re- 
inforcement to account for these data 
would not be difficult, though as Barnes 
and Underwood (1959) have pointed out, 
it would require more assumptions than 
the simpler mediation explanation. For 
example, the fact that Ss report that they 
mediate in the AB’ condition may simply 
reflect implicit responding of B to A, 
and not necessarily effective mediation. 
A combination of more sophisticated Ss 
and more careful interrogation may have 
made this clearer, though probably not 
in view of the notorious unreliability of 
S reports in these matters. 

All of these points seem to indicate that 
if mediation is the primary transfer proc- 
ess used in this kind of AB' paradigm, 
it takes place much more rapidly than was 
the case in the chaining investigated by 
Schulz and Lovelace (1964). A second 
inference which may be made from data 
such as those presented here concerns the 
small differences in measured behavior 
which occur in these paradigms. In gross 
terms, one could say that Ss in Cond. AB’ 
and AC yield the same consequences of 
their behavior about 75% of the time. 
More correctly, the behavior of Ss in AB’ 
and AC does not produce differential con- 
sequences in terms of 7596 of the mate- 
rials learned, at least not with the items 
learned in this experiment and that of 
Barnes and Underwood (1959). This 
may be no more than a comment on the 
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resourcefulness of the S population which 
is typically employed in these experi- 
ments. Nevertheless, at least with the 
present methods and materials, the meas- 
urable differences are so slight as to 
practically defy investigation of transfer 
processes through the use of potentially 
interacting variables. That is to say 
that certain potentially interacting varia- 
bles may need to be investigated with 
techniques yielding exaggerated transfer 
effects, perhaps under circumstances dif- 
ferent from the conventional memory- 
drum techniques. 
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VIBROTACTILE THRESHOLDS FOR HAIRY SKIN* 
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Absolute vibrotactile thresholds were determined as a function of 


stimulus frequency and contractor area on 
vibration decrease in 
—3 db. per doubling of area. When 


forearm. Thresholds for 
contactor area with a slope of 


plotted as a function of frequency these data 
12 db. in lower frequencies and +9 db. in frequencies 
Both these findings confirm previous data obtained 
differences between hairy and glabrous skin 


with a slope of — 
above 220 cps. 
on glabrous skin. Some 


the hairy skin of the volar 
direct proportion to the 


yield a U-shaped curve 


were found and discussed. Evidence is presented in support of a 
hypothesis advanced in earlier papers which suggests that there may be 
2 types of mechanoreceptors in cutaneous tissue. 


The effect of contactor area on the 
vibrotactile threshold has been explored 
for glabrous skin on the palm of the 
hand (Verrillo, 1962, 1963, 1966). 
The results of these experiments agreed 
with previous investigations that the 
frequency function is U shaped with a 
maximum of sensitivity in the region 
of 250-300 cps. It was also established 
that the absolute threshold is inversely 
proportional to the total energy de- 
livered to the skin. This result was 
consistent with analogous data for vi- 
sion and hearing. 

Examination of the threshold curves 
plotted as a function of frequency and 
contactor area led to the hypothesis 
that glabrous tissue is host to at least 
two types of mechanoreceptors—one 
that is dependent on frequency and on 
the contactor area and one that is 
independent of these stimulus param- 
eters. 

The present investigations are a sys- 
tematic study of spatial stimulus pa- 

1 This work was supported by contracts 
between the United States Office of Naval 
Research (ONR Contract No. NOnr 669 
(13), NR No. 140-145), the National Insti- 
tutes of Health, United States Department of 
Health, Education, and Welfare, and Syra- 
cuse University. Reproduction in whole or 
in part is permitted for any purpose of the 
United States Government, 
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rameters on hairy skin. Absolute 
thresholds for vibrotactile stimuli were 
determined as a function of frequency 
and contactor area. 


APPARATUS AND METHOD 


The reader is referred to a previous publi- 
cation (Verrillo, 1966) for a detailed de- 
scription of the apparatus used in these 
experiments. Sinusoidal signals were modu- 
lated so that the signal was on for 1 sec. 
and off for 1 sec. A rise-fall time of 100 
msec. was used to minimize switching tran- 
sients. The vibrator assembly and S were 
located within a soundproofed booth which 
provided isolation from building vibrations. 

The site of stimulation was on the skin of 
the volar forearm over the flexor carpi 
ulnaris muscle and approximately 8.0 cm. 
distal to the medial epicondyle. All of the 
Ss had a growth of hair in this region. 
The reported values are median measure- 
ments of three Ss. Thresholds were deter- 
mined at 25, 40, 80, 90, 100, 110, 130, 160, 
200, 220, 250, 320, 400, and 450 cps using 
contactor areas of .02, .08, .32, 72, 13, 29, 
and 5.1 cm?, Not all combinations of fre- 
quency and area were tested. 


RESULTS 


Absolute Threshold as a Function of 
Area and Frequency 


The threshold data shown in Fig. 1 
are plotted as a function of contactor 
area with frequency as the parameter. 
Curves have been drawn through some 
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Fic. 1. Vibrotactile thresholds as a function of contactor area. (The curves drawn 
to fit the experimental points have a slope of —3 db, per doubling of area.) 


frequencies in order to indicate the 
slope of —3 db. per doubling of the 
contactor area for frequencies of 80 
cps and above. The spatial summation 
indicated by these curves is identical 
to that found earlier for glabrous skin. 
It should be noted that for the fre- 
quencies 25 and 40 cps the curve is 
essentially flat.? 

The data replotted in Fig. 2 as a 
function of frequency yield U-shaped 
curves when contactors of sufficient 

2 The figures in this paper and in previous 
publications by the author indicate a refer- 
ence level of 1 a RMS. This should read 
RE 1 MICRON PEAK. Note that this 


adjustment does not alter the slopes of the 
curves, 
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Fic. 2. Vibrotactile thresholds as a function 


size are used. In the low frequencies 
(25-220 cps) the slope of the curve is 
approximately —12 db. per doubling 
of frequency and rises with a slope of 
about 9 db. per doubling in the fre- 
quencies above 220 cps. These slopes 
are the same as those found for glab- 
rous skin (Verrillo, 1963). 

There appears to be a slight shift 
in the region of maximum sensitivity 
to a lower frequency as the size of 
the contactor is increased. Whether 
or not this is a real shift will require 
further experimentation in the middle 
frequencies. No hypothesis can be of- 
fered at this time to explain this ap- 
parent shift, 
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slope of —12 db, below 220 cps and a slope of +9 db. above 220 cps.) 
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Fic. 3. Comparison of vibrotactile thresholds on hairy and glabrous skin for a contactor 
area of 1.3 cm2. (The horizontal lines through 14 and 25 db. represent the curves obtained 
using a contactor area of .02 cm®. Curves drawn to fit the experimental data have a slope 


of —12 db.) 


It should also be noted that there is 
no summation over frequency when a 
very small contactor (.02 cm?) is used, 
just as there is no summation over 
area in the low frequencies (Fig. 1). 
This same phenomenon was observed 
in data taken on glabrous skin (Ver- 
rillo, 1963, 1966) and was interpreted 
as evidence of two types of mechano- 
receptors in human skin. This inter- 
pretation was supported by studies in 
which vibrotactile thresholds were de- 
termined as a function of various tem- 
poral patterns of the physical stimulus 
(Verrillo, 1965). The present find- 
ings may certainly be construed as 
further evidence of a dual system of 
mechanoreceptors. 


Comparison of Hairy and Glabrous 
Skin 


There are several features in these 
data which distinguish them from those 
obtained from glabrous skin. The first 
and most obvious is the upward shift 
in absolute threshold on hairy skin by 
about +11 db. at 25 cps and +20 db. 
at 250 cps. This may be attributed 
to the difference in neural innervation 
between the two sites, 


Another notable difference between 
the responses is the shift in maximum 
sensitivity from about 250 cps for 
glabrous skin to the region of 220 cps 
for hairy skin. This can be seen clearly 
in Fig. 3 which compares the two 
sites for one contactor size. A shift in 
the resonant frequency of the tissue can 
be ruled out as an explanation, since 
there is no observable effect of reso- 
nance when very small contactors were 
used on either glabrous (Verrillo, 1963, 
1966) or on hairy skin (Fig. 2). The 
shift in sensitivity is probably due to 
neural effects not fully understood at 
this time. 

The third and perhaps most signifi- 
cant of the differences can be seen in 
Fig. 3. Horizontal lines have been 
drawn at 14 db. and 25 db. to indicate 
the level at which the smallest con- 
tactors maintain a constant threshold. 
Slopes of —12 db. have been drawn to 
fit the experimental data. Tt was sug- 
gested in an earlier paper (Verrillo, 
1963) that the receptor system which 
is independent of frequency and area 
dominates the response to the point 
where the two lines intersect. Allow- 
ing for some curvature in the data due 
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to an interaction between the two sys- 
tems, the second population of re- 
ceptors then dominates the response 
and completely determines the slope of 
the curve. 

It is apparent in Fig. 3 that the fre- 
quency range in which this interchange 
becomes effective is different in the two 
types of skin. For hairy skin the tran- 
sition occurs in the vicinity of 90 cps 
while in glabrous skin it occurs around 
50 cps. This relationship is main- 
tained even when adjustments are made 
for the differences in absolute thresh- 
old and the shift in maximum sensi- 
tivity. 

The relationships seen in Fig. 3 are 
present when the other contactor sizes 
are plotted in a similar fashion. Within 


the framework of the two-receptor hy- 
pothesis this suggests a difference in 
the proportion of frequency dependent 
and frequency independent receptors in 
glabrous and hairy skin. 


REFERENCES 


Verrito, R. T. Investigation of some 
parameters of the cutaneous threshold for 
vibration. J. Acoust Soc. Amer., 1962, 
34, 1768-1773. 

VznRILLO, R. T. Effect of contactor area on 
the vibrotactile threshold. J. Acoust. Soc. 
Amer., 1963, 35, 1962-1966. 

Verritto, R. T. Temporal summation in 
vibrotactile sensitivity. J. Acoust. Soc. 
Amer. 1965, 37, 843-846. 

Verru, R. T, Effect of spatial parame- 
ters on the vibrotactile threshold. J. exp. 
Psychol. 1966 71, 570-575. 


(Received March 15, 1965) 


Journal of Experimental Psychology 
1966, Vol. 72, No. 1, 51-60 


EFFECT OF INTERPOLATED EXTINCTION AND LEVEL 
OF TRAINING ON THE “DEPRESSION EFFECT”? 
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Rutgers University 


The fact that rats shifted from large to small reward may undershoot 
the performance of a base line control group (depression effect) has 
been previously accounted for in terms of frustration concepts. Experi- 
ment I was designed to test a prediction that frustration experience 
prior to a downward shift in reward magnitude would decrease or 
prevent the occurrence of the depression effect. The effects of 2 levels 
of preshift training were included as controls for number of trials and 
number of rewarded trials. Experiment II studied the effects of inter- 
polated extinction trials when the number of extinction trials exceeded, 
or was less than, the number of rewarded training trials. It was found 
that the depression effect was stronger the greater the number of pre- 
shift trials, and that its strength was reflected differentially among 
the response measures at given points of postshift training. Inter- 
polated extinction trials produced little or no reduction in the depression 
effect. This last finding raises related questions concerning the effect 
of experimental extinction on S’s representational response of reward 
magnitude, and the adequacy of a frustration-theory interpretation of 


the depression effect. 


Crespi (1942) attributed the “de- 
pression effect” (undershooting of an 
expected performance level following a 
shift from large to small reward) to 
emotional factors in the rat. Bower 
(1961, 1962) has extended this inter- 
pretation by viewing such downward 
contrast effects within the context of 
Amsel’s and others’ more formalized 
frustration constructs. Bower’s work 
was especially important since it rep- 
resented an initial step toward subsum- 
ing facts concerning extinction and 
those dealing with other decreases in 
reward magnitude under one set of 
concepts. However, other plausible in- 
terpretations of the depression or down- 
ward-contrast effect have been sug- 
gested as alternatives to the emotion- 
based explanation (eg., Collier & 
Marx, 1959). Certain implications of 
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the emotion-based interpretation are 
relevant to the empirical problem posed 
in the present experiments. 

The basic purpose of the present re- 
search was to evaluate the effect of an 
interpolated disruption of the condi- 
tions of reinforcement on behavior ac- 
companying a subsequent shift in re- 
ward magnitude. In particular, ex- 
perimental extinction was given fol- 
lowing the preshift stage (large re- 
ward) and preceding the postshift 
stage (small reward) of training. 
Such extinction was presumed to have 
two related effects. First, it should 
at least tend to neutralize the differ- 
ential effect of the preshift reward mag- 
nitude. Second, to the extent that 
frustration responses are responsible 
for the depression effect, the interpo- 
lated extinction (which itself elicits 
rys) should dampen the magnitude of 
the rysy which would ordinarily be 
presumed to occur when the smaller 
postshift reward was experienced. 
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EXPERIMENT Í 


Essentially, the procedure was to 
present S initially with large reward 
upon completion of the instrumental 
response, then subject S to several 
trials intended to extinguish the instru- 
mental response by the complete ab- 
sence of any nominal reward, and fi- 
nally present S with only the small 
reward upon completion of the instru- 
mental response. The question was 
whether S would show a reduced mag- 
nitude of the depression effect upon 
receipt of the small reward as com- 
pared with other Ss which had not had 
the interpolated extinction experience. 


Method 


Subjects and apparatus.—The Ss were 30 
experimentally naive female albino rats of 
the Sprague-Dawley strain. All Ss were 
60 days old at the start of experimental 
training. The apparatus was a straight run- 
way with a 34-in. alley, 11.5-in. start box, 
and a 12-in. goal box. The start and run- 
way units were 3.5 in. high and 3.5 in. wide. 
The entire alley was flat gray and had 
Plexiglas covers. The goal box was 11.25 
in. wide and 8-in. high. Plywood doors, 
.25 in, thick, separated the start and goal 
boxes from the runway. Standard Electric 
timers, Model S-1, recording to .01 sec., and 
Hunter photo relays, Model 335-S, were 
used to measure S’s progress through the 
runway. Three time measures were re- 
corded: Start time was recorded from the 
time the start-box door was opened until 
S broke Beam No. 1 located 12 in. down 
the runway. Running time was recorded 
from Beam No. 1 to the goal-box door. 
The light beam at the goal-box door in turn 
activated a third timer which ran until S 
broke a light beam 12 in. away, 1 in. from 
the rear of the goal box and directly over 
the food cup. 

Taming and deprivation.—Each S received 
a total of 10 gm. of food per day throughout 
taming and experimental training (reward 
plus supplemental ration). All Ss were 
given daily 3-min. sessions of handling on 
the 5 days preceding experimental training. 
After handling, S was replaced in his cage 
along with a few of the .045-gm. Noyes 
pellets which later served as reward. Water 
was always available in S’s cage. 


Design and procedure.—Six groups of 
five Ss were formed by random assignment. 
Two groups of rats were given either 10 
pellets (10-0-1) or 1 pellet (1-0-1) as re- 
ward on Trials 1-25. Both groups were 
then given 25 trials of extinction, followed 
by reacquisition with 1-pellet reward. Two 
additional groups were formed to control for 
the number of rewarded experiences prior 
to shift. Group 10-1 was not run on Trials 
1-25, but received 10 pellets as reward on 
Trials 26-50 and 1 pellet as reward on 
Trials 51-75. Group 1-1 received 1 pellet 
as reward on Trials 26-75. Finally, two 
groups were formed to control for the total 
number (rewarded plus nonrewarded) of 
experiences in the runway. Group 10-10-1 
was given 50 trials with 10-pellet rewards, 
followed by 25 trials at 1-pellet reward. 
Group 1-1-1 was given all trials with 1-pellet 
reward. This schedule provided for the 
Presentation of postshift experience at a 
common time for all Ss, Controls for re- 
tention over the relatively short 5-day ex- 
tinction period were not thought necessary in 
view of independent data obtained by the 
third author under comparable conditions, 
which show virtually no important change 
in behavior over such an interval. The 
experiment was therefore an incomplete 
2X2X2 factorial in which interpolated ex- 
tinction appeared in both levels of preshift 
reward magnitude but in only one of the two 
levels of preshift training. 

The Ss were given 5 trials per day with 
an intertrial interval of 25-30 min. Those 
Ss not run on Trials 1-25 were removed 
from their cages during their turn in the 
rotation, held by E for a few seconds, and 
replaced. On each trial, S was placed in 
the start box, and 3 sec. later the door was 
opened. After S left the start box, the door 
was closed; and the goal-box door was 
closed as soon as S' entered the goal box. 
Each S was detained in the goal box for a 
minimum of 20 sec, and allowed longer if 
necessary to consume the reward. 


Results 


Start, running, and goal times for 
each S on each trial were converted to 
speed scores. A suggestive gradient 
appeared with the greatest behavioral 
differences Occurring first nearest the 
goal box. None of the comparisons 
based on starting speed yielded signifi- 
cant differences, and the variables 
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Fic. l. Mean running speed per block 
of five trials. 


generally had less effect on running 
than on goal speed. Mean running 
speeds for each of the groups in blocks 
of five trials are shown in Fig. 1, and 
mean goal speeds are shown in Fig. 2. 

Preshift running speed.—The effect 
of reward amount had not quite at- 
tained significance by Preshift Trials 
21-25, F (1; 16) = 276, p > .10, and 
no differences appeared as a function of 
the "dummy" variables which were 
subsequently introduced. Hereafter, it 
may be assumed that the dummy vari- 
ables had no effect on the basic results ; 
otherwise this fact will be pointed out. 
On Trials 46-50 of preshift, there was 
an overall magnitude effect, F (1, 16) 
— 843, p<.05, but the effect of 
amount of training was not significant, 
F (1, 16) = 294, p>.10. Analysis 


` of the first 5 trials of extinction yielded 


a Groups X Trials interaction that 
barely missed statistical significance, 
F (4, 32) = 246, p > .05. The trend 
was toward greater decline in speed 
by Ss that had had the 10-pellet re- 
ward. Group 10-0-1 did not reliably 
undershoot Group 1-0-1, and these 
groups did not differ at the end of ex- 
tinction, £(8) = .66. 

Preshift goal speed—On Preshift 
Trials 21-25, speed was greater for 
10 than for 1 pellet, F (1, 16) = 15.5, 
p < .01. On the final block of preshift 
trials, Groups 10-10-1 and 1-1-1, who 
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Fic. 2. Mean goal speed per block 
of five trials, 


completed 50 preshift trials, were com- 
pared with Groups 10-1 and 1-1, who 
completed only 25 acquisition trials. 
Faster running accompanied the larger 
reward, F (1, 16) = 8.95, p <.01, and 
the greater number of preshift trials, 
F (1, 16) = 1141, p « .01, and there 
was no interaction. A Groups X 
Trials analysis of variance over the 
first 5 trials of extinction yielded a sig- 
nificant Groups X Trials interaction, F 
(4, 32) = 348, p « .05, indicating that 
Group 10-0-1 decreased its speed in 
extinction more rapidly than Group 
1-0-1. The apparently slower speed 
by Group 10-0-1 on Blocks 8 and 9 
was not significant. These groups did 
not differ at the end of extinction, 
t(8) =.24, 

Postshift running | speed.—Because 
the design of the experiment was an 
incomplete 2 X 2 X 2 factorial, three 
2 x 2 factorial analyses were performed 
on each daily block of postshift trials. 
Group 10-10-1 was the only group to 
show, relative to its base line control 
group, significantly slower speed (de- 
pression effect) in terms of this re- 
sponse measure. The depression effect 
was found on Blocks 12 (i.e., Postshift 
Trials 6-10) 13, and 14 (p< 0l in 
each case); and the mean speed of Ss 
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which had had the larger reward had 
recovered to the level of the base line 
control by Block 15. None of the in- 
teraction terms in these comparisons 
reached significance at the .05 level. 

Postshift goal speed.—A significant 
depression effect, P € .005, was found 
in all groups on Block 12. None of 
the interaction terms were significant, 
indicating that the amount of depres- 
sion did not vary as a function of in- 
terpolated extinction or amount of 
training, On Block 13, Groups 10-1 
and 10-0-1 had recovered from the de- 
pression effect (i.e., no longer differed 
from their respective base line con- 
trols), However, Group 10-10-1 did 
not recover until Block 14, which sug- 
gests a somewhat longer duration of 
the depression effect after the greater 
number of preshift trials, 


Discussion 


The results indicated that interpolated 
trials with no reward have no effect on 
the undershooting or depression-effect 
Phenomenon relative to a condition in 
which an equivalent number of rewarded 
preshift trials are given but without in- 
terpolated nonrewarded trials. That is 
to say that the depression effect found 
comparing Group 10-0-1 with its con- 
trol, Group 1-0-1, did not differ reliably 
(or even apparently) from that compar- 
ing Group 10-1 with 1-l. There was 
some indication that number of rewarded 
preshift trials may affect the duration of 
the depression effect, though the magni- 
tude of the undershooting did not differ 
significantly between a group given an 
equivalent number of Preshift experiences 
which included 25 nonrewarded trials, 
and a group given an equivalent number 
of preshift trials, all of which were 
rewarded. 

Now there are data Showing slower 
reacquisition following extinction under 
large reward for Ss that had previously 
acquired the response for the smaller 
reward (eg., Logan, 1960, p. 188). 
Logan has accounted for this fact, * . . 


with the assumption that different habit: 
profiles were built up during original 
learning and persisted into the relearning 
phase [p. 188]." Thus such data inter 
preted in this way would actually work 
against the present finding of slower 
running by the initially large-reward 
group relative to their control group since 
their original habit profile included faster 
running than the base line control, Tt 
might also be argued that the present 
results may be confounded by differential 
reacquisition rates for Ss switched, com- 
Pared to those not switched, in reward 
magnitude, However, differences in the 
present two experiments are maintained | 
even after asymptotic behavior apparently 
is reached, Finally, it could be suggested 
that the sizable depression effect of Group 

1 was a consequence of the stronger 
Trsy which occurred during extinction in 
this larger reward group relative to their 
control, Group 1-0-1 (see Amsel, 1962). 
Thus, s, associated with the goal box 
should have elicited greater avoidance for 
Group 10-0-1 during the postshift stage. 
It can be readily seen, however, that such 
data as Logan's cited above would argue 
against this explanation since reacquisi- 
tion (following extinction) under a com- 
mon large-reward condition has been 
shown to be more rapid for the previously 
large-reward group. 

Failure of interpolated extinction to 
affect postshift performance was, there- 
fore, quite surprising. So an additional 
experiment (which will not be considered 
here because certain additional orthogonal 
variables make it irrelevant to the main- 
Stream of this report) was run under 
conditions which closely replicated those 
of Exp. L The basic results of Exp. I 
were completely replicated—there ap- 
peared to be a negligible effect of interpo- 
lating extinction trials on the subsequent 
depression effect. Still, it remained both 
intuitively and theoretically appealing 
that if enough extinction trials were 
given relative to the number of preshift 
acquisition trials, the postshift depression 
should fail to occur or at least be de- 
creased. Also, it was apparent that, 
under the parameter values employed in 
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Exp. I and the replication experiment, 
Ss given interpolated extinction appar- 
ently had not reached asymptotic extinc- 
tion performance at the end of extinction. 
The possibility existed that the number 
of interpolated extinction trials, relative 
to the number of preshift acquisition 
trials, was not sufficient to fully reflect 
their role on later postshift performance. 
* Experiment II was designed to clarify 
the problem outlined above. Also, Exp. 
II was intended to provide further evi- 
dence concerning the role of number of 
rewarded preshift trials in postshift per- 
formance, and to replicate the basic phe- 
nomena with a quite differently con- 
structed runway. In Exp. II reward 
magnitude (1 vs. 10 pellets), number of 
rewarded preshift trials (25 vs. 75), and 
number of interpolated extinction trials 
(0 vs. 50) were combined factorially in 
a 2X2X2 design. This design per- 
mitted evaluation of postshift perform- 
ance in groups having more nonrewarded 
than rewarded experience in the appara- 
tus, and vice versa. 


ExPERIMENT II 
Method 


Subjects and apparatus —The Ss were 
experimentally naive female rats of the 
Sprague-Dawley strain. All Ss were 60 
days old at the start of experimental train- 
ing. In order to extend the generality of 
the results of Exp. I, an entirely different 
apparatus was used in Exp. II. The appa- 
ratus was a straight runway with a 70-in. 
alley, 12-in. start box, and a 10.5-in. goal 
box. The start and alley units were 5 in. 
high and 5 in. wide. The goal box was 10.5 
in. wide and 5 in. high. The start box was 
white, and the alley and goal box were flat 
gray. The entire runway had Plexiglas 
covers. Plexiglas doors separated the start 
and goal boxes from the alley. Standard 
Electric timers, Model S-1, recording to .01 
sec, and Hunter photo relays, Model 335-S, 
were used to measure S’s progress through 
the runway. Three time measures were 
recorded: Start time was recorded from the 
time the start-box door was opened until S 
broke Beam No. 1 located 12 in. down the 
runway. Running time was recorded from 
Beam No. 1 to Beam No. 2, 6 in. from the 


goal-box door. Beam No. 2 in turn acti- 
vated a third timer which ran until S broke 
the light beam 15 in. away, 2.5 in. from the 
rear of the goal box and just before the 
food cup. The opaque food cup, which was 
2 in. deep, was suspended above the floor 
such that S had to break the photo beam 
before the contents were visible. 

Design and procedure—The experiment 
was a 2X22 design with reward magni- 
tude (1 or 10 pellets), number of acquisition 
trials (25 or 75), and interpolated extinction 
(0 or 50 trials) combined factorially. Two 
groups of rats were given either 10 pellets 
(10:-05-1) or 1 pellet (15-0&-1) as reward 
on Trials 1-75 and extinction on Trials 76- 
125. Two groups were given 10 pellets 
(10:-05-1) or 1 pellet (15-05-1) on Trials 
51-75 followed by extinction on Trials 76- 
125. Two groups were given 10 pellets 
(105-1) or 1 pellet (1z-1) on Trials 51-125. 
Two additional groups were given 10 pellets 
(105-1) or 1 pellet (15-1) on Trials 100- 
125. All groups were switched to 1-pellet 
reward on Trials 126-170. This schedule 
provided for the presentation of postshift 
experience at a common time, and after the 
same number of days of handling and food 
deprivation, for all Ss. 

The Ss were given 5 trials per day with 
an intertrial interval of 40-45 min. Those 
Ss not run until later trials were removed 
from their cages during their turn in the 
rotation, held by E for a few seconds, and 
replaced. On each trial S was placed in 
the start box, and the door was opened 3 
sec, later. After S left the start box, the 
door was closed; and the goal-box door was 
closed as soon as S entered the goal box. 
Each S was detained in the goal box for a 
minimum of 20 sec. and allowed longer if 
necessary to consume the reward. 


Results 


Median start, running, and goal 
times for each S on each day (block of 
5 trials) were converted to speed 
scores. These three measures are 
shown in Fig. 3 for groups having no 
extinction and in Fig. 4 for groups 
having interpolated extinction. Mean 
contrast-effect scores were computed 
by subtracting the mean of each large- 
reward group from the mean of the 
appropriate small-reward group. Mean 
contrast-effect scores for start, run- 
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Fic. 3. Median speeds per day for groups 
not given interpolated extinction. (The last 
5 days of acquisition are shown.) 


ning, and goal speeds are shown in 
Fig. 5. 

Acquisition —In general, large- and 
small-reward groups clearly differed in 
their speeds at the end of acquisition. 
The groups that received 75 acquisi- 
tion trials appeared to be at asymptotic 
performance levels while those groups 
receiving 25 acquisition trials were 
clearly at preasymptotic levels. 

Acquisition start speed.—Analysis 
of variance on the last 5 days of acqui- 
sition indicated significant effects, p < 
.01, for reward magnitude, F (1, 32) 
= 12.19; and number of acquisition 
trials, F (3, 32) = 87.35; and, p< 
-03, for the Magnitude x Dummy vari- 


able, F (1, 32) = 6.67. The significant 
Magnitude X Dummy interaction was 
due to the spuriously slow mean start 
speed in Group 1,,-1. This group re 
ceived acquisition during the extinction 
trials for groups given 25 trials of re- 
ward and 50 trials of nonreward. The 
effect of this, presumedly chance occur- 
rence, would be one of lessening the 
amount of depression shown by Group 
10,,-1, making it difficult to interpret 
the effects of interpolated extinction on 
postshift start speed, 

Acquisition running speed.—Analy- 
sis of variance on the last 5 days of 
acquisition yielded F (1, 32) = 2.88, 
$ €.10, for reward magnitude with 
the 10-pellet Ss tending to run faster, 
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Fic. 4. Median speeds per day for groups 
given interpolated extinction. (The last 5 
days of acquisition are shown for the 75- 
trial groups, and all of acquisition is shown 
for the 25-trial groups.) 
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and F (1, 32) = 15.47, p < .01, for 
number of acquisition trials. None 
of the other sources of variance were 
statistically significant. Analysis of 
variance on the last 2 days of acqui- 
sition showed that there had been sig- 
nificant effects, p < .01, of number of 
acquisition trials, F (1, 32) = 10.50; 
and Acquisition X Magnitude interac- 
tion, F (1, 32) = 7.57. The effect of 
reward magnitude, F (1, 32) = 6.40, 
was significant at the .05 level. 

Acquisition goal speed.—The effects 
of level of acquisition and the dummy 
variable were both significant, p « .01, 
on the last 5 days of acquisition and the 
last 2 days of acquisition, These dif- 
ferences reflected faster goal speed for 
Ss given longer training and slower 
speeds for Ss who did not subsequently 
receive extinction. This latter differ- 
ence was uniform over the different re- 
ward magnitudes, so did not influence 
conclusions concerning contrast effects. 
No other effects were significant in 
these comparisons. 

Extinction—Analyses of the last 2 
days of extinction for start speed indi- 
cated that none of the effects were sig- 
nificant at this point. Similarly, no 
effect was significant on the last 2 
days for running speed. 
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MEAN CONTRAST EFFECT(ft/sec) 


DAYS 


Fic. 5. Mean contrast effect scores per 
day, obtained by subtracting the mean speed 
of groups shifted from large to small reward 
from the mean speed of the appropriate base 
line control group. 


An analysis of variance (2 X 2 X 
Days) was carried out on Days 1, 2, 
and 3 for goal speed. The main ef- 
fects of number of acquisition trials, 
F (1, 16) = 5.64, p « .05; and days, 
F (2, 32) = 23.21, p < .01, were sig- 
nificant. The interaction effects of 
Acquisition X Days, F (2, 32) = 18.45, 
p< .01, and Magnitude x Days, F (2, 


TABLE 1 


ANALYSES OF VARIANCE (F VALUES, df 


= 1,32) on Posrsuirr Days 1 AND 2 AND 


8 AND 9 FoR ALL RESPONSE MEASURES 


Start Speed Running Speed Goal Speed 
Source Days Days Days 
1-2 8-9 1-2 8-9 1-2 8-9 
Acquisition (A) 1.90 0.35 0.16 0.01 6.92* 4.81* 
Magnitude ( M) 0.14 11.66** 11.88** 4.66* 37.21** 1.27* 
Extinction (E) 1.01 58.11** 2.19 19.01** 8.10** 2.56 
AXM 0.61 8.56** 7.46* 5.22* 8.47** 0.30 
AXE 0.004 0.52 1.34 1.04 1.46 0.06 
MXE 6.61* 0.02 0.61 0.29 2.54 0.21 
AXMXE 0.09 3.60 0.76 0.05 0.46 0.11 
*p «.05. 
5$ fu 
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32) = 19.82, 5 «05, were also sig- 
nificant. The effect was one of greater 
decline in speed by Ss who had the 
10-pellet reward, and greater decline 
following a greater number of acqui- 
sition trials, 

Analysis of goal speeds on the last 
2 days indicated that there were no 
significant effects (i.e., all groups had 
extinguished to the same level). 

Postshift—Analyses of variance (2 
X 2X 2) were carried out on the first 
2 days (Days 1 and 2) and on the 
last 2 days (Days 8 and 9) of postshift 
training for each of the speed scores. 
The results of these analyses are pre- 
sented in Table 1. 

Significant depression eventually oc- 
curred in all speed measures, and 
amount and duration of depression in- 
creased with increasing amounts of 
training prior to reward shift. There 
Was no evidence of any statistically 
reliable decrease in the amount or 
duration of the depression effect as a 
consequence of the interpolated ex- 
tinction. 

Postshift start speed —There was no 
evidence of a depression effect on this 
response during the initial stages of 
postshift training, as the effect on the 
Start-speed measure occurred later than 
with other measures. A statistically 
reliable tendency for a smaller contrast 
effect in groups not given interpolated 
extinction was reflected by the Magni- 
tude x Extinction interaction, F (1, 
32) = 6.61, p < .05. This interaction 
was expected on the basis of the spuri- 
ous Dummy X Magnitude interaction 
in start speed during acquisition. (see 
above). When performance on Post- 
shift Days 6, 7, and 8 was subjected 
to an analysis of variance, the effects 
of magnitude, F (1, 32) — 12.50; ex- 
tinction, F (1, 32) = 47.88; and the 
Acquisition X Magnitude interaction, 
F (1, 32) = 10.14, were all significant 


beyond the .01 level. These data indi- 
cated that those groups given more 
rewarded trials showed a larger de- 
pression effect, and interpolated extinc- 
tion did not reliably influence the 
amount of depression. Analyses of 
performance on Days 8 and 9 indicated 
that although a significant depression 
effect was still occurring, there was 
no effect of number of rewarded trials 
or of interpolated extinction (see 
Table 1). 

Postshift running speed.—The run- 
ning-speed data indicated that the 
amount of depression was increased 
with increasing number of rewarded 
trials but did not vary as a function 
of interpolated extinction. As shown 
in Table 1, the depression effect oc- 
curred on Days 1 and 2 and lasted 
through Days 8 and 9, with a signifi- 
cant Acquisition X Magnitude inter- 
action (greater depression with in- 
creasing number of rewarded trials) 
in both of these comparisons. The 
apparent effect of interpolated extinc- 
tion on running speed (decreased de- 
pression with interpolated nonreward 
—see Fig. 5) during Days 3, 4, and 
5 of postshift was checked by analysis 
of variance. The main effects of ex- 
tinction, F (1, 32) = 1230, p < 01, 
and magnitude, F (1, 32) = 6.33, t 
X 03, were significant. However, 
none of the interactions, including the 
interaction of magnitude and extinc- 
tion, F (1, 32) — 3.05, p» .05, were 
significant. These data indicated a 
slight, though still nonsignificant, tend- 
ency for interpolated extinction to 
lessen the depression effect with the 
Tunning-speed measure on Day 4—the 
point, according to ad hoc inspection, 
at which the apparent effect of interpo- 
lated extinction was greatest. 

Postshift goal speed.—Each of the 
main effects and the Magnitude x Ac- 
quisition interaction were significant 
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on Days 1 and 2 (see Table 1). The 
Magnitude X Acquisition interaction 
once again indicated that the greater 
number of acquisition trials led to 
greater depression. Analysis of Days 
3 and 4 indicated that all main effects 
were significant : acquisition, F (1, 32) 
= 12.58; extinction, F (1, 32) = 
15.96; p < .01; and magnitude, F (1, 
32) 2741, p < 05. On Day 5 all 
of the main effects remained significant, 
p<.05: Acquisition, F (1, 32) = 
5.81; extinction, F (1, 32) = 4.28; and 
magnitude, F (1, 32) = 4.89. None of 
the interactions were significant on 
Days 3 and 4 or Day 5. On Days 8 
and 9 only main effects of acquisition 
and magnitude were significant as 
shown in Table 1. 


Discussion 


On the surface, at least, it is remark- 
able that extensive extinction training 
had little or no differential effect on the 
consequences of previous reward magni- 
tude. Even though prior reward mag- 
nitude did not differentiate the groups at 
the end of extinction training, their post- 
extinction performance reflected com- 
pletely the effect of the original rewards. 
There are two related implications of 
this fact, and both concern the represen- 
tational response of S for the conditions 
of reinforcement. First, and most basic, 
is the matter of what was extinguished 
if not the representational response (say, 
yj) of the original reward magnitude. 
Second, since 7,-s, is apparently not ex- 
tinguished, this fact calls into question 
the related explanatory mechanism rj-s; 
(see Amsel, 1962), and this system has 
been perhaps our most useful and used 
explanatory device concerning effects of 
certain reinforcement conditions. 

It is true that the finding of faster 
extinction following large reward or 
greater number of acquisition trials is 
consistent with the frustration interpre- 
tation of response decrement in extinction 
(cf, Amsel, 1962; Birch, 1961). But the 
fact that the interpolated (frustration) 


experience of extinction does not sub- 
stantially reduce the depression effect 
cannot be readily integrated within a 
frustration-oriented explanation of the 
depression effect. It could be argued, of 
course, that there is really little or no 
empirical evidence that experience with 
frustration does reduce subsequent frus- 
tration responses (cf, Amsel & Penick, 
1962). Still, it is easy to maintain that 
the interpolated extinction should have 
tended to extinguish 7;, or at least should 
have markedly reduced the differential 
between the 7, (and therefore subsequent 
7j-5;) of Ss previously given large reward 
vs. Ss previously given small reward. 
It does not seem likely that the different 
rgs would be reduced during extinction 
by the same absolute amounts in the 
large- and the small-reward groups. 

Therefore, as stated above, the central 
problem is that some differential repre- 
sentation of preshift reward magnitude 
must survive extinction training. Other- 
wise it would be difficult to attribute 
effects to the differential reward magni- 
tudes in preshift. 

There are certain data (e. g., D'Amato 
& Jagoda, 1960) which have demonstrated 
that reintroduction of reward subsequent 
to extinction serves to reinstate, at least 
temporarily, behavior which formerly had 
accompanied the presence of reward. 
Now it would be difficult indeed to main- 
tain that the difference in the 7;'s would 
be reinstated to the degree that had ob- 
tained prior to extinction, particularly 
when 25 rewarded trials are followed by 
50 nonrewarded trials. But this would 
appear to be necessary if these data are 
to be interpreted in terms of frustration 
theory. On the other hand, if these rein- 
stated behaviors included certain percep- 
tual factors such as anchoring points on 
S’s scale of judgments (which may not be 
subject to extinction per se during the in- 
terpolated nonrewarded trials), a percep- 
tual basis for the depression effect could 
be provided. Collier and Marx (1959) 
have proposed an account of contrast ef- 
fects (e.g. the depression effect) which 
includes the notion that reward has the 
properties of a sensory scale, Conse- 
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quently, S’s estimation of reward magni- 
tude may vary as a function of previous 
reward values experienced in this situa- 
tion which act as anchoring stimuli. Seen 
in this light, at least, such an interpreta- 
tion may be more successful than frustra- 
tion-based concepts in accounting for cer- 
tain phenomena accompanying shifts in 
the conditions of reinforcement, Such a 
system would, on the other hand, lack the 
carefully developed history of the frustra- 
tion concepts, and perhaps also the latter's 
broader usefulness, The data may de- 
mand the use of both kinds of explanatory 
constructs, 

The finding of a positive relationship 
between number of rewarded preshift 
trials and strength of the depression effect 
is consistent with the analysis of Birch 
(1964) which has suggested that the ef- 
fect of a shift in reward magnitude should 
be greater the greater the number of re- 
warded preshift trials (within the range 
employed in the present experiment). 

The results of these experiments also 
indicate that there is a combined temporal 
and spatial gradient of shift effects in the 
running situation. The depression effect 
occurred earliest in the region of the goal 
box, but eventually appeared nearer the 
start box while decaying at the goal box, 
This relationship, which has appeared in 
several additional experiments by the third 
author, is such that if one were to take a 
simple measure of total Tunway speed, the 
effect might not appear at all. Further 
investigation of the role of variables 
which determine this gradient will help to 


clarify the implications this phenomenon 
has for earlier reports which have failed 
to demonstrate contrast effects, 


REFERENCES 


AMSEL, A. Frustrative nonreward in partial 
reinforcement and discrimination learning: 
Some recent history and a theoretical ex- 
tension, Psychol. Rev., 1962, 69, 306-328. 

AMSEL, A., & Penick, E. C. The influence 
of early experience of the frustration ef- 
fect. J. exp. Psychol., 1962, 63, 167-176. 

Bicu, D. A motivational interpretation of 
extinction, In M. R. Jones (Ed.), Ne- 
braska symposium on motivation: 1961. 
Lincoln: Univer. Nebraska Press, 1961. 

Brc, D. Extended training and incentive 
shifts. Paper read at Psychonomic So- 
ciety, October 1964, 

Bower, G. H. A contrast effect in differ- 
ential conditioning. J. exp. Psychol., 1961, 
62, 196-200. 

Bower, B. H. The influence of graded re- 
ductions in reward and prior frustrating 
events upon the magnitude of the frustra- 
tion effect. J. comp. physiol. Psychol., 
1962, 55, 582-587. x 

Corer, G, & Marx, M. H. Changes in 
performance as a function of shifts in the 
magnitude of reinforcement. J. exp. Psy- 
chol., 1959, 57, 305-309. n 

Cresrr, L. P. Quantitative variation of in- 
centive and performance in the white rat. 
Amer. J. Psychol, 1942, 55, 467-517. 

D'Amato, M. R., & Jacopa, H. Effects of 
extinction trials on discrimination reversal. 
J. exp. Psychol., 1960, 59, 254-260. 

Locan, F. A. Incentive. New Haven: Yale 
Univer. Press, 1960, 


(Received March 17, 1965) 


Journal of Experimental Psychology 
1966, Vol. 72, No. 1, 61-67 


UNLEARNING IN TWO DIRECTIONS* 


ISABEL M. BIRNBAUM ? 
University of California, Berkeley 


The purpose of the study was to determine the relative reduction in 
the strength of A-B and B-A associations after learning which con- 
formed to the A-C paradigm. For original learning, highly available 
terms were used in a bidirectional list of paired associates (A-B and 
B-A). At the end of interpolated learning for the experimental groups, 
or a "filler task" for the control group, the strength of A-B and B-A 
was measured in terms of the number recalled and the latency of recall 
on an unpaced test. The original associations in 2 directions, A-B 
and B-A, showed significant reductions in strength after A-C learning. 
There was no evidence for a differential reduction in the strength of 


A-B and B-A, 


After learning a list of paired 
associates, S may be shown each stimu- 
lus and asked to recall the appropriate 
response (S-R, forward recall) or S 
may be shown each response and asked 
to recall the appropriate stimulus (R-S, 
backward recall). There have been 
several recent investigations of the ex- 
tent to which R-S associations are 
formed when the task requires only 
the learning of S-R associations. When 
recall of the R-S association involves 
recall of a number or a color (items 
which are readily available for recall), 
there is little or no difference between 
S-R and R-S recall (Houston, 1964a ; 
Richardson, 1960). When the S. term 
is made available by serving as a re- 
sponse for another pair in the list 
(Horowitz, Brown, & Weissbluth, 
1964) there is also evidence for the 
equivalence of S-R and R-S associa- 
tions. These results represent one class 
of evidence which supports the hy- 

1 This research was supported by a Public 
Health Service Postdoctoral Fellowship 
(MH 13,474) from the National Institute of 
Mental Health. The research was carried 
out at the Institute of Human Learning 
which is supported by grants from the Na- 
tional Science Foundation and the National 
Institutes of Health. 

2Now at the University of California, 
Irvine. 


61 


pothesis (Asch & Ebenholtz, 1962) 
that the association between two items 
is symmetrical, but may exhibit greater 
strength in one direction (eg., Jantz 
& Underwood, 1958) if one of the 
items is more available than the other. 

The concept of associative symmetry 
also implies that a reduction in the 
strength of an association in one direc- 
tion (e.g., S-R) should be accompanied 
by an equivalent reduction of strength 
in the other direction (R-S). There 
has been no direct test of this impli- 
cation, although several studies have 
some bearing on the question. The 
learning of two successive lists which 
conform to the A-B, A-C paradigm 
leads to the unlearning of A-B (Barnes 
& Underwood, 1959; McGovern, 
1964). In contrast, Keppel and Un- 
derwood (1962), using the McGovern 
materials, found that the strength of 
B-A was not reduced after A-B, A-C 
learning. These results suggest that 
an association in one direction, B-A, 
may remain intact while the association 
in the other direction, A-B, is weak- 
ened, However, Keppel and Under- 
wood (1962) point out that the failure 
to find a difference in B-A recall be- 
tween the experimental group (A-B, 
A-C) and the control group (A-B) 
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may be the result of overestimation of 
B-A recall in the experimental group. 
The A terms were nonsense syllables, 
and the availability of these terms may 
have been increased in the experimental 
group by the additional exposures to 
A during second-list learning; this in 
turn might increase recall of B-A. In 
another study of backward recall in the 
A-B, A-C paradigm (Houston, 1964b) 
the A terms were the numbers 1 
through 12, minimizing the possibility 
of differences in availability at recall. 
Under these conditions, there was evi- 
dence of a reduction in the strength of 
B-A after A-C learning. It appears 
then that conditions which generally 
produce unlearning of A-B also pro- 
duce at least some unlearning of B-A. 
The question still remains as to 
whether the direct extinction of A-B 
(A-B, A-C) results in an equivalent 
amount of unlearning of B-A. 
Establishing the equivalence of asso- 
ciative strength in two directions pre- 
sents certain problems of definition and 
measurement. Even if the recall of 
A-B and B-A is the same in number, 
it might be argued that this is not 
an adequate definition of equivalence. 
Take, for example, the case in which 
A terms are highly available (as might 
be the numbers 1 through 10, etry; 
Assume that S has learned each asso- 
Ciation between A and B only in one 
direction, A—B (A evokes the re- 
Sponse B). When one of the B terms 
is presented (Bp) and S must supply 
A, the following implicit chain might 
be used by S in arriving at the correct 
response: (a) List the A terms, one 
at a time; (b) Compare the B term 
which is elicited by each A term with 
By; (c) When B is the same as B,, 
respond with that A which elicited B. 
Via this route, S could produce as 
many correct recalls of B-A as A-B, 
yet the responses reflect quite different 


underlying processes. It might be ex- 
pected that this type of chaining mecha- 
nism would produce a longer latency 
of response than would the direct 
A>B connection. Longer latency in 
one direction might, of course, reflect 
processes other than the one described 
above. However, if A-B and B-A 
associations are the same in number 
and in latency, this would provide 
strong evidence for symmetrical asso- 
ciative strength. 

The purpose of the present study 
was to assess the strength of A-B and 
B-A after training in the A-B, A-C 
paradigm. A meaningful comparison 
of the magnitude of unlearning of A-B 
and B-A requires that the strength of 
A-B and B-A be the same prior to the 
experimental treatment. For this rea- 
son, Ss in the present study were ex- 
plicitly required to learn both A-B and 
B-A during original learning (OL). 
Then, after A-C learning or an un- 
related "filler task," the latency and 
accuracy of recall of A-B and B-A 
were measured. 


METHOD 


Design.—There were three groups of 16 
Ss each. In OL, all groups learned 12 
number-letter pairs to a criterion of 8/. 12 
correct. The pairs in OL were learned bi- 
directionally (A-B and B-A, see below) 
according to a procedure similar to the one 
used by Underwood and Keppel (1963). 
The two experimental groups (E,-s and 
Es-a) were then given 16 trials of interpo- 
lated learning (IL). Two interpolated lists 
Were used rather than one (8 trials on A-C 
and 8 trials on A-D), in order to provide 
substantial unlearning (Postman, 1965). For 
brevity, this will be referred to as A-C 
learning, since both lists fit the A-C para- 
digm. Instead of IL, the control group was 
given a Dattern-matching task. At the end 
of IL (or pattern matching), an unpaced 
test of recall of either A-B or B-A was 
given. : 
„Since the pairs in OL were learned bi- 
directionally, the distinction between A and 

terms was provided by arbitrarily desig- 
nating one set of items, e.g., numbers, as 
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A terms and the other set, letters, as B 
terms. For half of the Ss this was re- 
versed, i.e, As were letters and Bs were 
numbers. The procedure for the experimen- 
tal subgroups in which As were numbers is 
outlined in Table 1, with the pair two-G as 
an example. As shown in Table 1 the 
experimental groups Ea-s and En-a are dis- 
tinguished only at the recall stage. For 
Ea», the A term was presented and S was 
to supply B; for Es-a the B term was pre- 
sented and S was to supply A. For the 
control group, there is no distinction be- 
tween A-B and B-A recall. Half of the Ss 
in the control group were simply given 
number-letter recall, and half were given 
letter-number recall. 

Materials —The A and B items were the 
numbers 1 through 12 (written out as 
words) and the letters A through L. Two 
different pairings of the numbers and letters 
were used to form two lists for OL, and 
half of the Ss in each group learned each 
list. There were four different serial orders 
of presenting the pairs, and bidirectional 
learning was arranged as follows. For a 
particular pair in OL, in two of the orders 
the number was shown first and S had to 
anticipate the letter, while in the other two 
orders the letter was shown first and S' had 
to anticipate the number. For a particular 
trial in OL (one presentation of all 12 
pairs), S had to anticipate the letter for 
half of the pairs and the number for the 
other half. 

Two sets (C and D) of 12 two-syllable 
adjectives were used as responses on the 
interpolated lists (A-C and A-D). The 
adjectives were apparently unrelated in 
meaning, and within each set of 12 adjectives 
there was no duplication of first letters. The 
construction of the IL lists may be illus- 
trated with a single pair from OL, for exam- 
ple, G-two. One adjective (local) was se- 
lected at random from Set C and one 
(faithful) from Set D, with the restriction 
that neither of the adjectives began with 
G or T. Then, for experimental subgroups 
in which A was defined as a letter, the IL 
pairs to be learned were G-local and G- 
faithful. For subgroups in which A was 
defined as a number, the IL pairs were two- 
local and two-faithful. There were four 
different serial orders of presenting the pairs, 
and the four orders were used equally often 
as starting orders in IL. 

Procedure.—The lists were presented on a 
Stowe memory drum at a 2:2 sec. rate with 
a 4-sec. intertrial interval. All Ss were 
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TABLE 1 


EXAMPLE OF PROCEDURE FOR 
EXPERIMENTAL GROUPS 


Original Pera 
Learn- | Interpolated Learning 

ae Group 
AB | AC. AD En al EnaA 
two-G s 
G-twa two-local | two-faithful | two-? | G-? 


Note,—Subgroups in which the As = numbers. 


taken to a criterion of 8/12 correct responsés 
on a single trial The Ss were fully in- 
structed about the bidirectionality of the lists 
in OL. Upon reaching criterion, experi- 
mental S's were given instructions and began 
A-C learning (unidirectional). After 8 
trials on A-C, S's were further instructed and 
continued IL on List A-D. Control Ss 
worked on a pattern-matching task for the 
same period of time spent by experimental 
Ss on IL. The List-1 recall test for control 
and experimental groups began 20 min, after 
the end of OL. Instructions just prior to 
the recall test included a reminder about the 
set of numbers and letters used in OL, to 
insure complete availability of all terms. The 
test was unidirectional, ie. a particular $ 
was presented with either the 12 numbers 
or the 12 letters from OL, and was asked to 
supply the second member of the pair. The 
S was not shown the correct responses 
during the recall test. The items were pre- 
sented one at a time, and S was given un- 
limited time in which to respond to each 
item. When S indicated that he was ready 
to go on, the drum was advanced to the 
next item. The order of presentation of the 
items was the reverse of the order in which 
those pairs would have occurred on the next 
trial of OL. 

A tape recorder was located behind the 
memory drum, out of S's sight, and a 
recording was made of the recall test. After 
the experimental session was over, the tape 
was played back and a stopwatch was used 
to measure response latency. Immediately 
preceding the presentation of each item, the 
advance of the drum produced a distinctive 
click and the watch was started at this point. 
The watch was stopped as soon as S re- 
sponded with the correct item. Occasionally, 
Ss corrected themselves before going on to 
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the next item, and in this case latency was 
still defined as the time from the advance of 
the drum to the correct response. If S did 
not respond, or gave only an incorrect re- 
sponse, this was noted. 

Subjects—The Ss were 48 undergraduate 
students at the University of California who 
were paid for their participation in the 
experiment, The Ss were assigned to groups 
in blocks of three, with each block containing 
one S in each of the three groups. The 
order of running groups within blocks, and 
the assignment of lists, starting orders, and 
type of item presented on the recall test 
(numbers or letters) were determined by a 
table of random numbers (subject to the 
restrictions mentioned in the Design and 
Materials sections.) 


RESULTS AND Discussion 


Original and interpolated learning.— 
The mean number of trials to a cri- 
terion of 8/12 in OL was 10.4, A 
two-way analysis of variance on trials 
to criterion indicates that the groups 
were of comparable ability on this task. 
Neither of the main effects (Groups; 
Numbers vs. letters at recall) was sig- 
nificant (Fs < 1.00), nor was the in- 
teraction (F < 1.00). The difference 
between E, and Ega in number of 
correct responses during A-C and A-D 
learning was not significant (Fs < 
1.00). 

Recall: number correct. —The mean 
number of responses given correctly on 
the recall test by the control group was 
1088. E, gave 844 Correct re- 
sponses, and Ep, gave 9.18 Correct 
responses. The overall difference be- 
tween groups is significant, F (2, 42) 
= 6.54, p < 005. There was no sig- 
nificant effect of type of item (numbers 
or letters) at recall, and the interaction 
of Type X Groups was not significant 
(?'s > 25). Orthogonal comparisons 
indicate that there is no significant dif- 
ference between E4 5 and Epa, F (1, 
42) = 118, p > 25, while there is clear 
evidence of unlearning of first-list as- 
sociations, F (1, 42) = 11.89, p < .005 
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(Control vs. E,_» and Ej 4 combined),” 
Thus, on the basis of correct first-list 
responses at recall, it must be concluded 
that associations in both directions are 


1 
| 
| 


equally unlearned, although the explicit | 


paradigm of interference involves only 
one of the two terms from the original 
pair. It is reasonable to assume that 
the observed unlearning is not attribut- 
able to IL irrespective of paradigm, 
since the results of McGovern (1964) 
indicate that there is no specific associa- 
tive loss when the interpolated terms 
are unrelated to first-list terms (A-B, 
C-D). That is, when the A and B 
terms are completely available after 
C-D learning (on a matching task) 
there is no retroactive inhibition (RI). 

Recall: latency.—As pointed out 
earlier, B-A recall may be mediated by 
A-B recall or vice versa, A more 
stringent test of the symmetry of un- 
learning in the A-B, A-C paradigm is 
provided by a comparison of the latency 
of recall of A-B and B-A. In addition, 
it is of interest to determine the effects 
of A-C learning on the latency of A-B 
associations which have survived the 
extinction procedure. The fact that 
A-B recall on an unpaced test is re- 
duced after A-C learning has been well 
established (e.g, McGovern, 1964; 
Postman, Keppel, & Stark, 1965), and 
has been repeated in the present study. 
It might be expected that the effects of 
A-C learning would also be evident in 
the latency of recall of those associa- 
tions which are still available at the end 
of IL. 

In order to provide a latency measure 
which is independent of number fe 
called, only the latency of those re- 
sponses which were recalled correctly 
was used, This measure may be influ- 
enced by systematic differences between 
groups in the type of item which is re- 
called. The possible effects of such 
response selection will be shown fo 
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work in a direction opposed to the 
results which were obtained. In addi- 
tion, for a subset of items on which 
there was virtually no selection, the 
differences between groups in latency 
of response gave essentially the same 
results as the overall analysis (both 
analyses are reported below). 

, Since control group recall is almost 
perfect (10.88 out of 12), the responses 
which were recalled by control Ss may 
include many items which were less 
well learned than were the items re- 
called in the experimental groups. That 
is, responses which were not given cor- 
rectly at recall by experimental Ss, and 
for which no latency measure was ob- 
tained, are likely to be those responses 
which were generally less well learned 
to begin with. A test of this possibility 
was made using, for each S, the mean 
rank of those items recalled (rank was 
based on number of times the response 
was given correctly in OL). As ex- 
pected, there was a significant differ- 
ence between the three groups in the 
rank of items recalled, H(2) = 10.08, 
p<.01 (H is based on a Kruskal- 
Wallis test, Siegel, 1956), with the 
control group showing the lowest mean 
rank (less well-learned items). Clearly, 
the latency measure across the three 
groups is not being made on items of 
the same average strength. The next 
question is, is strength correlated with 
latency in this situation? For each con- 
trol S, the items were sorted into two 
categories on the basis of correct an- 
ticipations in OL: strong items and 
weak items (the four items with the 
highest and the four items with the 
lowest number of correct anticipations 
in OL). A sign test (Siegel, 1956) 
on the median latency of recall of these 
two sets of items indicates that strong 
items have a significantly shorter la- 
tency than do weak items (p < .002). 
Since the items which are recalled in 


the experimental groups are (on the 
average) stronger than the items re- 
called in the control group, there is a 
bias in the direction of a shorter latency 
of response in the experimental groups 
than in the control group. However, if 
factors such as unlearning or competi- 
tion from IL influence the latency of 
first-list responses, a shorter latency of 
response would be expected in the con- 
trol group. 

The results indicate that the effects 
of IL on response latency are strong 
enough to overcome any initial bias 
from differences between groups in the 
original strength of the items recalled. 
Considering all of the items recalled by 
each S, the control group has a signifi- 
cantly shorter latency of recall (median 
latency = 2.50 sec.) than either of the 
experimental groups (median latency 
= 3.55 and 3.65 for E,-p and Epa, 
respectively), H(2) = 10,50, p < .01. 
An analysis of variance on reciprocal 
median latency gives the same results, 
F (2, 45) = 622, p < 01, with Ex» 
and Eg 4 having almost identical mean 
scores (.28 and .27, respectively, and 
40 in the control group). 

An analysis was also done on a sub- 
set of items for which there was no 
selection at recall. For each S, the four 
strongest items were identified (on the 
basis of number of correct anticipations 
inOL). Almost all Ss recalled all four 
of these items correctly, with a few Ss 
recalling three out of four (one S in 
the control group; two Ss in Ey», and 
three Ss in Ep.a). Thus, the median 
latency of only the four strongest items 
gives a measure across all three groups 
which is virtually uninfluenced by re- 
sponse selection. On this measure, 
there is again a significant difference 
between groups, H(2) = 84, p < .02, 
with the control group showing the 
shortest median latency (2.2) in com- 
parison with Eas (2.6) and Epa 


66 ISABEL M. BIRNBAUM 


(28). A Mann-Whitney U test in- 
dicates that the difference between 
Ex,» and Eg 4 is not significant (p > 
10). 


Both sets of results show that the learn- 
ing of A-C and A-D lists leads to ap- 
proximately equivalent increases in the 
latency of A-B and B-A recall. This 
suggests that recall in one of the two 
directions is not mediated by the associa- 
tion in the opposite direction, The in- 
creases in latency of recall in the experi- 
mental groups may be explained in terms 
of competition or unlearning, factors 
which have been postulated to account for 
RI on paced recall tests (Postman, 1961). 
There is no simple way of separating the 
effects of competition from the effects of 
unlearning on response latency. Com- 
petition at recall refers to the elicitation 
of second-list (and third-list) responses 
which leads to a delay in the emission of 
first-list responses. If the increases in 
latency were attributable to the occur- 
rence of implicit responses from IL, it 
might be expected that A-B would be 
slower than B-A at recall, After A-C 
learning, when the A term is presented 
it should elicit the competing responses 
from IL more strongly than does the B 
term, which has not been paired with new 
responses in IL, 


| complete 
unlearning refers here to the failure of 


ing (Postman, 
According to this analysis, when IL con- 


tinguished as a consequence of its unrein- 
forced elicitation. Presumably, some of 
the A-B associations may be weakened, 
although not completely lost, as a conse- 
quence of the extinction procedure. If an 
increase in latency reflects partial unlearn- 


ing, then the present results indicate th 
partial unlearning is equivalent for A-B 
and B-A. This interpretation is com 
sistent with the finding that complete un- 
learning (reduction in number recall 
is of the same magnitude for A-B 
B-A. 


of A-B unlearning presented by Po: 
et al. (1965) to the unlearning of B-Ay 
The elicitation of B during IL may lead 
to the occurrence of B-A, and to the exe 
tinction of B-A, However, during AC 
learning the occurrence of B-A may b 
less likely than the occurrence of A-B. f 
B occurs during IL, it is generally covet 
and may be a less effective stimulus han 
is A, which is presented repeatedly during 
IL. In addition, since A is presented 
during each anticipation interval in I 
there is more time for the occurrence of 
A-B than of B-A. Of course, the possib 
ity that there is equivalent elicitation of 
associations in both directions cannot bë 
ruled out on the basis of speculation: 
There is clearly a need for the specifica 
tion of variables which control the occur 
rence of A-B and B-A during IL. | 

The present results may be taken a 
Support for the principle of associative 
symmetry (Asch & Ebenholtz, 1962) 
since the data on number and laten 
recall point to an equivalence in the Te 
duction of associative strength in two di 
rections during A-C learning. Howevely 
since the occurrence of A-B and B-A dur. 
ing IL have not yet been brought unde 1 
experimental control, it is clear that the) 
question of "symmetry of unlearning" f6- 
quires further investigation. It will als? 
be of interest to discover whether sym- 
metrical unlearning is found when OL f$ 
unidirectional, and when IL conforms t0 
paradigms other than A-C. 
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EFFECT OF SURROUND SIZE ON THE PERCEPTION 


OF TEXTURE PATTERNS' 


JACOB BECK 


Harvard University 


Halftone reproductions can be used to embed symbols in a checkerboard 
pattern so that the symbols cannot be readily identified. 3 experiments 
studied the legibility of embedded symbols as a function of surround 
size. Over a range of experimental conditions legibility was found to 
improve with increasing surround size. The form of the relationship 
varies with the size of the symbol, the exposure time, and the way in 
which the symbol is embedded. Phenomenally, as the size of the sur- 
round increases, those elements of the halftone pattern which can be 
assimilated to the surround are partialed out to form a single frame- 
work in relation to which the symbol is seen. The results are discussed 


in connection with the effect of 


organization, 


Texture refers to the distribution of 
element shapes in a total pattern. The 
perception of relatively segregated units 
within a texture pattern depends upon 
breaking the pattern into components 
each of which in some way is internally 
uniform (Wertheimer, 1923). Not all 
discriminable texture differences, how- 
ever, produce the perception of sep- 
arate regions with distinct boundaries 
(Julesz, 1962). The present experi- 
ments deal with the effect of the sur- 
round on figure-ground organization 
within a texture pattern. 

Halftone reproductions can be used 
to investigate the influence of surround 
size on the organization of a texture 
pattern. In Fig. 1, A, B, and C show 
the texture patterns generated by half- 
toning the letter P on backgrounds of 
varying size. The contrast between 
the discriminability of the P in ASHBY 
and C is most striking when viewing 
distance is increased. In C, the P is 
seen as a figure on an approximately 

1 This research was supported by National 
Science Foundation Grants GB-94 and GB- 
1521 to the Psychological Laboratories, Har- 
vard University, and by a Grant from the 


Carnegie Corporation to the Center for 
Cognitive Studies, Harvard University. 
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the surround on figure-ground 


uniform checkerboard ground, while im 
A only an irregular pattern is seen: 
This illustrates that a surround may 
establish a framework with respect tod} 
which the elements of a texture pattern | 
are unified. That is, those elements of 
Pattern A which can be assimilated to 
the surround are partialed out in Pat 
tern C to form a single ground in rela- 
tion to which the letter is seen. j 
The purpose of the present experi 
ments was to carry out parametri 
studies of the effect of the surroun 
on figure-ground organization. The 
legibility of the embedded symbols pro- 
vides an objective though indirect in- 
dex of the effect of the surround. Three 
related. experiments were performedi 
which examined symbol legibility a5 k 
function of surround size, Experimen 
I varied the visual angle subtended by 
the stimulus. Experiment II aus 
both the visual angle and the stimulus | 
exposure time, Experiment III held 
the visual angle constant and varie | 
both the exposure duration and the " | 
of embedding the symbol. Three con- 
trol experiments were also conducted t0 | 
evaluate the significance of the avera£ | 
luminance of the surround, the conne 
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Fic. 1. 


Sample displays used in experi- 
ments. (A, B, and C illustrate the effects of 
the size of the surround on the visibility 
of the letter P. A shows the letter P with 
a 0 surround, B with a 2 surround, and 
C with a 6 surround. D shows the letter 
P with a 6 surround and with the adjacent 
squares to the symbol blacked out. E, F, 
and G show the letter P when 0, 2, and 
4 of the squares in the surround have been 
enlarged. The size of the total surround is 
6 checkerboard squares.) 


tivity of the surround, and the physical 
size of the stimulus. 


EXPERIMENT I 
Method 


Stimuli—The stimuli were generated by 
halftoning the 35 alpha-numeric symbols (ex- 
cluding the numeral 0) on a gray back- 
ground. The discriminability of the symbols 
was reduced in two ways. First, the half- 
tone negatives were exposed to a safe light 
for approximately 15 sec., causing the con- 
tours of the symbols to run into neighboring 
squares, Secondly, the gray cardboard in 
the immediate area of the symbol was dark- 
ened. This resulted in enlarging and inter- 
connecting the black checkerboard squares 
in this region and had the effect of intro- 
ducing irrelevant textural variations. 

Five magnifications of the stimuli were 
prepared. The symbols measured .1, .13, 
-18, .225, and .5 in. in height, and subtended 
visual angles of approximately .24°, .31° 
43°, 54°, and 1.2°. The effects of six dif- 
ferent surround sizes were studied. Sets of 
175 symbols (5 magnifications of the 35 
alpha-numeric symbols) with 0, 2, 4, 6, 8, 


and 12 rows and columns of checkerboard 
squares surrounding the halftone patterns ob- 
tained from the symbols were prepared. The 
letter P .5 in. in height with a 0, 2, and 
6 checkerboard surround, is shown in A, B, 
and C in Fig. 1. The number of rows and 
columns in the surround were counted from 
the first rows and columns of squares that 
were not in contact with the symbol. Thus, 
the surround included the enlarged squares 
immediately surrounding the symbol. The 
number of enlarged squares in the surround 
differed on each side of a symbol and was 
not constant from symbol to symbol; on the 
average, they covered two rows and columns 
of the surround. 

Observers.—A total of 90 Os served in 
Exp. I. Fifteen Os were run with each of 
the six surrounds. The Os were Harvard 
undergraduates who were paid to partici- 
pate and were naive as to the purpose of 
the experiment. 

Procedure.—The entire set of 175 symbols 
with a given surround was presented suc- 
cessively by means of a Gerbrands tachisto- 
scope in a single experimental session. The 
Os were told that (a) they would be pre- 
sented with English capital letters and nu- 
merals 1 to 9 in their normal position; (b) 
their task was to report the symbol pre- 
sented; and (c) there would be 175 trials 
so that they might expect to see each symbol 
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Fic. 2, Recognition curves for Exp. I. 
(Data for six surround sizes and five visual 
angles. Each point represents 525 judg- 
ments: 15 Os X 35 symbols.) 
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more than once. Each stimulus was exposed 
for 100 msec. On each trial, Os’ responses 
were recorded by E and the next stimulus 
was presented after a ready signal. The Os 
were not told of the correctness of their 
responses. The order of the symbols and 
their magnifications was randomized with 
the constraint that an equal number of sym- 
bols of each magnification would appear in 
every block of 35 trials. Five blocks of 
trials were presented. Three Os were 
started at the beginning of each of the five 
blocks, ie, with Stimuli 1, 36, 71, 106, and 
141. The remaining stimuli were then pre- 
sented in the predetermined random order. 
Rests were given after one half of the trials 
or as Os requested. In order to familiarize 
Os with the task, they were initially pre- 
sented with five sample stimuli of a size not 
used in the experiment. 


Results 


Figure 2 presents the percentage of 
correct responses averaged over Os and 
symbols for the six surrounds as a func- 
tion of symbol size. Enlarging the 
symbol and the surround increases the 
number of correct recognitions. There 
is a marked Symbol Size X Surround 
Size interaction, and the amount of 


benefit derived with increasing sur- 
round varies with the size of the sym- 
bol. Beyond a certain surround and 
symbol size, legibility does not improve. 
Indeed, the results indicate that in- 
creasing the surround past 6 squares 
and the symbol past .225 in. reduces 
legibility. 

The recognition curves for individual 
symbols exhibit about the same trends 
as functions of symbol and surround 
size so that the average curves accu- 
rately reflect the effects of these vari- 
ables. The confusions Os make depend 
on the shape of the symbol and the dis- 
tortion introduced by halftoning. Thus, 
how many times a symbol is identified 
as another symbol is not a useful way 
of examining the results. There is, 
however, a certain pattern to the con- 
fusions which occur. "With a O sur- 
round, the confusions involve a greater 
number of symbols than in the case of 
a larger surround. The confusions 
with an increasing surround become re- 
stricted to the symbols that may be 


TABLE 1 
CONFUSION MATRICES FOR SYMBOLS P AND I .225 AND .5 IN. IN HEIGHT 


Surround Size 
Height 
0 2 6 8 12 
Sym PUR 
-225 in. | G42V20 TY FM FH WJ D'FN FYK FSW?B 
TCOX a LEYM 
HNZD 1 
Sin, X327 B47F WN 8G 83B 
U4WD 25UE WJRC x VERE eae 
MNVQ | 8 
Y 
PM 
.225 in. | 15H HYL LTY? 
LCE CTVJ |J i "bd 1JTL 
5 in NLR BBZ Ly 13 16L? sy? 4 
120P XTNC PETIT m T E x 
SV 3HDJ 


Note.—The superscripts refer to the number of times that a stimulus was identified as that symbol. 
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more directly obtained by leaving out, 
filling in, or altering the curvature of 
the symbol contours. Table 1 shows 
the confusions obtained for the symbols 
P and I .225 in. in height, and .5 in. in 
height for the six surrounds. With an 
8 and a 12 surround, the confusions are 
again with a larger set of symbols, in- 
dicating that Os were finding it difficult 
to segregate the symbol from the 
checkerboard background. 

There is a limited increase in ac- 
curacy with practice. A comparison of 
the first and last blocks of 35 stimuli 
presented show the same overall trends. 
For 2, 6, and 12 surrounds, the per- 
centages of correct responses were 12, 
36, and 30%, respectively, for the first 
35 stimulus presentations, and 18, 41, 
and 32%, respectively, for the last 35 
stimulus presentations. The increases 
with a 2 and a 6 surround are signifi- 
cant at the .01 level (1 tests). 


EXPERIMENT II 


Method 


Experiment II was designed to study the 
effect of stimulus exposure time. Three of 
the six surrounds employed in the previous 
study were selected, Stimuli with 2, 6, and 
12 surrounds were presented for durations 
of 500 and 50 msec. A total of 30 Os were 
run in the experiment. Five Os were run 
with each surround and exposure time. The 
apparatus and procedure were the same as 


in Exp. L 


Results” 


The percentages of correct responses, 
averaged over Os and symbols, are pre- 
sented in Fig. 3. Symbol legibility is 
a joint function of the size of the sym- 
bol, the size of the surround, and the 
exposure time. Comparison of Fig. 2 
and 3 shows that the functions obtained 
at 50 msec. are similar to those obtained 
at 100 msec. Increasing the observa- 
tion time to 500 msec., however, 
changes the shape of the functions as 
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Fic. 3. Recognition curves for Exp. II, 
(Data for three surround sizes, five visual 
angles, and two exposure durations. Each 
point represents 175 judgments: 5 Os X 35 
symbols.) 


well as increasing the accuracy of the 
responses. Itis clear that the increased 
exposure time produces the greatest 
improvement for the smaller symbol 
sizes and that improvement for the 
larger symbols is less marked. As in 
Exp. I, enlarging the symbol and the 
surround beyond certain sizes may 
either have little effect or may reduce 
the accuracy of responses. Improve- 
ment does not occur past a surround of 
6 at either 50 or 500 msec. Thus, 
though the specific relationships be- 
tween legibility and the size of the sym- 
bols and the surround are affected by 
exposure time, the results are qualita- 
tively similar for the three durations 
investigated. 


EXPERIMENT III 


As noted above, to increase the mask- 
ing of the embedded symbols irrelevant 
textural variations were introduced 
by enlarging the squares in the area of 
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the symbol in an irregular fashion. 
Observations indicated that the pattern 
of these enlarged squares affects the 
relation between the size of the sur- 
round and legibility. Consequently, this 
parameter was studied systematically. 


Method 


Three new halftone prints of each of the 
26 alphabet letters were prepared. These 
halftones differed in the number of squares 
in the immediate surround of the symbols 
which were enlarged. E, F, and G in Fig. 1 
show the letter P with 0, 2, and 4 surround 
squares enlarged. The size of the total 
surround for each of the three patterns is 
6 checkerboard squares. Zero enlarged 
squares means that none of the black squares 
outside the rectangle defined by the outer- 
most boundaries of the symbol were en- 
larged. The black squares within the rec- 
tangle containing the letter, however, were 
enlarged and did run together. The vertical 
dimensions of the symbols were always .225 
in. in height. 

Five surround sizes, 2, 4, 6, 8, and 16 
Squares and three exposure durations, 50, 
100, and 500 msec. were studied. The 
stimuli were divided into two groups: (a) 
the 208 stimuli consisting of the 26 letters 
with 2, 4, and 6 surrounds, and (b) the 156 
stimuli consisting of the letters with 8 and 
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Fic. 4. Recognition curves for Exp. III. 
(Data for three degrees of embedding, four 
surround sizes, and three exposure durations. 
Each point represents 260 judgments: 10 
Os X 26 symbols.) 


16 surrounds. The two groups were pre- 
sented in separate experimental sessions. For 
each group the order of the stimuli was 
randomized with the constraint that an equal 
number of symbols of each surround size 
and number of enlarged squares was present 
in every block of 26 letters. In all, 60 Os 
participated in the experiment. Twenty Os 
were run at each of the three exposure 
durations: 10 with the stimuli having 2, 4, 
and 6 surrounds, and 10 with the stimuli 
having 8 and 16 surrounds. The instructions 
and procedure were the same as in Exp. I. 


Results 


Figure 4 shows the percentage of 
correct responses for the 0, 2, and 4 
enlarged squares and the three ex- 
posure times as a function of surround 
size. Two overall trends are clear: 
(a) legibility increases with exposure 
time, and (b) improvement in legibility 
as a function of surround size is great- 
est at 50 msec. Figure 4 also shows 
that at 100 and 500 msec. the percent- 
age of correct curves for stimuli with 
2 enlarged squares exhibit more defi- 
nite growth with surround size, than 
those with 0 and 4 enlarged squares. 
At both 100 and 500 msec., the in- 
creases in the mean correct recognitions 
between surrounds of 2 and 16 squares 
are significantly greater for stimuli with 
2 enlarged squares than for stimuli 
with O enlarged squares (t tests, p < 
01 in both cases). At 500 msec., the 
increase in mean correct recognitions 
from a surround of 4 to a surround of 
16 squares is significantly greater for 
stimuli with 2 enlarged squares than 
tor stimuli with 4 enlarged squares 
(t test, p < .05). 

The results suggest that at 100 msec. 
and at 500 msec. 4 enlarged squares 
are already sufficient to establish a sin- 
gle ground with respect to which the 
boundaries of the embedded symbol be- 
come defined. Two enlarged squares 
are apparently not sufficient to estab- 
lish a ground and act as perturbations 
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to hide the symbol. This result is con- 
sistent with the general finding that the 
tendency for a surround to emerge as a 
ground and thereby change symbol 
legibility is a function of its size. 


CONTROL EXPERIMENTS 


Three experiments were conducted 
*o determine whether: (a) legibility as 
a function of surround size is deter- 
mined by changes in the average lumi- 
nance of the surround; (b) legibility 
as a function of symbol size is deter- 
mined by its angular size, i.e., the ratio 
of its linear size to viewing distance; 
and (c) the effectiveness of a surround 
depends on its connectivity to the 


symbol. 


1. The 26 alphabet letters .225 in. in 
height and with a 0 surround were 
placed on a gray paper having the same 
reflectance as the average reflectance 
of the checkerboard surround, 42%. 
The sizes of the gray background were 
cut to correspond to a 2 and a 16 
checkerboard surround. Two groups 
of 10 Os each were run at exposure 
times of 500 and 50 msec. The per- 
centages of correct recognitions at 500 
msec, were 17% and 21%, and at 
50 msec, 10% and 7% for the large and 
small surrounds, respectively. In con- 
trast, in Exp. III, the percentage of 
correct recognitions at 500 msec. with 
a 16 checkerboard surround was 81%, 
and with a 2 checkerboard surround 
69%; at 50 msec., 55% with a 16 
checkerboard surround, and 27% with 
a 2 checkerboard surround. Thus, it is 
the number of rows and columns of the 
checkerboard pattern and not the aver- 
age luminance of the surround that is 
crucial for enhancing legibility with in- 
creased surround. 

2. Two groups of five Os each were 
shown the embedded symbols, either 6 


or 15 in. in height, projected on a 
screen for 100 msec. The linear sizes 
of the symbols were varied by changing 
the projection distance. A neutral 
density filter was used to maintain con- 
stant luminance of the screen. The 
observation distances of the two groups 
were varied so that the symbols sub- 
tended an angle of approximately 3.4°. 
The group with the 6-in. symbols had 
31% correct responses; the group with 
the 15 in. symbols had 33% correct 
responses. This indicates that it is the 
angular size rather than the linear size 
which affects legibility of the symbols. 

3. The 26 letters, .5 in. in height, 
were presented with a 0 surround, a 
6 surround, and a 6 surround with the 
squares adjacent to the symbol blacked 
out. In D of Fig. 1 the letter P with 
the adjacent squares blacked out is 
shown. The stimuli were exposed for 
100 msec. Five Os served in the ex- 
periment. The percentage of correct 
responses for the 6 surround was 86%, 
for the O surround 41%, and for the 
6 surround with the adjacent squares 
to the symbol blacked out 24%. This 
result suggests that the effectiveness of 
a surround depends on its connectivity 
to the symbol. 


Discussion 


The experiments present ample evi- 
dence that over a range of experimental 
conditions legibility of the embedded sym- 
bols improves with surround size. The 
form of the relationship varies with ex- 
posure time, symbol size, and the way in 
which the symbol is embedded. Also, it is 
important to note that under appropriate 
conditions of presentation there is an ac- 
tual change in the perceptual organization 
of the halftone pattern. Thus, improve- 
ment does not result solely from a cogni- 
tive function, such as following the devia- 
tions which define the symbol. 

As noted, the overall effect of exposure 
duration is to increase the number of cor- 
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rect recognitions. This is in accord with 
the finding that exposure time decisively 
affects the accuracy of responses when 
perception depends on the relational char- 
acteristics of the stimulus (Kaswan & 
Young, 1963). 

The finding of an optimal symbol size 
is not surprising. The tendency for 
legibility to be an inverted U function of 
symbol size appears to have a twofold 
basis. First, as the angular size of the 
stimulus is reduced, tonal changes occur 
which cause the surround including the 
adjacent squares on either side of the 
symbol to take on a gray tone. This en- 
hances discriminability of the symbol. 
If the visual angle is made too small, 
however, the resolving power of the eye 
limits the discriminability of the relevant 
detail and recognition is again impaired. 
Secondly, there is evidence to indicate 
that there is an upper limit to the useful 
magnification of a noisy target (Beck, 
1964; Harrison & Phoenix, 1953). That 
is, the absolute sizes of the distortions 
affect the recognition of a form though 
the relative sizes of the distortions are 
kept constant, 

The most interesting result is the effect 
of surround size. As the size of the sur- 
round increases, so apparently does the 
likelihood for the emergence of a ground 
and a change in the organization of the 
pattern derived from halftoning the sym- 
bol. One interpretation of these results 
is that the visual system abstracts the 
common elements of a pattern to form a 
ground, and structures the remaining ele- 
ments relative to this ground. Johannson 
(1950) has found that in perceiving a 
pattern of movement the visual system 
abstracts those movement components 
common to the moving pattern, and the 
remaining movements are seen relative to 
this framework. Thus, in the present 
study, increasing the number of checker- 
board squares may increase the likelihood 
for a checkerboard ground to be extracted 
on the basis of the continuity and similar- 
ity of the element shapes. If the complex- 
ity of a pattern is defined by the number 
of line segments and the number of angles 
necessary to specify it (Hochberg & 


Brooks,1960), then seeing a symbol on a 
checkerboard ground conforms to an eco- 
nomical encoding. However, an econom- 
ical encoding occurs only in the instance 
when the surround is large enough, for 
in the case of a 0 surround the symbol is 
seen as consisting of a group of irregular 
lines. Thus, the results suggest that an 
economical encoding is a consequence of 
the processes which structure the pattern 
So as to be consistent with the checker- 
board surround. 

It should be noted that the present ex- 
periments do not specify the properties 
of the texture distribution that facilitate 
the establishment of a ground. The role 
of the connectivity of the surround needs 
to be more clearly defined by parametric 
investigation. Moveover, the perception 
of overlay may be generated by textural 
properties other than element shape. The 
Size of the textural elements, their separa- 
tion, the average number of black elements 
per unit area (ie. mean density) may 
provide sufficient textural uniformity for 
figure-ground segregation. Another fac- 
tor which warrants consideration are pos- 
sible brightness differences among the 
black elements in the halftone pattern. 
Due to the distribution of black and white 
elements in the pattern, slight tonal 
changes may be introduced into the lines 
that define the embedded symbol. These 
slight changes may be enhanced by in- 
creasing the number of checkerboard 
Squares in the surround. Studies have 
shown that if a target is surrounded by a 
larger annular area, the differential thresh- 
old between the two decreases as the area 
9f the annulus increases (Bartley, 1941). 
These factors are not mutually exclusive 
and the relative contribution of each in 
establishing a framework may be experi- 
mentally determined. 
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EFFECT OF INTRASTIMULUS AGREEMENT AND CONFLICT 


UPON CONDITIONED DISCRIMINATION AND REVERSAL 
OF THE EYELID RESPONSE? 


C. MICHAEL LEVY? 


University of Wisconsin 


Agreement and conflict within conditioned stimuli were manipulated by 
presenting 2 groups with the words PINK and BLUE written in ap- 
propriate and inappropriate colors, respectively. Control groups were 
presented with these words or colors. 60 differential conditioning trials 
were given to all Ss. 4 of the Ss in each group were given 40 
reversal trials; the remaining Ss were not reversed. Intra-CS conflict 
produced poor discrimination by elevating the CS— function. It was 
speculated that a cognitive interpretation might well account for the 
empirical asymmetries since most Ss met the yoluntary responder crite- 
rion and could verbalize many of the crucial experimental arrangements. 


A great deal of recent Soviet research 
bears upon the interaction of Pavlov’s 
hypothesized first (sensory-perceptual- 
motor) and second (language) signal- 
ing systems of the central nervous sys- 
tem (Pavlov, 1938, pp. 644-655; 
United States Department of Com- 
merce, 1960). Most of the Russian 
investigators have employed situations 
in which there was substantial con- 
gruence between the verbal and non- 
verbal aspects of the stimuli (eg., 
Seredina, 1960; Smolenskaya, 1934; 
Traugott, 1934). With the exception 
of El'kin (1957), few Soviet psychol- 
ogists have systematically manipulated 
conflict between the two signaling sys- 
tems. El'kin found, for example, that 
a bulb-pressing CR was easier to obtain 
to the CS, “It is raining," when that 


1 This investigation was supported in part 
by National Institutes of Health Grant MH- 
06792 to David A. Grant and in part by funds 
available to the author under a National Sci- 
ence Foundation Predoctoral fellowship. This 
paper is based upon a PhD dissertation sub- 
mitted to the University of Wisconsin. The 
author is indebted to his major professor, 
David A. Grant, for his generously given 
advice and support. 

? Now at the University of Florida. 
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state of the weather prevailed than 
when it did not. 

The purpose of the present investiga- 
tion was to compare conditioned dis- 
crimination between complex condi- 
tioned stimuli whose verbal and non- 
verbal components were congruent to 
those whose components were dispa- 
rate. In Pavlovian terminology, agree- 
ment and conflict between the first and 
second signaling systems were manip- 
ulated. Since conditioned discrimina- 
tion involving the eyelid response is 
influenced by verbal factors (eg» 
Hilgard, Campbell, & Sears, 1937, 
1938) this procedure seemed particu- 
larly well suited to the study of inter- 
actions between the signaling systems. 
A condition of intrastimulus agreement 
was achieved by exposing color names 
in appropriate hues—e.g. the word 
BLUE was written in blue letters. In- 
trastimulus conflict was established by 
presenting color names in inappropriate 
hues—e.g., BLUE appeared in pink let- 
ters (Stroop, 1935). It was hoped that 
sizable differences in conditioned dis- 
crimination would be observed by mak- 
ing the conditioned stimuli symmetrical 
—i.., by presenting BLUE and PINK in 
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blue and pink letters, respectively, for 
the agreement condition, and in pink 
and blue letters, respectively, for the 
conflict condition. 

The present study was also conducted 
to provide more information about 
asymmetries in discrimination reversal 
summarized by Ross (1962). North 
(1964) recently attempted unsuccess- 
fully to replicate Levy, Grant, and 
Clark’s (1964) finding of asymmetric 
negative-positive and positive-negative 
reversal of the eyelid response. North 
concluded that her failure to detect pos- 
sible differences in rate and extent of 
reversal may have been due to a large 
between-S variance or a preponderance 
of voluntary responders (Hartman & 
Ross, 1961). 


METHOD 


Apparatus—The S sat in a soundproofed 
room facing into a rectangular box whose 
interior was illuminated to a level of 0.7- 
1.0 mL. At the rear of the enclosure was 
a 45X30 cm. ground-glass screen located 
about 80 cm. from S at his eye level. The 
conditioned stimuli were projected onto the 
rear of this screen by a Dunning Animatic 
projector located in E's room. 

Conditioned stimuli were the words PINK 
and BLUE or were pink and blue colored 
circles appearing upon a black background, 
The circles had a projected diameter of 15 
cm.; the letters were about 11 cm. high. The 
intensity of the stimuli were 0.9, 0.7, and 
12 mL. respectively, for the pink, blue, and 
white colors. Between presentations of the 
conditioned stimuli the screen remained dark. 

The word conditioned stimuli lasted 1200 
msec.; the circle, 1000 msec. Rather than 
use identical CS-UCS intervals for all 
groups, thus equalizing opportunities to re- 
spond, it was ventured that fewer interpretive 
difficulties would arise if optimal intervals for 
conditioned discrimination were employed. 
These intervals were 800 msec. for circles 
(Hartman & Grant, 1962a) and 1000 msec. 
for words. The UCS, a corneal puff of 
nitrogen, lasted 200 msec., and its intensity 
was regulated at the source by a 150-mm. 
column of mercury. The CS, UCS, and CS- 


3 Beeman and Grant, personal communica- 
tion, 1964. 


UCS intervals were controlled by electronic 
timers. The intertrial interval averaged 25 
sec., varying unsystematically between 15 and 
35 sec, and was controlled by a Western 
Union tape transmitter. Eyelid responses 
were detected by a microtorque potentiometer 
fastened to S’s eyelid in the manner described 
by Hartman and Grant (1962a). Amplified 
responses and their first derivatives were 
recorded on an ink-writing polygraph. 

Procedure—Upon entering the experimen- 
tal room, the A-O Pseudoisochromatic Test 
for color blindness was administered to S. 
The E then attached a headband supporting 
the eyelid response pickup apparatus and air- 
puff delivery tube, and left S alone. Tape- 
recorded neutral instructions were presented 
cautioning S to let his “natural eyelid reac- 
tions take care of themselves.” 

Properties of the conditioned stimuli used 
for differential conditioning and reversal de- 
termined the assignment of group names: 

Color group: Conditioned stimuli were 
pink and blue colored circles appearing on 
a black background, 

Word group: Conditioned stimuli were 
the words PINK and BLUE appearing in white 
letters on a black background. 

Agree group: Conditioned stimuli were as 
in the Word group except that PINK and 
BLUE appeared in pink and blue letters, 
respectively. 

Conflict group: Conditioned stimuli were 
as in the Word group except that PINK ap- 
peared in blue letters and BLUE appeared in 
pink letters. 

All Ss were given 60 differential condi- 
tioning trials in which only CS+ was con- 
sistently reinforced. Half of the Ss con- 
tinued under this regimen for 40 additional 
trials (no reversal control S's) ; the remain- 
ing Ss were given 40 reversal trials in which 
the former CS+ was no longer followed by 
an air puff, and the previous CS— was con- 
sistently reinforced. In each series of 10 
trials, five CS+ and five CS— were presented 
in a random order. Following conditioning 
and reversal, S responded to a questionnaire 
designed to determine his awareness of 
stimulus contingencies, knowledge of classical 
conditioning, etc. 

Subjects—The Ss were 128 volunteers 
from the introductory psychology course at 
the University of Wisconsin. The sex ratio 
was balanced for all reversed and control 
subgroups, but Ss were otherwise unsys- 
tematically assigned. One S was discarded 
because he had achromatic vision, and 15 Ss 
due to apparatus failures. They were re- 
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placed by the next arriving Ss. Seven Ss 
were eliminated after the completion of the 
experiment because they failed to make more 
than five anticipatory responses during the 
first 60 differential conditioning trials. 


RESULTS 


Pen deflections of at least 1 mm., as 
recorded, were scored if they occurred 
between 250-800 msec. for the Color 
group, and 250-1000 msec. for all other 
groups. Criteria described by Hart- 
man and Ross (1961) were used for 
classifying Ss as voluntary (V) or 
conditioned (C) responders. These 
criteria indicated that six Ss could be 
categorized as Cs—a number far lower 
than that reported by Levy, Grant, and 
Clark (1964), but in line with recent 
Observations in the same laboratory 
(Fleming, 1965; North, 1964). When 
Ss were divided into the two categories 
on the basis of response form, only one 
additional S fell into the C category. 
As all seven Cs made a considerable 
number of voluntary responses (M 
= 20%), their records were combined 
with those of the Vs for all major 
analyses. The Cs were divided fairly 
evenly among the groups: Group Con- 
flict had one C and the remaining 
groups had two each. 

It was anticipated that separate 
within-S analyses of the voluntary and 
conditioned responses could be made, 
but these were not feasible because of 
the very small number of CRs elicited.4 
The mean percentage of CRs per block 

* Means associated with significant main 
effects and interactions in the various analy- 
ses of variance and the postexperimental 
questionnaire along with a summary of the 
data obtained therefrom have been deposited 
with the American Documentation Institute. 
Order Document No. 8832 from ADI Auxil- 
iary Publications Project, Photoduplication 
Service, Library of Congress, Washington, 
D. C. 20540. Remit in advance $2.50 for 
microfilm or $1.75 for photocopies and make 


checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


for all Ss to CS+ was 16.0% ; to CS-, 
4.39%. The general relationships among 
the four groups were substantially the 
same for both CRs and total (CR + 
V) responses. 

Differential conditioning.—Responses 
to CS+, CS—, and difference scores 
(number of responses to CS+ minus 
number of responses to CS—) per 
block were analyzed separately. Re- 
sponses to CS+ and the difference 
Scores indicated sufficient homogeneity 
of variance to permit analyses of vari- 
ance upon the raw scores: Cochran’s C 
(Winer, 1962, p. 94) values were 0.293 
and 0.271 (4, 185 df), respectively. 
AC (4, 185 df) of 0.367 based upon 
responses per block to CS— was signifi- 
cant beyond the .01 level, but a non- 
significant C of 0.253 was obtained 
after subjecting these data to an arc sin 
transformation. 

As can be seen in Table 1, where 
summaries of analyses of variance on 
the first six blocks are presented, a 
significant Blocks main effect was 
found in all three analyses. Inspection 
of Fig. 1 reveals that these main effects 
were due to a reliable increase in num- 
ber of responses per block to the CS+ 
and a corresponding decrease to CS— 
during Blocks 1-6. There were no 
differences among groups in respon- 
siveness to CS+. The highly signifi- 
cant Groups main effect that appeared 
in the analysis of responses to CS— 
was due entirely to a high level of re- 
sponding in the Conflict group. There 
were no significant differences among 
the Agree, Word, and Color groups’ 
average responses per block to CS— 
during Blocks 1-6 (F « 1.00). 

The obvious group differences in 
conditioned discrimination (i.e., differ- 
ence scores) during Blocks 1-6 are 
significant at the .05 level, F (3, 117) 
= 2.986, MS, = 9.962, but this effect 
was due to the relatively poor condi- 


—s a 


STIMULUS INCONGRUITY AND CONDITIONED DISCRIMINATION 79 


TABLE 1 


SUMMARIES OF ANALYSES OF VARIANCE BASED UPON RESPONSES PER BLock TO 
CS- AND CS— AND UPON DIFFERENCE SCORES FOR BLOCKS 1-6 


cs+* cs—> Diff. Scores* 
Source df 
MS F MS F MS F 
EE (G) 3 1.867 714 14.386 8.773** 29.740 2.986* 
Sex (S) 1 40.572 3.935* 6.305 3.885* 3.229 .324 
xS 3 9.375 .909 .857 .523 .984 .099 
s within 117 10.310 ES 1.640 cx 9.962 oe 
Blocks (B) 5 79.612 | 73.308*** .939 2.784 80.620 |71.841*** 
BX 15 .388 357 .460 1.351 1.987 | 1.771 
BXS 5 1.292 1.190 .049 .046 1.003 .894 
BXGXS 15 .855 187 .358 1.066 27.956 |24.912** 
Ss within X B | 561 1.086 = 337 = 1,122 e 


a Analyses computed on raw scores, 


b Analysis computed on scores subjected to an arc sin transformation. 


tioned discrimination in the Conflict 
group. Analyses of the difference 
scores functions revealed that the Con- 
flict group discriminated worse than 
each of the other groups, F (1, 60) = 
4.917, MS, = 11.023, p < .05 for Con- 


o——° AGREE 
o—e CONFLICT 
a——4 WORD 


MEAN PERCENT RESPONSES (CRs + Vs) 


BLOCKS 


flict vs. Agree; F (1, 58) = 7.550, 
MS, = 9.253, p < .01 for Conflict vs. 
Word; F (1, 59) = 4.354, MS.= 
10.785, p < .05 for Conflict vs. Color. 
Since the difference scores reliably in- 
creased over the first six blocks of 


REVERSAL 


OF FIVE TRIALS 


Fig. 1. Mean percentage of total responses per block during conditioned 
discrimination and reversal. 
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TABLE 2 


SUMMARIES OF ANALYSES OF VARIANCE BASED UPON RESPONSES PER BLOCK MADE 
DURING ComBINED BLocks 3-6 AND CowBINED BLOCKS 7-10 


CS during Blocks 7-10 
Source df CS4 cs—> 
MS F MS F 
G 3 19,267 6.487*** 43.935 13.569*** 

a $ ! 1 20.015 6.732* 33.405 1031778. 
Reversal (R) 1 390.996 131.516*** 492.131 151.986% 
Gxs 3 27.010 9.085*** 8.746 2.701% 
GXR E 9.809 3.229* 19.089 5.895 
SXR 1 7.914 2.662 4.515 194. 
GXSXR 3 13.884 4.670** 11.072 3.419 
Ss within 210 2.973 — 3.238 M 
Blocks 3-6 vs. 7-10 (B) 1 311.365 148.340*** 256.202 187.145 

x 3 677 .322 VR d 
BXS 1 479 .228 í . 
B X R 1 208.778 99.465*** 177.173 129.418*** 
BXGXS 3 1.979 943 .858 627 
BXGXR 3 .988 AT1 4.461 3.259 
BXSXR 1 .001 .000 .360 .263 
BXGXSXR 3 .990 AT2 2.852 2.083 
Ss within X B 726 2.099 — 1.369 x 

y [ncludes Neg-Pos reversal and Pos-Pos control Ss. 
T En Pos-Neg reversal and Neg-Neg control Ss, 
"5 e 01. 
ED «c 001. 


trials, it is evident that, in general, 
Sharp discriminations between CS+ 
and CS— were formed, There was 
little evidence for the hypothesis of 
discrimination developing at differen- 
tial rates in the four groups, for the 
Groups x Blocks interaction was non- 
significant. 

Reversal.—Performance during the 
four blocks of discrimination reversal 
is shown in the right of Fig. 1. It 
may be compared to the performance 
based upon the four blocks of differ- 
ential conditioning occurring (a) prior 
to reversal (i.e., Blocks 3-6), and (5) 
simultaneous to the reversal trials for 
the control Ss (i.e., Blocks 7-10). 

Gross comparisons of Ss during 
Blocks 7-10 with their performance 
during Blocks 3-6 were achieved by 
analyses of variance, summarized in 
Table 2 (see a above). The Fs on the 
right of the table were based upon re- 


sponses of reversed Ss to CS+ and 
control Ss to CS— during Blocks 3-6, 
and all of these Ss to CS— during 
Blocks 7-10. In the parallel analysis, 
the Fs in the left of Table 2 were com- 
puted from responses of negative-posi- 
tive reversal and positive-positive con- 
trol Ss. The Conflict group consist- 
ently made a larger percentage of total 
responses per block than the other 
groups. This was due to the rather 
high level of responding to CS-, 
and was in large measure responsi- 
ble for the significant Groups main 
effects and the Groups x Reversal 
interactions. As anticipated, control 
Ss responded to CS+ at a significantly 
higher level than Ss reversed from 
CS— to CS+ (Ms = 67% and 42% 
responses per block, respectively) and 
control Ss responded to CS— at a sig- 
nificantly lower level than Ss reversed 
from CS+ to CS— (Ms — 2195 and 
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TABLE 3 


SUMMARY OF ANALYSES OF VARIANCE OF TOTAL RESPONSES PER BLOCK MADE TO 
CS+ AND CS— purine BLocks 7-10 


CS+ cs- 
Source df 
MS F MS F 
CEPS (G) 3 9.304 1.197 25.097 7.998*** 
Sex (S) 1 7.150 920 13.882 4.424** | 
Fees (R) 1 14.175 1.824 39.369 12.546*** 
xs 3 25.766 3.317* 5.058 1.612 
GXR 3 4.470 515 7.264 2.315 
RXS 1 4.534 .583 .944 .301 
GXRXS 3 4.304 554 10.097 3.218** 
Ss within 105 7.711 = 3.138 — 
Blocks (B) 3 2.890 3.114* 12.173 11.716*** 
BxG 9 1.676 1.806 1.019 .981 
BXS 3 1.490 1.606 1.650 .588 
BXR 3 .745 803 9.155 8.811*** 
BXGXS 9 .163 176 1.012 974 
BXGXR 9 2.649 2.855* 675 650 
BXRXS 3 .287 309 435 420 
BXGXRXS 9 1.325 1.428 971 935 
Ss within X B 315 .928 = 1.039 = 


50% responses per block, respectively). 
A sharper picture of the significant 
reversal effect appears upon compari- 
sons of the mean percentage of re- 
sponses per block to CS+ and CS— 
during the two phases of the experi- 
ment: During Blocks 3-6, Ss reversed 
from CS— to CS+ averaged 21.0% 
responses per block, while in Blocks 
7-10 they averaged 62.4%. The cor- 
responding means for Ss reversed from 
CS+ to CS— were 68.8% and 31.0%. 
In comparison, control Ss generally 
showed nonsignificant changes in re- 
sponse levels between Blocks 3-6 and 
7-10. 

In Table 3 (see b above) analyses 
of variance of responses per block 
made by reversed and control Ss dur- 
ing Blocks 7-10 are summarized. 
Comparing these summaries with the 
means presented in Fig. 1 several facts 
emerge. In the analysis of responses 
to the CS+ during Blocks 7-10, the 
appearance of a nonsignificant Reversal 
main effect and a Block X Reversal in- 


teraction suggest that negative-positive 
reversal was accomplished, in general, 
rapidly and completely. 

The comparable analysis for re- 
sponses to CS— during Blocks 7-10 
presents a totally different picture, 
The Ss in the Conflict group tended 
upon reversal of CS-- to CS— to drop 
precipitously to the level of the Conflict 
control Ss, which was shown previ- 
ously to be reliably higher than the 
performance level of the other three 
groups. It can be observed from Fig. 
1 that the Agree, Word, and Color 
reversal groups during Blocks 7-10 
slowly and continuously approached the 
low level of responding of the respec- 
tive control Ss. Block-by-block com- 
parisons of reversed and control Ss 
revealed significant differences in 
Block 7 in all groups but Conflict. 
In succeeding blocks these differences 
diminished, and by Blocks 9 and 10 
none was significant. Block-by-block 
comparisons of reversed and control 
Conflict Ss revealed nonsignificant dif- 
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ferences in average responding to CS— 
during Blocks 7-10. 


Discussion 


It is obvious from these data that excel- 
lent conditioned discrimination was pro- 
duced in the Agree, Word, and Color 
groups. The relatively poor discrimina- 
tion displayed by the Conflict group was 
due entirely to the elevated level of re- 
sponding to the nonreinforced CS, There 
was no evidence of differential responding 
among the four groups to CS+. The 
Conflict Ss, in general, must have not 
selectively attended to only one component 
of the complex CS at the expense of the 
other. Had they performed in this man- 
ner, their pattern of conditioned discrim- 
ination would have been indistinguishable 
from the other groups’. 

Hullian (Hull, 1943, 1952) and Pavlo- 
vian (Pavlov, 1927, 1938) theories can 
explain our findings if it is assumed that 
generalization from CS+ to CS— was 
greater than the reverse owing to a 
greater spread of the positive gradient. 
In view of Gynther's (1957) findings it 
would not be unreasonable, however, to 
expect at least a modicum of generaliza- 
tion of inhibition. In fact, a finding of 
symmetrical dimunition of the CS+ and 
an elevation of the CS— functions in the 
Conflict group would not have been sur- 
prising in view of recent Soviet investiga- 
tions (Fadeyeva, 1960; Seredina, 1960). 

Explaining the asymmetry of the CS 
functions in the Conflict group seems to 
require a few post hoc speculations. Sup- 
pose that when a word such as PINK is 
presented to S, a number of representa- 
tional mediating responses ensue, some of 
which are common to those produced 
when the color pink is shown, Assume 
further that these responses elicit self- 
produced stimuli which serve as the func- 
tional conditioned stimuli (Osgood, 1953). 
If these suppositions are met, then the 
functional CS— would have been rein- 
forced 50% vs. 0% of the time for the 
nominal CS—. The final response level 
to a simple object CS reinforced on a 
50% schedule (Ross, 1959) is a close 


match to the CS— function of the Conflict 
group. Unfortunately, this interpretation 
is no more adequate than one based upon 
Hull’s or Pavlov’s theory since it requires 
a corresponding decrement in responsive- 
ness to CS+. 

It is known that word conditioned stim- 
uli require longer CS-UCS intervals for 
optimal conditioned discrimination than 
simple object stimuli ? (Hartman & Grant, 
19624). Hartman and Grant observed 
that long CS-UCS intervals compared to 
shorter intervals benefited discrimination 
between simple object conditioned stimuli 
by producing lower levels of responding 
to CS—. One would not expect differ- 
ences among the Agree, Conflict, and 
Word groups, which all had long CS- 
UCS intervals, unless another variable 
were operating. And this variable should 
logically be related to the arrangement of 
the components of the complex CS. It 
may be, for example, that the more “com- 
plex" the CS the longer it takes for in- 
hibition, relative to excitation, to become 
organized, 

Alternatively, suppose that because of 
the common elements in the Conflict con- 
ditioned stimuli, Ss could not always re- 
member the CS contingencies. If the 
air puff was bothersome, a very adaptive 
strategy for lessening the noxiousness O' 
the UCS would be to blink when not cer- 
tain. This post hoc hypothesis was sug- 
gested by the results of the postexperi- 
mental questionnaire. Almost all of the 
Ss could verbalize the procedure (and 
often, the purpose) of the experiment ex- 
ceptionally well. They certainly pos- 
sessed sufficient knowledge of classical 
conditioning to "help" E by blinking at 
appropriate times if they were so dis- 
posed. Further, over 10% of the sample 
candidly admitted disregarding the neu- 
tral instructions and blinking intentionally 
during the CS-UCS interval in order to 
lessen the noxiousness of the UCS, Addi- 
tional support for the hypothesis comes 
from the fact that 6 of 31 Conflict Ss 
(19.4%) were incorrect in verbally iden- 
tifying CS+ and CS—, while less than 
4% of the remaining Ss were incorrect. 
Although the operationally defined pro- 
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cedure was classical conditioning, there 
probably was a good deal of instrumental 
conditioning owing to the rather high 
level of voluntary responding. 
Reversal—In a recent study of dis- 
crimination reversal Levy, Grant, and 
Clark (1964) noted an asymmetry be- 
tween positive-negative and negative-posi- 
tive reversal. That is, in negative-posi- 
tjve reversal Ss failed to reach the per- 
formance level maintained to CS+ by con- 
trol Ss, but the final levels of responding 
to CS— by these Ss were indistinguishable 
statistically. In a subsequent investiga- 
tion, North (1964) failed to obtain reli- 
able differences during the reversal stage 
between reversed and control Ss' re- 
sponses to CS+, but an unfortunately 
large between-S variance made her tests 
imprecise. In the present study, nega- 
tive-positive reversal also appeared to be 
rapid and complete, for even in Block 7 
there were nonsignificant differences be- 
tween the response levels to CS+ by 
Agree, Conflict, and Word groups. In a 
verbal instrumental conditioning analogue 
to their discrimination reversal experi- 
ment, Levy et al. (1964) observed ex- 
tremely rapid reversals from CS— to 
CS+ and vice versa. The data from the 
present study are quite compatible with 
North’s, perhaps because both studies 
were blessed with a high incidence of 
voluntary responders. Only about 5% of 
the sample obtained by Levy et al. could 
be categorized as voluntary by the Hart- 
man and Ross (1961) criterion, Thus, 
a comparison of the findings of Levy et al. 
with those of North and the present study 
may reveal only another example of the 
well-documented fact that voluntary and 
conditioned responders obey different laws 
(Hartman & Grant, 1962, 1962b ; Spence 
& Ross, 1959). The persistence of in- 
hibitory effects in reversal noted by Ross 
(1962) may well be inappropriate in dis- 
cussions involving voluntary responses. 
As in the Levy et al. and North studies, 
positive-negative reversal was gradual 
and complete. By Blocks 9 and 10 none 
of the differences between reversed and 
control Ss' response levels was significant. 
It is interesting to note that extinction of 


the responses to CS+ by the Conflict Ss 
during reversal developed only to the 
level maintained by the control Conflict 
Ss to CS—. The variable which pre- 
vented greater inhibition to CS— during 
differential conditioning in the Conflict 
group may also have prevented complete 
extinction of responses to CS+ during 
reversal If, as was conjectured, many 
Conflict Ss disregarded E's instructions 
and also had difficulties remembering the 
CS contingencies, the high level of re- 
sponding by Conflict Ss to CS— during 
reversal should not be surprising. It 
should be noted that cognitive interpreta- 
tions of conditioning data are certainly 
not new (e.g., Hilgard & Humphreys, 
1938) and recently have served adequately 
to explain certain extinction data (e.g., 
Spence, Rutledge, & Talbott, 1963). 
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TASK PREDICTABILITY AND THE DEVELOPMENT OF 
TRACKING SKILL UNDER EXTENDED PRACTICE* 
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The influence of task predictability upon the organization of responses 
was studied in the context of an irregular step-function tracking task. 
Task predictability was determined by the proportion of elements in a 
12-unit sequence that were repeated each time the sequence was pre- 
sented. Proportions employed were 1.00, 0.83, 0.75, 0.67, and 0.00. 
Separate groups of 9 Ss each were assigned to conditions. There were 
820 repetitions of the 12-unit sequence during training and 80 repeti- 
tions after a 3-mo. retention interval. Measures of absolute integrated 
error showed that performance efficiency was positively related to pro- 
portion of repeating elements; differences among means were sig- 
nificant. The relations suggested by plotting information measures 
against integrated error were similar but differed in detail. Differences 
in integrated error among conditions were related to specific indexes of 


temporal and spatial error. 


There were substantial losses after the 


3-mo. retention interval, with the greatest absolute and relative losses 


for the more predictable tasks. 


In an earlier report (Trumbo, Noble, 
Cross, & Ulrich, 1965) acquisition, 
organization, and retention of tracking 
skill were investigated as a function 
of the amount of predictability present 
in a step-function task. In that pre- 
vious report, four task conditions—one 
random, one predictable, and two par- 
tially predictable (with 50% and 67% 
systematically repeating targets )— 
were presented at two levels of training 
and three retention intervals. The re- 
sults of that study indicated, among 
other things, that Ss with partially 
predictable tasks did not demonstrate 
significantly greater improvement than 
Ss with a completely random task with 
moderately extensive training. The 
present study was designed to test 
whether significantly greater improve- 

1 This research was supported by the Air 
Force Office of Scientific Research under 
Grant No. 526-64. Jane Quigley, William 
Griffitt, and Marilyn Schaus assisted in data 
collection and reduction, William Hull 
served as Project Engineer. 

2 Now with the Peace Corps. 


3 Now with the Decision Science Labora- 
tory, Hanscom Field, Bedford, Massachusetts. 
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ment could be found for these groups 
with more extensive practice and 
whether target sequences with greater 
degrees of predictability would result 
in performance levels commensurate 
with the amount of task predictability. 

Other findings from the prior study 
dealt with the organization and reten- 
tion of the skill. They suggested, for 
example, that the more critical aspect 
of response strategy and of skill reten- 
tion were to be found in the temporal 
rather than the spatial components of 
the task. Thus, Ss with the smallest 
error scores had the best records on 
timing indexes, frequently at a cost in 
spatial accuracy, and losses during re- 
tention were more directly related to 
changes in timing than to changes in 
spatial accuracy. The present study 
was designed to provide further evi- 
dence on these and similar aspects of 
skill during acquisition and retention. 


METHOD 


Subjects —A total of 45 right-handed male 
students, between 17 and 25 yr. of age, were 
randomly assigned, 9 each, to five experimen- 
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tal conditions. The Ss were volunteers from 
an introductory course and were paid for 
their services. 

Apparatus.—The apparatus has been de- 
scribed elsewhere in detail (Trumbo, Es- 
linger, Noble, & Cross, 1963) and was iden- 
tical with that used in the prior study 
(Trumbo et al., 1965). Target and cursor 
were each displayed as vertical hairlines 4 in. 
long. They overlapped by è in. along 
the X axis of a 5-in. cathode ray tube 
(CRT). The Ss were seated approximately 
28 in. from the CRT, The cursor was con- 
trolled by means of a lateral arm control, 
pivoted at the elbow, with a control-to-display 
ratio of 11° to 1 in. Absolute error (ie, with- 
out regard to sign) was cumulated over the 
duration of the trial by means of electronic 
integrators. Stimulus (target) input, arm- 
control (cursor) output, and absolute error, 
were recorded on magnetic tape continuously 
during Trials 15, 65, 205, and the first re- 
tention trial. "These data were transcribed on 
oscillographic records for purposes of an- 
alytic scoring. 

Tasks.—As in the prior study, each of the 
five groups received irregular step-function 
inputs at one target per second. Targets 
could appear at any of 15 equidistant posi- 
tions along the middle 4 in. of the X axis 
ofthe CRT. Trials were 48 sec. in duration 
with an intertrial interval of 12 sec., the 
final 2 sec. of which were filled with a 
warning buzzer.* 

Experimental conditions.—Five conditions 
of task predictability were employed. Three 
of these were identical with tasks used in 
the prior study, viz., (a) the predictable task 
(Task P), wherein 12 targets selected ran- 
domly from the 15 possible target positions 
were repeated in sequence within and 
throughout all trials; (b) an intermediate 
task (Task I-2) wherein 12 targets were 
selected as in a above, but with every third 
target randomly selected anew on each repeti- 
tion of the sequence (every third target, 
therefore, was unpredictable) but with the 
eight remaining targets fixed and repeated as 
in Task P; and (c) a random task (Task 
R) wherein a sequence of 240 targets was 
selected randomly.  Randomness was re- 
stricted only in that a target could not be 
immediately repeated. 

Two new tasks were included to obtain 
data on intermediate conditions somewhat 
more predictable than Task I-2. These were 
tasks consisting of a sequence of 12 targets, 


* Trials in the prior study were 60 sec. 
in duration. 


wherein every fourth (Task I-3) and every — 
sixth (Task I-5) target was randomly se- 
lected anew on each repetition of the ses 
quence while the remaining targets were re- 
peated as in Tasks P and I-2. The propor- 
tions of systematically repeating targets thus ^ 
were 1.00, 0.83, 0.75, 0.67, and 0.00 for task ` 
P, I-5, I-3, 1-2, and R, respectively. 

Procedure.—The Ss were assigned ran- 
domly to conditions and to the training sched- a 
ule. Orientation and instructions were essen- — 
tially identical with those of the prior study, — 
and included detailed descriptions of various — 
sources of error. The Ss were instructed to - 
look for patterns or sequences of targets, but 
were not told the precise nature of their own | 
task. After the initial instructions, Ss were — 
told at each session only that the task re- — 
mained the same as during previous sessions. 
Aíter each block of five trials Ss were told 
either that their error scores were “O.K.” 
or “worse” relative to the immediately pre- ~ 
ceding block of five trials. ji 

Instructions were followed by 15 trials on. 
Day 1 and by 25 trials on each of seven stic-- 
ceeding days. On Day 9, Ss were first given 
a final 15 trials (for a total of 205 trials 
on their training task, then transferred to à 
random task for 10 trials. All Ss retur 
after a 3-mo. retention interval (+2 days 
for 20 retention trials on their origi 
training tasks. 

Performance measures.—As in the pi 
study, absolute error, integrated over ti 
during each trial, served as the performa 
criterion, while additional analytic sco 
were taken from oscillographic records 
the selected trials. These indexes include 
leads, lags, beneficial anticipations, overshoot 
and undershoots (see Trumbo et al., 1965). 
These are defined in detail below. 7 


RESULTS 
Integrated Error 


experimental groups during traini 
and retention are presented in Fig. 
Training data are plotted in 10 trii 
blocks (except Block 20, which is 15 
trials) and retention data in five tri 
blocks with the first training and reca 
trial plotted separately. M 
Differences among adjacent groups ^ 
at the end of training are nearly iden- - 
tical, ranging from .85 to .95 error 
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INTEGRATED ERROR -Volts 
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Retention. 

5 Triol Blocks 
Fic. 1. Integrated error data for five con- 

ditions of task predictability. (Acquisition 

data are plotted in blocks of 10 trials, reten- 

tion data in blocks of 5 trials. Initial trials 

are plotted for both acquisition and retention.) 


Acquisition Blocks of 10 Triols 


units (volts). A simple between- 
within analysis of variance for the 
individual means of the last 15 trials 
(Block 20) resulted in F (4, 40) = 
15.76, p < .001. Duncan's new multi- 
ple-range test showed that all differ- 
ences were significant (p < .01) except 
those between immediately adjacent 
pairs of means (5 > .05). 

On the first recall trial, groups with 
the more predictable patterns (Cond. 
I-3, I-5, and P) showed absolute losses 
of about .8-1.1 v., as compared with 
mean error scores for Training Block 
20, whereas no losses were demon- 
strated for Tasks R and I-2. Losses 
were significant for Tasks P and I-5 
(correlated t tests, p < .05). Relative 
losses were greatest for Tasks I-3 
(about 40%), 1-5 (about 28%), and 
P (about 29%) when measured against 
the gains from Block 1 to Block 20. 

The losses for the more predictable 
groups reduced the variance between 
groups, but an analysis of variance for 
the first five retention trials yielded a 
significant F (4, 36) = 14.36, p< 
001. Duncan’s test indicated that only 
the difference between Tasks I-3 and 
I-5 was significant for adjacent pairs, 
but that all other differences were 
reliable (p < .05). 


It should be noted that not only did 
the three groups which showed per- 
formance losses at recall fail to im- 
prove upon their final training per- 
formance level during the 20 relearning 
trials, but only Group I-5 regained its 
earlier proficiency. In fact, Group P 
had significantly higher error (p< 
.05) after 20 retention trials than at 
the end of training. 

Performance on the random transfer 
task (not shown in Fig. 1), introduced 
immediately after the final training 
trial, failed to indicate any differential 
transfer effects from the five tasks. 
Means for all groups were highly simi- 
lar to that of Task R before transfer 
and differences between the means 
were extremely small and were non- 
significant. 


Analytic Scores: Timing 


Four indexes of timing were ob- 
tained for Trials 65, 205, and the first 
recall trial for all groups and for Trial 
15 for the “I” groups? The two major 
indexes were lead and lag, scored to 
the nearest 50 msec., as the difference 
between the initiation of a primary 
movement and the displacement of the 
target. Both duration and percentage 
of leads and lags and combined lead- 
lag data were analyzed. In addition, 
frequency counts were made of two 
types of anticipatory responses: Type ii 
included all cases in which the primary 
movement was initiated within +150 
msec. of target displacement; Type II 
included all cases in which the primary 
movement was initiated before target 
displacement but did not arrive at the 
anticipated target position before the 
target. While both anticipation scores 
have been labeled “beneficial anticipa- 

5Data for Trial 15 were inadvertently 
erased after oscillographic records were ob- 
tained for the “I” conditions, but before 
similar records were made for P and R 
conditions. 
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TABLE 1 


PERCENTAGES OF LEADS AND Lacs BY TASKS 
FOR SELECTED TRIALS 


Trial 
Task 
15 65 Ret. 1 
Lags 
R | — | 930 | 650 | 758 
I2 | 695 | 716 | 529 | 59.8 
L3 | 473 | 369 | 182 | 40.9 
I-5 55.1 32.1 12.5 31.2 
POL | dés | 28 | 178 
Leads 5 
R | — | 125 | 239 | 159 
L2 | 196 | 181 | 381 | 296 
L3 | 412 | 540 | 721 | 535 
LS | 369 | 551 | 780 | 3&6 
PO| —» | T&S | 943 | 771 
a Data not obtained. 


tion," the term more accurately de- 
scribes the Type II index, since the 
optimal response strategy includes an- 
ticipation of the next target so as to 
be half the distance to the anticipated 
target at the instant of target displace- 
ment with a rapid, perhaps slightly 
underdamped, movement. 

Lead and lag.—Table 1 presents the 
mean percentages of leads and lags for 
the five conditions on each of the four 
selected trials. All groups appeared to 
decrease lags and increase leads be- 
tween Trials 65 and 205. For the “I” 
groups, only I-2 failed to show the 
same trends between Trials 15 and 65. 
Generally, these trends were reversed 
during the first recall trial, i.e., lags 
increased and leads decreased to ap- 
proximately those percentages found at 
Trial 65. 

Mean lag, lead, and lead-lag values 
are plotted in Fig. 2A, 2B, and 2C, 
respectively, for each of the three se- 
lected trials. Data are plotted in milli- 
seconds on the ordinate and the five 
tasks are represented on the abscissa 
in terms of the proportion of predicta- 
ble targets in the sequence. Targets 
with zero lead or lag were included 


in the n’s. Data in Fig. 2C are for the 
sum of lead values minus the sum of 
lag values divided by the total number 
of targets presented. Duncan’s new 
multiple-range test was applied to the 
lag data and to the lead data for Trial 
205 using individual mean scores. 
Tasks I-2 and R did not differ sig- 
nificantly in either test, while, for lags, 
1-3, I-5, and P did not differ and for 
leads, I-2, I-3, and I-5 did not differ. 
All other differences were significant 
(p < .05). 

The curves were fitted by simple 
visual inspection and are included pri- 
marily to illustrate our interpretation 
of the data, viz., that performance on 
Task I-2 is more similar to that for the 
completely random task than would be 
expected in terms of the proportion of 
predictable elements in the task. This 
appears to be particularly evident for 
lags and for the combined lead-lag 
data. Moreover, when the percentage 
data from Table 2 are plotted as in 
Fig. 2, a similar picture emerges; 
Group I-2 has more lags and fewer 
leads than would be predicted from the 
other data points. In general, the 
curves indicate a decrease in duration 
of lags and a corresponding increase in 
duration of leads during training, con- 
sistent with the percentage data. How- 
ever, the retention interval appears to 
have differential effects dependent on 
task predictability. This is particularly 
evident in the lag data where Task R 
shows continued decrease in lag dura- 
tions, while Task P shows mean lag 
greater than for Trial 65; and in the 
lead-lag data where Groups R and 1-2 
show little change from Trial 205, 
while Groups I-3 and I-5 revert to 
performance like that on Trial 65 and 
Group P's performance shows even 
greater loss. 

Variability of timing —Variances 
were computed for the distributions of 
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leads and lags for each S on Trials 65, 
205, and the first recall trial, and mean 
variance values obtained for each of 
the five groups. These scores repre- 
sent the timing variability, or con- 
versely, the consistency of the duration 
between initiation of primary move- 
ments and displacement of targets for 
each group. Variances tended to de- 
crease with training and, except for 
Task P were lower at recall than at the 
end of training. At the end of training, 
Tasks I-2 (c? = 186 msec.) and I-3 
(c? = 179 msec.) had the highest vari- 
ability scores, followed by Task I-5 
(o? 150 msec.), Task R (o? = 150 
msec.), and Task P (o? = 101 msec.). 
It is evident from these data that the 
conditions of intermediate predictability 
result in the greatest variability in tim- 
ing response, i.e., individual Ss on the 
“I” patterns are more variable in their 
performance within a trial than for 
either Task P or Task R. 

Beneficial anticipations.—All groups 
except Group P increased the fre- 
quency of leads and lags that were 
within +150 msec. of target displace- 
ments, with R, I-2, I-3, and I-5, in 
that order, having beneficial anticipa- 


MEAN DURATION (msec.) 


0 CELH o 
R HR P 


Proportion Predictable Targets 


tions on from 50% to 75% of all tar- 
gets. For Task P, however, the per- 
centage dropped from about 50% to 
33% between Trials 65 and 205, then 
increased to 55% at recall. The “I” 
groups showed 2-5% increases at re- 
call. The reduction of beneficial an- 
ticipation by Group P at the end of 
training was the result of an increasing 
number of leads greater than 150 msec., 
as shown in Fig. 3B. 

The frequency of Type II beneficial 
anticipation increased for all groups be- 
tween Trials 65 and 205, then returned 
to approximately the Trial 65 level at 
recall. Groups P and I-5 had about 
75% Type II responses; I-3 had 70% ; 
I-2 had 40%, and R had 23% at the 
end of training. Thus, while the 
groups were consistently ordered on 
this index from most to least predict- 
able, Group I-2 again appeared to be 
disproportionately more like Group R 
than would be predicted on the basis 
of the proportion of predictable targets. 


Analytic Scores: Spatial Accuracy 


Both percentage and magnitude of 
overshoots and undershoots were ana- 
lyzed for the four selected trials. Both 
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Fic, 2. Mean duration of lags (A), leads (B), and leads-lags (C) on Trials 65, 205, and 
Retention Trial 1 for each of the five task conditions. (Curves were fitted by inspection.) 
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TABLE 2 
PERCENTAGES OF OvERSHOOTS AND UNDER- 
SHOOTS BY TASKS FOR 
TRIALS 


Reees 
UM ee 
LIII 


* Data not obtained, 


indexes were scored to the nearest mil- 
limeter on the oscillographic records 
which corresponded in range to target 
displacements on the oscilloscope. At 
28 in, viewing distance, 1 mm. is equal 
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to approximately 5 min. of visual angle, 
or 12 mm. equal a visual angle of 1*; 

Overshoots and undershoots.—Per- 
centages of overshoots and undershoots 
are presented in Table 2. Generally, 
all groups increased in percentage of 
overshoots and, except for Task R, 
decreased in undershoots throughout 
training, with a reversal of these 
trends, back to approximately Trial 65 
levels, at recall. 

The mean magnitude of overshoot 
errors tended to increase between 
Trials 15 and 65 for the “I” groups, 
then decrease for all groups by Trial 
205, as shown in Fig. 3A. “THe func- 
tion relating task predictability to over- 
shoot errors appears curvilinear with 
both Task R and Task P having lower 
values than the "I" groups. All 
groups, except Group I-5 showed some 
increase in overshoot magnitude at re- 
call. Figure 3B presents a picture 
undershoot errors very similar to that 


undershoot (B), and overshoot plus under- 
Trial 1 for five task conditions, (Curve 
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target positions in the foreward direc- 
tion. Relatively large increases in 
spatial error at the beginning of the 
sequence, especially for Task I-3, may 
indicate a spread of effect from Target 
Position 12, since no cues were pro- 
vided as to the end of the four repeti- 
tions constituting a trial. Finally, the 
large error at retention for Position 12 
of Task I-3 was due largely to one S 
who had unaccountably large errors 
at that specific target position. 

During training, Task I-2 showed 
relatively little difference in spatial 
accuracy as a function of target posi- 
tion. This was also true for the first 
retention trial, except for a large error 
at the sixth position, 

As shown in Fig. 4D, Task P had a 
relatively flat position curve at the end 
of training and showed a large increase 
in variability of error among the target 
positions at retention. 


Discussion 


Acquisition—In the previous report 
(Trumbo et al., 1965) it was noted that 
neither of the intermediately predictable 
tasks (I-1 and I-2) resulted in signif- 
icantly lower error scores than the ran- 
dom task (R). The results of the present 
study indicate that with extended practice 
the predictability inherent in the I-2 
task will probably result in superior per- 
formance. However, the findings also 
Suggest that the performance level at the 
end of training is not directly propor- 
tional to the proportion of predictable ele- 
ments in the task. Rather, the amount of 
error appears to be roughly a sigmoidal 
function of the proportion of random tar- 
gets in the sequence. 

The acquisition data also replicate 
rather closely the results of the prior 
study with respect to the performance on 
the three tasks common to both studies 
(Tasks R, I-2, and P). When differ- 
ences in trial length are adjusted, the 
performance curves for the two studies 


are essentially identical. Furthermore, it 
was not until Trial Block 15 in the pres- 
ent study that the I-2 group showed evi- 
dence of improvement over Cond. R. This 
corresponds to a point of training just 
beyond that given in the prior study. 

The analytic scores of the present study 
also provide new evidence on the relation 
of tracking performance to task predict- 
ability. First, the timing data lend supr 
port to the conclusion that performance 
on Task I-2 tends to be more like that on 
Task R than would be predicted from per- 
formance on the remaining tasks. This 
appears to be true for both proportion 
and duration of leads, lags, and combined 
lead-lag data, as well as on the beneficial 
anticipation indexes. Furthermore, both 
the direction and extent of changes at 
recall are quite similar for these two 
groups. Taking the timing data as a 
whole, it appears that as degree of pre- 
dictability increases, the relative fre- 
quency and the duration of lags decrease, 
but the relative frequency and the dura- 
tion of leads increase. Both for lead and 
for lag indexes and for the combined 
lead-lag index the relationships may not 
be linear functions of the proportion of 
predictable targets, Particularly for mean 
lead and lead-lag durations, the relation 
appears exponential, 

While there is little basis from either 
the error or the timing data for differen- 
tiating response organization of Tasks R 
and 1-2, data on spatial accuracy do indi- 
cate some differences. On mean under- 
shoots and on the combined overshoot- 
undershoot index the I-2 group is clearly 
different from the other groups through- 
out training. Furthermore, at recall the 
L-2 group showed the greatest changes 
in overshoot and undershoot errors, 1.€5 
the greatest losses in spatial accuracy. 

Except for Group P, all groups tended 
to increase mean undershoots during 
training. Overshoot frequencies 10- 
creased, and for the “I” groups mean 
magnitude of overshoots increased as well 
early in training, then decreased between 
Trials 65 and 205. It thus appears that 
during the course of training spatial 
accuracy is neglected while improvements 
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are made in timing (reduced lags and in- 
creased leads) but that later in training 
S makes some improvement in spatial 
accuracy by controlling the amplitude of 
his overshoots. 

This picture of the changes in spatial 
indexes during training and retention is 
to some extent at variance with that re- 
ported in the prior study (Trumbo et al., 
1965) wherein the percentage of over- 
shoots tended to decrease during training, 
but with relatively little change in under- 
shoots. However, when differences in 
scoring methods are considered, the re- 
sults are quite reasonable. In the prior 
study only over- and undershoots greater 
than 4 mm. were counted, whereas in the 
present study all over- and undershoots 
greater than 1 mm. were counted. Thus, 
taken together, the results indicate that 
during training fewer gross (>4 mm.) 
overshoots but more total (71 mm.) 
overshoots occurred, while at retention 
more gross overshoots but fewer total 
overshoots occurred. Comparison of 
Table 2 and Fig. 3A supports this inter- 
pretation, since while the percentage of 
overshoots decreased, the mean amplitude 
increased at retention. 

Groups P and I-2 had the lowest per- 
centages of undershoots at the end of 
training and had similar values. How- 
ever, since Groups I-2 and P are at oppo- 
site ends of the lead-lag data, it seems 
reasonable to assume that they arrive at 
similar undershoot percentages by quite 
different response stategies, ie. Ss on 
Task P reduce undershoots by improved 
accuracy of anticipatory leading re- 
sponses, while Ss on Task I-2 continued 
to lag. 

Intrasequence effects.—One would ex- 
pect that as learning progressed there 
would be a concomitant decrease in the 
relative amount of error associated with 
the fixed and systematically repeated tar- 
gets as compared with those targets 
selected anew on each repetition of the 
sequence. This result was obtained with 
respect to spatial error, but not temporal 
error, for Tasks I-3 and I-5, but not for 
Task 1-2. However, for Task I-2 Ss 
these results were not surprising since 


their integrated error scores were not 
reliably less than those of Ss on the 
random task (R). 

Inspection of Fig. 4 suggests that the 
effects of random elements in an other- 
wise fixed sequence extends to subsequent 
targets. That is, both at the end of train- 
ing and after the 3-mo. retention interval, 
there was consistently more spatial error 
for the first target positions to follow a 
random target than for the other non- 
random positions. Thus, although those 
targets which followed the random tar- 
gets were fixed, i.e., always appeared at 
the same position on the CRT, the fact 
that the direction and amplitude of the 
movements required to attain these tar- 
gets varied from repetition to repetition 
was apparently a critical factor. Even 
though S could predict the position of 
the target, he was unable to fixate, or pre- 
program, a movement pattern when re- 
sponding to the first target that followed 
a random target event. 

Retention.—The differential losses at 
retention appear to be most consistently 
related to changes in timing as reflected 
in the lead and lag data. The more pre- 
dictable tasks, which resulted in greater 
absolute and relative increases in error 
at recall, also had greater increases in 
frequency and duration of lags and 
greater decreases in frequency and dura- 
tion of leads than the remaining tasks. 
On the other hand, while the more pre- 
dictable tasks resulted in about equal 
error increase, changes in spatial accu- 
racy were not consistently related to these 
losses and, furthermore, Task I-2 which 
showed little change in error, had the 
greatest changes in spatial accuracy. This 
interpretation, viz., that retention losses 
are more consistently related to changes 
in timing accuracy, is consistent with that 
given in our earlier report (Trumbo et 
al., 1965). 

It is interesting to note in this re- 
gard that when the temporal aspect of 
the task is fixed and constant, but uncer- 
tainty is introduced into the spatial aspect 
the most predictable effects are in the 
time domain. Furthermore, the effects 
are seen in terms of variability of timing 
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as well as in mean values, as shown in 
the variance scores for Ss within groups. 

The intrasequence analyses suggest for 
Tasks I-3 and I-5 that the increase in 
integrated error at retention was pri- 
marily due to greater spatial error at the 
fixed positions immediately following the 
random positions, rather than at the ran- 
dom positions, per se. The loss after a 
3-mo. interval therefore appears to have 
resulted from partial forgetting of the 
sequence itself, rather than from other 
aspects of skill. For the fixed condition 
(Task P) the greatest error was just 
past the middle of the sequence, as would 
be expected for a serial task. However, 
the evidence for a bowed serial-position 
curve is not compelling, since error asso- 
ciated with Target Positions 2 and 12 
was about as great as that for the other 
targets near the center of the sequence. 
Failure to find a true serial-position curve 
is not surprising in view of the fact that 
the end points of the sequence are not 
cued for S, and a given target position 
may occur more than once in the se- 
quence, 

Brief note should be made of the results 
of the random transfer task presented 
immediately after the final training trial. 
While no differential transfer effects were 
obtained as a function of degree of pre- 
dictability in the training task, positive 
transfer occurred for all groups. That is, 
all groups performed on the transfer task 
at about the level of proficiency achieved 
by the Task R Ss after 205 trials of train- 
ing. Thus, those aspects of the transfer 
task where learning could occur (eg., 
timing, control-to-display ratio, etc.) 
were identical in both training and trans- 
fer tasks and resulted in no transfer loss. 

In summary, the data suggest that the 
strategies developed by Ss in Tasks I-3, 
I-5, and P differ from those developed 


in Tasks I-2 and R. In the former con- 
ditions Ss tended to initiate their re- 
sponses, even those they could not pre- 
dict, before the target changed position. 
Responses tended to be underdamped and 
to overshoot the target, although the mag- 
nitude of overshoots decreased with prac- 
tice. These tendencies were not accom- 
panied by any consistent change in force 
or rate of movement. With less redun- 
dancy (Tasks I-2 and R), Ss did not 
ordinarily initiate a response until about 
175 msec, or so—a reaction time—after 
the target changed position. These $8 
also tended to make an underdamped 
response and to overshoot more often 
than undershoot the target, but their 
greater integrated error scores resulted . 
primarily from their inability to make 
anticipatory responses. 

Finally, relearning performance 3 mo. 
after rather extensive training showed 
less rapid recovery of proficiency than - 
prior research by others would have lead . 
us to predict (Naylor & Briggs, 1961). 
Apparently losses in skill performance, 
at least on tasks such as we have used, 
are neither trivial nor transient and con- 
stitute a problem of both practical and 
theoretical significance. 
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EFFECTS OF DIMENSIONAL REDUNDANCY 
ON VISUAL DISCRIMINATION ! 


G. R. LOCKHEAD? 
Johns Hopkins University 


Absolute judgments of line lengths and line positions, under easy and 
difficult viewing conditions, were obtained when the stimulus dimen- 
sions were varied separately, together and perfectly correlated, and 


together and uncorrelated. Results showed that a redundancy gain 


is obtained—performance is better—from correlated dimensions and 


that this gain is independent of sensory limitation. 


Analyses suggest 


that redundancy gains are obtainable only when stimulus dimensions 


are integral and that dimensions may also have to be continuous. 


An 


empirical method of measuring the amount of integrality of stimulus 


dimensions is suggested. 


If Ss are asked to make absolute 
judgments on several stimuli which 
vary along only one dimension, it is 
reasonably certain that they will 
make errors. This paper is concerned 
with the effectiveness of additional, 
but redundant, information in re- 
ducing these errors. 

Discrimination performance is af- 
fected by the duration, contrast, or 
discriminability of the visual stimuli 
(Garner & Creelman, 1964), or by 
repetition of the stimuli (Swets, 
Shipley, McKey, & Green, 1959). 
Any or all of these stimulus manipula- 
tions may be used to reduce .S's errors 
in an absolute-judgment situation— 
up toa point, Beyond that point the 
channel-limit model (Miller, 1956) 
still holds and performance does not 
improve no matter how much you 
increase the brightness, energy, dura- 
tion, or number of repetitions of the 
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stimuli. It is convenient to think of 
these types of stimulus improvements 
as aiding discrimination performance 
when there are errors due to diffi- 
culties in getting the stimulus in- 
formation into the organism, i.e., 
when there are receptor limitations 
(Garner, in press). 

Thus, once the stimuli have been 
made as available as possible, Ss still 
make errors. If you have 20 black 
lines which vary in length by 3j-in. 
steps, and if you give San opportunity 
to tell you which line has been ran- 
domly chosen for him, he will make 
some errors in identifying the par- 
ticular line you have chosen; even 
though he has had considerable 
practice, the lights are bright, and he 
is motivated. The question asked is 
how we might aid S in his discrimina- 
tion, short of telling him the length or 
giving him a ruler. 

One possible technique, without 
changing the number of stimuli S is 
to discriminate, is to introduce re- 
dundancy. That is, increase the 
number of dimensions presented to S, 
but in a perfectly correlated fashion, 
so that the number of elements to 
be discriminated remains unchanged. 
Were we to do this to the lines, we 
could—as an example—introduce hue. 
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The smallest line could be red and, 
going around the color circle, the 
largest could be violet. If we have 
first made certain that Ss cannot 
judge these hues more accurately than 
they can judge the original 20 lines, 
how should the introduction of this 
redundant dimension affect Ss’ per- 
formance in discriminating the 20 
stimuli? 

Two experiments which attempted 
to answer this question of the effect 
of redundancy on absolute judgments, 
and which logically required the same 
task of Ss, produced strikingly differ- 
ent results. Eriksen and Hake (1955) 
presented Ss with patches made of 20 
different brightnesses, sizes, and hues 
—at first separately and then in a 
perfectly correlated manner—in a 
discrimination experiment. The re- 
sults are clear. Adding dimensional 
cues in a correlated manner, in an 
absolute-judgment situation, leads to 
increased total information trans- 
mission (fewer errors) over that 
obtained from the presentation of 
only one of the dimensions. 

Garner and Lee (1962) provided a 
series of alternative models to predict 
the effect of using redundant stimuli 
under different situations. They pro- 
vide the two observations that (a) the 
maximum improvement possible from 
the correlated situations, over the 
more discriminable single dimension, 
is .5 bit, allowing some reasonable 
assumptions, and (b) that the dimen- 
sions must not interact psychologi- 
cally in order for an improvement 
(redundancy gain) to be obtained. 
In an experimental test of their 
models, Garner and Lee presented Ss 
with patterns of Xs and Os in different 
positions with redundancy introduced 
in terms of additional X-O positions. 
Although this situation is formally 
equivalent to that of Eriksen and 
Hake, the results showed no gain in 


discrimination performance due to the 
correlated dimensions (X-O positions) 
and there was even evidence of a 
transmission loss. These two studies 
have been replicated in different 
laboratories. 

Energic limitations.—One difference 
between the two studies cited was 
that the stimuli used by Garner and 
Lee were of very brief duration while 
those of Eriksen and Hake were not. 
In his analysis of the problem, Garner 
concluded that “information trans- 
mission increases with increased di- 
mensionality only when perceptual 
discrimination is not time limited 
[1962, p. 2091." It was concluded 
that performance in a discrimination 
task can be limited by the dimen- 
sionality of the stimuli and/or by the 
duration of the stimuli, and that if 
performance is restricted by the dura- 
tion of the stimulus presentation, then 
increasing the dimensionality of the 
stimuli will not improve performance. 
Thus, redundancy is assumed to be 
effective in improving discrimination 
performance only where the duration 
or other energic attributes of the 
stimuli are not limited. 

Besides duration of presentation, 
there are at least two ways in which 
the patches used by Eriksen and Hake 
and the X-O patterns of Garner and 
Lee differ; these are the continuity 
and the integrality of the dimensions. 

Dimensional continuity.—The stim- 
uli used by Garner and Lee had either 
an X or an O in each matrix position 
used. Each such position constituted 
a single dimension, but the nature of 
such a dimension is that there is no 
reasonably perceptual continuum be- 
tween an X and an O. Dimensions 
such as brightness, size, and color, 
however, do have the property of 
permitting an intermediate percep- 
tion. Thus there can be a compromise 
perception, and a compromise re- 
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sponse, but with discrete stimulus 
dimensions (Xs and Os) such com- 
promise responses cannot be made. 

Integrality.—A phenomenological 
way in which the stimuli of Eriksen 
and Hake and those of Garner and 
Lee differ is in the integrality of the 
dimensions. Patches of different hues 
and different brightnesses are integral 
stimuli—dimensions join one another 
such that it is extremely difficult for .S 
to take note of one aspect of a 
stimulus without at the same time 
taking note of the other aspects. Per- 
haps they really become new stimuli. 
The X-O patterns are not integral; 
Ss not only had no difficulty ignoring 
some aspects of the stimuli with 
correlated dimensions (some of the 
positions), they reported actively at- 
tempting to ignore them. 

There is no mathematical or logical 
reason that dimensions must be 
integral to provide a redundancy gain 
but there may be a psychological 
reason. The Ss may, when possible, 
choose to ignore some dimensions or 
aspects when stimuli are not integral. 

Purpose of experiment.—This ex- 
periment was designed primarily to 
determine the role of energy limita- 
tions in stimuli on the effectiveness of 
redundancy in aiding discrimination. 
In addition, it was designed to provide 
some clarification of the roles of 
dimensional continuity and integrality 
in determining the effectiveness of 
redundancy. 


METHOD 


Stimuli which could be varied independ- 
ently in two dimensions were presented 
tachistoscopically in an absolute-judgment 
experiment. The stimuli were horizontal 
black lines .03 in. wide and centered on an 
8 X 8in. viewing field. The two dimensions 
were length, with a range of 2.0-2.9 cm. in 
10 steps of 0.1 cm., and vertical positions with 
10 0.1-cm. steps achieved by raising and 
lowering the 10-sided form on which the lines 
were mounted. The viewing conditions were: 


1, Easy: the stimulus duration was 200 
msec. and the bluish-white stimulus and 
adapting fields were easily seen. 

2. Difficult: the stimulus duration was 8 
msec. and the overall luminance was reduced, 
from the easy viewing conditions, by means 
of a 1.0 neutral density filter. The fields were 
quite dim and Ss sometimes reported not 
seeing the stimulus. The Ss partially dark 
adapted by sitting in a dim red-lighted room 
for 10 min. before participating in this 
condition. 

The stimulus conditions were: 


1. Single dimensions: (A) the line lengths 
were presented randomly with position held 
constant in the center of the stimulus field, 
and (B) the positions were presented ran- 
domly with line length held constant at 
2.5 cm. 

2. Correlated dimensions: the 10 ran- 
domly presented stimuli were composed of 
perfectly correlated lines and positions such 
that Position No. 1 and Line Length No. 1, 
Position No. 2 and Line Length No. 2, etc., 
always appeared together. 

3. Orthogonal dimensions: the 10 lengths 
and 10 positions were presented randomly 
and simultaneously such that there were 100 
stimuli, any line with any position. 


Each of these stimulus conditions was used 
with both viewing conditions to provide a 
total of eight experimental conditions. 

Subjects.—The Ss were six paid psychology 
graduate students, all male except GS. Three 
Ss were highly trained (GL, GS, and SI) in 
earlier work while none of the others (MJ, 
BB, and DG) had ever participated in such 
an experiment, 

Procedure, —The stimuli were presented in a 
modified Dodge-type tachistoscope (Merry- 
man & Allen, 1953) with a preadapting field 
of slightly less luminance than the stimulus 
field and with both fields 34 in. from S's eyes. 
The S’s responses represented a number from 
1 to 10 for the shortest to the longest line, 
for the topmost to the bottom position, and 
for the top-short to the bottom-long stimulus. 
In the orthogonal case, Ss gave two responses 
—line and position in that order. The E 
told S the correct response immediately. The 
S indicated his responses, which had been 
provided by E, on a keyboard which had 10 
keys laid out so that 1 key fit comfortably 
under each finger of S's relaxed hands. The 
keyboard could be used by touch with the 
leftmost to rightmost keys representing the 
numbers 1 through 10. The keyboard was 
used for recording purposes and to eliminate 
S's head movements from the chin and fore- 
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head rests were he to give a verbal response. 
The keys pressed were recorded on a remote 
electric receiving typewriter which was 
monitored by E. 

An experimental session lasted from 35 to 
55 min. depending on S and the condition. 
No S was allowed to serve more than once 
a day without at least a 3-hr. interval between 
sessions. There was a brief warm-up period 
before each condition. After a break (every 
one third session) Ss were shown the extremes 
of the current condition to maintain percep- 
tual anchors, All conditions were self-paced 
and Ss worked at the same rate, about 5 sec. 
per stimulus, with the single and correlated 
dimension conditions and at about 8 sec. per 
stimulus with the orthogonal conditions. 
Self-pacing was used because, according to 
pilot-study evidence, Ss very much prefer it. 
The method resulted in different intertrial 
intervals between the single response condi- 
tions and the two-response orthogonal condi- 
tions, but not between the single-dimension 
and correlated-dimensions conditions. The 
total time required to run each of these last 
two conditions was very much the same. 

The Ss gave 250 responses in an experi- 
mental session to the orthogonal conditions 
and, on alternate days, gave 150 responses per 
session to the lengths, to the positions, and to 
the correlated stimuli. The order of pres- 
entation of the single- and correlated-dimen- 
sions conditions was partially counterbalanced 
over Ss and sessions; according to an earlier 
pilot study such counterbalancing is not 
important. There were three training ses- 
sions before the experimental sessions for the 
easy conditions and there was one training 
session before the difficult conditions. For 
all Ss, the easy viewing conditions were run 
first since it was important only to show that 
performance was poorer on the difficult con- 
dition than on the easy condition and thus 
that energic limitations were operating. 

A total of 2,000 responses to the orthogonal 
conditions, and 1,200 responses to each of the 
other conditions, was collected from each S 
at each viewing condition. An exception was 
the naive Ss from whom half this amount of 
data was collected, on the easy conditions 
only, since earlier results indicated that the 
data which were being obtained were consis- 
tent with those of the experienced Ss. One 
S (SI) was not used in the difficult condition 
since he had poor acuity and could not see the 
stimuli. Thus only five Ss were used on the 
difficult conditions and six Ss were used in 
the easy conditions. In any comparisons 
between these conditions, SI's data were 
ignored. 
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RESULTS 


Single and correlated dimensions.— — 
The data for each S were pooled into - 
10 X 10 stimulus-response confusion 
matrices to provide one total matrix: 
for each S, for both the easy and 
difficult viewing conditions, on each 
of the three conditions requiring one - 
response per stimulus. Information” 
transmissions (IT) were determined - 
for each of these matrices and the 
results are shown in Table 1. Analysis 
of variance showed that viewing con- — 
ditions (easy vs. difficult), stimulus. 
conditions (length, position, or co 
related), and the interaction of stim= 
ulus and viewing conditions, each 
contribute a significant source 
variance (p < .01). Each S's da 
showed the same pattern. 
higher ITs in the correlated conditio! 
than in either of the single-dimension ” 
conditions. The effect of the order im” 
which treatments were taken on any 
one day was unimportant as det 
mined from analyses of the dai 
matrices. Learning effects were small, 


TABLE 1 


Torat ITs ror EACH S IN THE EASY AND 
DIFFICULT VIEWING CONDITIONS FOR | 
THE SINGLE- AND CORRELATED- 
DIMENSIONS CONDITIONS 


S Length Position 
Easy 
GS 1.17 1.12 
GL 1.20 1.41 
MJ 1.03 0.60. 
BB 0.86 0.71 
RG 1.07 1.10 
SI* 1.29 1.20 
Means 1.07 0.99 
Difficult 
GS 0.35 0.63 
GL 0.27 0.69 
MJ 0.06 0.06 
BB 0.14 0.50 
RG 0.12 0.47 
SI — -— 
Means 0.19 0.47 


...* Data for SI are not considered in any compari 
since he was not run in the difficult conditions. 
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primarily because data from the train- 
ing sessions are not reported. 

The decrement in performance on 
all conditions in the difficult, as com- 
pared to the easy viewing conditions, 
was quite large. However, the effect 
of the change in viewing conditions 
was not uniform across dimensions; 
the lengths were relatively more 
difficult to discriminate than were the 
positions when the viewing conditions 
were made difficult. It has been re- 
ported that accuracy of reporting 
position is not at all affected by the 
duration or brightness of the stimuli 
(Leibowitz, Myers, & Grant, 1955). 
Those results were for data collected 
only when S was willing to report 
seeing the stimulus. In this experi- 
ment, when Ss are required to respond 
on all trials, impoverished viewing 
conditions lead to a decrement in such 
discrimination performance, although 
not to the same extent as the de- 
crement in performance on judging 
line lengths. 

Orthogonal dimensions—The ITs 
calculated for the 100 X 100 stimulus- 
response matrices, and those calcu- 
lated for the 10 X 100 matrices ob- 
tained when one looks only at those 
correlated stimuli imbedded in the 
orthogonal matrix, are not interpret- 


E able because of the sampling bias 


involved (Miller, 1955). The matrix 
from the orthogonal condition con- 
tains 10,000 cells with, given random 
responses, an expectation of only 0.2 
entries per cell for all of the data for 
any one S on either the easy or difficult 
viewing conditions. The resulting 
sampling bias estimate is larger than 
the information transmitted giving a 
negative estimated 1T—an obviously 
incorrect conclusion. Of several dif- 
ferent possible methods of estimating 
the bias, none is satisfactory to 
give precise transmission measures. 
Therefore, ITs computed from the 


complete 100 X 100 and the 10 X 100 
matrices are not used. The complete 
100 X 100 bivariate matrices were 
collapsed in all six ways possible to 
provide the following 10 X 10 ma- 
trices, the ITs from which were used. 

1. R;Sri—the responses to lengths 
for variations of the stimuli in length, 
This is the sameas the 10 X 10 matrix 
for the length dimension reported in 
Table 1, except that this is for the 
responses for lengths when both di- 
mensions were varied. 

2. RpSp—the responses to positions 
for variations of thestimuli in positions. 
The same comments as above hold. 

3. RiSp—the responses to lengths 
for variations of the stimuli in posi- 
tion. This term is a cross contingent 
uncertainty term and, from S's point 
of view, a measure of the interaction 
between stimulus dimensions. 

4. RySy—the responses to positions 
for variations of the stimuli in length. 
The comments above hold here also. 

5. Ri R»—the responses to lengths 
when there are variations in the re- 
sponses to positions. This term gives 
an indication of response interference 
or interaction and of response bias. 

6. SyP»—the relation between vari- 
ations in lengths as stimuli and vari- 
ations in positions as stimuli. Since 
this relation was built in as random 
(the stimuli were presented ortho- 
gonally) the ITs from these matrices 
serve as a base-line indicator of 
performance. 

ITs were calculated for each of 
these six matrices for each S under 
both viewing conditions and the 
results are shown in Table 2, 

Table 2 has the fortunate property 
that the effects of the conditions 
considered are apparent from the 
average transmission scores. Taking 
about .04 bit, the transmissions from 
the stimulus-stimulus matrices, to 
represent chance transmission, it is 
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TABLE 2 
IT ror EACH S IN THE Easy AND DIFFICULT VIEWING CONDITIONS FOR THE SIX 


PossrBLE COMBINED MATRICES FROM ORTHOGONAL CONDITIONS 


100 
S RıSı RrSp 

Eas 
GS 1.00 0.81 0. 
GL 1.01 1.19 0. 
MJ* 0.78 0.50 0. 
BB» 0.83 0.70 0. 
RG* 1.00 0.74 0. 
SI» 0.98 0.88 0. 
Means 0.92 0.79 0. 

Difficult 
GS 0.23 0.66 0. 
GL 0.26 0.69 0. 
MJ 0.03 0.03 0. 
BB 0.13 0.54 0. 
RG 0.09 0.44 0. 
Means 0.15 0.47 0. 


SRRSSER 


meme ROe 
PNAKRO 


P RiS» RrSL SiS 
0.03 0.03 0.03 
0.02 0.03 0.03 
0.04 0.08 0.05 
0.05 0.09 0.05 
0.05 0.05 0.06 
0.03 0.05 0.04 
0.04 0.06 0.04 
0.03 0.04 0.03 
0.04 0.05 0.04 
0.03 0.02 0.05 
0.05 0.06 0.05 
0.03 0.04 0.04 
0.04 0.04 0.04 


^ These data are for 1,000 observations, all other data are for 2,000 observations. ^. 
» Data for SI are not considered in any comparisons since he was not run on the difficult conditions. 


obvious that the associations between 
stimuli and responses in the matrices 
RzSr and RpSp are considerably above 
chance, The relations between these 
two conditions, and with the easy and 
difficult viewing conditions, are con- 
sistent with those reported in Table 1. 

The ITs from the cross contin- 
gencies RiSp and RpS; are not 
demonstrably different from the ITs 
from the S;Sp matrices, indicating no 
association across dimensions, This 
fact means that the value of the 
stimulus in one dimension does not 
differentially affect the value of the 
response to the other dimension and is 
an indication of the independence of 
the two dimensions, 

The transmission from the RyRp 
matrices in the easy viewing condi- 
tions are trivially larger than those in 
the random matrices. The ITs from 
these response-response matrices in 
the difficult viewing conditions, how- 
ever, are considerably larger than 
those in the S;Sp matrices. It ap- 
pears that, when Ss have difficulty 
getting information from the stimulus, 
Ss generate their own information by 
making their responses contingent on 


one another, This result is consonant 
with one reported by McGill (1957) 
whose Ss gave sequentially predictable 
responses to random tones of very 
low intensity. 


Discussion 


Energic limitations and redundancy.— 
The results of this experiment demon- 
strate that Ss can discriminate redundant 
stimuli better than they can discriminate 
nonredundant stimuli, and that this re- 
dundancy gain is obtainable whether or 
not discrimination is being affected by 
energic limitations. All Ss performed 
better in the correlated-dimensions con- 
dition than in either of the single-dimen- 
sion conditions, under both viewing 
conditions. This result means that the 
failure to get improvement in discrimi- 
nation performance with redundant stim- 
uli, in the study conducted by Garner 
and Lee, is not attributable to the fact 
that the receptor system was energy 
limited. The effectiveness of dimensional 
redundancy does not depend on receptor 
or energy limitations. : 

The redundancy gains obtained in this 
experiment are not as large as those gains 
obtained by Eriksen and Hake. The 
point of these results is not that e 
constant amount of redundancy gain !9 


— 
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obtained independent of energic limited 
situations. Indeed, performance on cor- 
related dimensions cannot be expected 
to improve by .5 bit, over performance 
on single dimensions, if the added dimen- 
sion provides only .2 bit by itself. The 
point is that all of the redundancy gain 
possible may be achieved, but not much 
is available mathematically, when the 
stimuli are extremely difficult to see or 
to discriminate. 

Theoretical and obtained redundancy 
gains.—It is useful to consider just what 
redundancy gains may be expected 
theoretically from the data of this ex- 
periment. The theoretical expectations 
which are given here are based on two 
assumptions: (a) the distribution of 
responses to each stimulus is normal, 
and (b) Ss cannot reliably select, from 
multidimensional stimuli, that dimension 
which was perceived most accurately on 
each presentation, and respond to it alone. 

To determine the theoretical redun- 
dancy gains possible, ITs were trans- 
formed into equivalent univariate vari- 
ances of the stimuli, V(S), according to 
the following rationale. The total vari- 
ance of the actual response values can be 
divided into the variance of the means of 
the response values plus the error variance: 


Sè = Si? +e? [1a] 


SÈ = Su + er. [1b] 


Garner and McGill (1956) have shown 
that, in general, estimated uncertainty : 


est IT = 3 logs (S1?/e:*) [2] 
and, substituting Equation 1a in Equa- 
tion 2, 

est IT = 4 logs (CSa? + e:7J/e:*). [3] 
In the unit normalized system: 

est IT = 4 loge (CSa? + 1]/1) 
4 logs (EV(S) + 17/1}. [4] 
The numerator in Equation 4 is the 
variance of the means plus the error 
variance and the denominator is the 
error variance again. This notion of 
dividing the variance into its components 
with unit error variance, to estimate 
information transmission, was introduced 
by Garner (1960). 


and 


Il 


To determine the effect of combining 
dimensions in a perfectly correlated and 
linear fashion, the univariate variances 
can be combined according to conven- 
tional statistical techniques (McNemar, 
1960, p. 137): 


Sı42/ (e? + e?) 
> { (SP+ Se? + 271251S2)/(e:?-+e2") } 
[5a] 
={ (Sn H Sik 2S Se? 
+ (e1e:)/ (e? Fe?)). [5b] 


In the unit normalized system where 
ei =e = 1, taking }logs of the argu- 
ment in Equation 5b to convert to 
est IT according to Equation 4, and 
replacing S with V(S) to remind us that 
these are estimated variances, the esti- 
mated IT for the correlated dimensions, 
estimated from ITs computed separately 
for the single dimension conditions, is 
shown to be 


est IToorr=4 logs ([EV(51)4- V(8:) 
+2VV (6). V(5)+27/2} [6] 


which is the general formula for esti- 
mating information transmission from 
two correlated dimensions. The same 
rationale as provided here allows ex- 
tending the equation to more than two 
dimensions. i 

If we assume that V(S$)) = V(S:) 
= V(S), then Equation 6 becomes 


est ITeorr = 3logs ([2V(S) + 11/1]. [7] 


which is the formula for combining two 
sets of univariate linearly correlated 
stimuli in the Garner and Lee (1962) 
paper. The effect of assuming that 
V(Si) = V(S2, if in fact the separate 
variances widely differ, provides an 
inflated estimate of redundancy gains. 
Therefore, Equation 6, rather than 7, 
was employed in all estimations given 
in this paper. 

Two sets of data were used to estimate 
the expected ITs in the correlated 
dimensions conditions: 


Seti. Applying Equation 6 to predict 
performance on the correlated conditions 
from the single-dimension conditions of 
length and position as shown in Table 1, 
one would expect a redundancy gain of 


102 G. R. LOCKHEAD 


TABLE 3 


REPRESENTATIVE OBTAINED OR ExPECTED ITs FROM THE SINGLE-DIMENSION, CORRELATED- 


DIMENSIONS, AND ORTHOGONAL-DIMENSIONS CONDITIONS IN THE 


THREE EXPERIMENTS CONSIDERED 


Garner and Lee 
Dimension 
Presentation | Short Duration | Long Duration 
(.02 sec.) (.50 sec.) 
Single 0.20 1.50 
Correlated 0.20 1.50 
Orthogonal (0.11) (1.08) 


Lockhead 
Eriksen and Hake 
Difficult Easy 
0.47 1.07 2.75 
0.58 1.22 3.43 
0.47 0.92 (2.75) 


Note.—Values in parentheses are estimated, see text for procedure. 


about .36 bit in the easy viewing condi- 
tion and about .07 bit in the difficult 
viewing condition. The gains actually 
obtained were .15 bit and .11 bit, 
respectively. The discrepancy for the 
easy viewing conditions is quite large. 

Set2. Applying Equation 6 to predict 
performance on the correlated conditions 
from the data for lengths and positions 
from the two response orthogonal condi- 
tions given in Table 2, one would 
expect a redundancy gain of .32 bit in 
the easy viewing condition and .04 bit 
in the difficult viewing condition. The 
gains obtained were .30 bit and .11 bit, 
respectively. These gains are very close 
to those expected theoretically, given the 
assumptions stated earlier. 


The theoretical estimates of redun- 
dancy gains given above apply to 
different questions. The solutions in 
Set 1 estimate the effect of adding varia- 
tion on a new dimension while the 
solutions of Set 2 estimate the effect 
of adding redundancy when two dimen- 
sions already vary. In the single-dimen- 
sional conditions, the Set 1 comparison 
above, S was always aware of the value 
of the dimension not being reported. 
When responding to lengths S knew that 
the position would be No. 6, when re- 
sponding to positions he knew that the 
length would be No. 6. When responding 
to the correlated stimuli S was always 
uncertain of the value of both dimen- 
sions, not just one. 

When responding to lengths and posi- 
tions in the orthogonal conditions, the 
Set 2 comparison above, S was always 
uncertain as to the value of both dimen- 


sions and had the added task load of 
giving two responses, rather than one, 
to each stimulus presentation. The Set 
1 comparison is analogous to the Eriksen 
and Hake (1955) study. The Set 2 
comparison is not available from their 
study since they did not collect data on 
the orthogonal dimensions conditions. 
Obtained and predicted redundancy 
gains in different experiments.—Repre- 
sentative results of the three relevant 
experiments, those of Garner and Lee 
(1962), Eriksen and Hake (1955), and 
the present study, are summarized in 
Table 3. Garner and Lee used five 
different durations of stimulus exposure 
in their study, of which the extremes are 
selected as roughly equivalent to the 
easy and difficult conditions in this study. 
The only experiment which provides 
data on the orthogonal conditions is the 
present one, and the numbers in paren- 
theses in Table 3 are predictions of those 
numbers which would be obtained if the 
orthogonal conditions were run in those 
studies, provided Ss paid equal attention 
to both dimensions. These predictions 
are obtained by putting IT, into 
Equation 7 and calculating V(3) and, in 
turn, determining est IT by Equation 4- 
In the Eriksen and Hake study, which 
was conducted in the same manner as 
the present single-dimensions conditions, 
the difference in performance between 
the single-dimension conditions and the 
correlated-dimensions conditions is al- 
ready as large as is theoretically possible. 
Taking nonnormal distributions into 
account, the obtained .68-bit gain is 
approximately what is predicted from 
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Ss' performance on the single-dimension 
conditions. Since the maximum theo- 
retically possible gain was obtained in the 
correlated conditions, performance on 
the single and orthogonal dimensions 
should be the same in the Eriksen and 
Hake situation, given the assumptions of 
themodel. Itispredicted thisresult would 
be obtained if the data were collected. 

. In the Garner and Lee results, there 
was no difference between performance 
on the single dimensions and the cor- 
related dimensions. Were Ss required 
to attend equally to both dimensions, 
in an orthogonal situation, the mathe- 
matical prediction is that the average 
performance on the two single dimensions 
in the orthogonal situation would be de- 
pressed to that shown in parentheses in 
Table3. But Ss' subjective reports in the 
Garner and Lee study suggest that they 
would not pay equal attention to all 
dimensions. Also, since the predictions 
are on the basis of combining variances, 
and since there is no guarantee that such 
an operation is possible with discrete 
stimuli, the measures predicted may not 
be those actually obtained. Because of 
these conditions, no prediction is given 
for the empirically obtained results from 
this condition, if it were run. 

Integral stimuli: A definition.—1f stim- 
ulus dimensions are completely inte- 
grated, S does not have to select portions 
of the stimulus presentation and he will 
transmit the maximum gain possible 
from the single response conditions. It 
is reasonable to suppose that a naive S 
can look at a position in the present 
experiment without, at the same time, 
being able to accurately report the 
length of the line. However, it is not 
reasonable to suppose that S can look 
at a colored patch without observing all 
of its aspects, Phenomenologically, it is 
difficult for a normal person to look at 
a lighted and colored incandescent bulb 
without being aware—at one time— of 
its hue, brightness, size, and form. 

Eriksen and Hake's Ss provided maxi- 
mum redundancy gain from the single- 
response conditions. This means that 
they did not select only part of the 
stimulus information presented and that 
the stimulus dimensions were integral. 
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Garner and Lee's Ss provided no re- 
dundancy gain from the single response 
conditions. This suggests that Ss se- 
lected some dimension(s) and ignored 
the others as best they could and that the 
stimulus dimensions were not integral. 

The Ss in the present study provided 
some redundancy gain from the single- 
response conditions. This means that 
they partially selected from the stimuli 
and that there was some integrality of 
the dimensions. 

Taken together, these observations 
suggest that the difference between per- 
formance on the single-dimension and 
the correlated-dimension conditions can 
be interpreted as a measure of the 
amount of integrality of the stimuli. If 
this measure is the maximum possible, 
as it is when the stimuli are composed of 
hues, brightnesses, and sizes, then the 
stimuli are integral. If the measure is 
zero, then the stimuli are discrete or 
nonintegral. 

Shepard (1964) used dimensions simi- 
lar to those of the present study when 
he determined similarities between stim- 
uli composed of the two dimensions, size 
of circle and inclination of radius. His 
Ss did not combine dimensions into an 
integral stimulus and Shepard concluded 
that Ss shift their attention more to one 
dimension or the other. To support his 
assumption that Ss attend to one 
dimension and then the other in his 
study, and to explain why they attend 
to the combined dimensions in the 
Eriksen and Hake study and in a 
classification study by Shepard and 
Chang (1963), Shepard makes the dis- 
tinction between analyzable and unitary 
stimuli. His arguments lead him to 
conclude that some stimuli are analyz- 
able and as such are psychologically 
more complex than are unitary stimuli 
since "they require more elaborate 
characterizations and they lead to more 
variable responses in learning and judg- 
ment tasks [1964, p. 81]." 

The problem is not that some stimuli 
are analyzable while others are not. 
The development of the Munsell system, 
for example, has shown that Ss can 
analyze colors by hue and brightness 
(Newhall, Nickerson, & Judd, 1943); 
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and such dimensions provide a redun- 
dancy gain. Almost certainly Ss can 
analyze any set of multidimensional 
stimuli. But to know that a complex 
stimulus is analyzable is not to know 
that it will provide a redundancy gain 
over a component of it. Indeed, Garner 
and Lee's stimuli and Shepard's stimuli 
are both sets of stimuli which do not 
provide a redundancy gain and are 
analyzable, while the stimuli of the 
present experiment do provide a re- 
dundancy gain and are also analyzable. 
What is important is that, besides being 
analyzable, the stimulus dimensions must 
be integral to provide a redundancy gain. 

What makes stimuli integral remains 
an enigma, but the weight of evidence 
thus far suggests that all of those dimen- 
sions which can be presented simultane- 
ously and in the same place, while 
maintaining psychological independence, 
will form an integral stimulus. 

Continuity of dimensions.—No data 
exist which show a redundancy gain for 
discrete elements, presuming hue can be 
considered to have a psychological con- 
tinuum, but whether or not stimulus 
dimensions must be continuous to pro- 
vide a gain is still not known, The 
dimensions of the present study are at 
least as continuous as those of Eriksen 
and Hake—but they provide a lower 
redundancy gain. Phenomenologically, 
hues, brightnesses, and sizes are more 
integral than are lengths and positions 
and perhaps integrality explains all of 
the gain due to redundancy. But this 
does not mean that integrality is the only 
requirement for a gain. Dimensional 
continuity may be a requirement for 
integrality. “If a redundancy gain is 
achieved by combining dimensions as the 
models of this paper imply, and if the 
arguments about dimensional continuity 
in the introduction are Correct, con- 
tinuing the conjecture would imply that 
continuity is a requirement for a re- 
dundancy gain. But it also seems that 
if the stimuli are not integral, whether 
they are continuous or not, there can be 
no redundancy gain. 


G. R. LOCKHEAD 


REFERENCES 


Eriksen, C. W., & Hake, H. W. Multi- 
dimensional stimulus differences and accu- 
racy of discrimination. J. exp. Psychol, 
1955, 50, 153-160. s 

Garner, W. R. Rating scales, discrimina- 
bility, and information transmission. Psy- 
chol. Rev., 1960, 67, 343-352. 

Garner, W. R. Uncertainty and structure 
as psychological concepts. New York: 
Wiley, 1962. . 

GanNER, W. R. Data presentation. In F. A. 
Geldard (Ed.), Communication processes. 
New York: Pergamon Press, in press. 

Garner, W. R., & CREELMAN, C. D. Effect 
of redundancy and duration on absolute 


judgments of visual stimuli. J. exp. 
Psychol., 1964, 67, 168-172. k 
Garner, W. R, & Lee, W. Analysis of 


redundancy in perceptual discrimination. 
Percept. mot. Skills, 1962, 15, 367-388. 

Garner, W. R, & McGit, W. J. _ The 
relation between information and variance 
analyses. Psychometrika, 1956, 21, 219-227. 

Lersowrtz, H. W., Myers, N. A., & GRANT, 
D. A. Radial localization of a single stimulus 
as a function of luminance and duration of 
exposure. J. Opt. Soc. Amer., 1955,45, 76-78. 

McGitt, W. J. Serial effects in auditory 
threshold judgments. J. exp. Psychol. 
1957, 53, 297-303. 

McNemar, Q. Psychological statistics. New 
York: Wiley, 1960. ; 
Merrymay, J. G., & ALLEN, H. E. An im- 
proved electronic tachistoscope. Amer. J. 

Psychol., 1953, 66, 110-114. 

Miter, G, A. Note on the bias of informa- 
tion estimates. In H. Quastler (Ed.), 
Information theory in psychology. Illinois: 
Free Press, 1955. Pp. 95-100. 

Mitter, G. A. The magical number seven 
plus or minus two: Some limits on our 
capacity for processing information. Psy- 
chol. Rev., 1956, 65, 81-97. 

NrwnaLL, S. M., Nickerson, D., & JUDD, 
D. B. Final report of the OSA subcom- 
mittee on the spacing of the Munsell colors. 
J. Opt. Soc. Amer., 1943, 33, 385-418. — 

Suerarp, R. N. Attention and the metric 
structure of the stimulus space. J. math. 
Psychol., 1964, 1, 54-87. f 

Sueparp, R. N., & Cuane, J. J. Stimulus 
generalization in the learning of classifica- 
tions. J. exp. Psychol., 1963, 65, 94-102. 

Swerts, J., Surprey, E., McKey, M. 
GREEN, D. Multiple observations of 
signals in noise. J. Acoust, Soc. AMêer-, 
1959, 31, 514-518. 


(Received April 9, 1965) 


Journal of Experimental Psychology 
1966, Vol. 72, No. 1, 105-108 


DEVELOPMENTAL STUDY OF COLOR-WORD 
INTERFERENCE* 


PETER H. SCHILLER 
Massachusetts Institute of Technology 


The interference observed in color naming when words and colors ap- 
pear in incongruent combinations was studied in children of different 


, ages. 


The results show that the extent of interference is minimal in 


1st graders, becomes maximal in the 2nd and 3rd grades, and then de- 
clines gradually. It was also found that the speed of reading words 
designating colors is consistently faster after the 2nd grade than the 


naming of color strips. 
questions. 


This study is concerned with the in- 
terference which comes about when 
color words are printed in conflicting 
colors (e.g., the word "red" in the color 
"green," etc.) and the task is to name 
the colors and to ignore the words. 
This task, which was first used by 
Jaensch (1929), is a difficult one for 
adults. Interference can be observed 
by noting the increased time needed to 
complete such a task in contrast to the 
naming of color strips, the errors com- 
mitted, and also by noting various signs 
of physical strain such as embarrassed 
laughter and body movement. 

In an attempt to define some of the 
sources of this interference Klein 
(1964) varied the relationship between 
colors and text. Each word was 
printed either in red, green, blue, or 
yellow. However, different sets of 
words were used; nonsense syllables, 
rare English words, common words, 
common words implicating colors in 
their meaning, color-names related to 
the colors actually employed, and, fi- 
nally, the color-names red, green, blue, 
and yellow, but presented in incon- 
gruous combinations (standard set). 
Klein found that color-naming time in- 
creased from set to set in the order 

1This investigation was supported by a 
National Institutes of Health Predoctoral 
Fellowship FPD-11, 589, 


These findings raise a number of theoretical 


given; this interference was found to 
be minimal with nonsense syllables and 
maximal with the standard set. 

One way to gain further understand- 
ing of the source of interference is to 
investigate it developmentally. Thus, 
by studying the performance of chil- 
dren of various ages on tasks similar 
to those employed by Klein, it should 
be possible to show how interference 
on the color-word interference task 
emerges and how it generalizes to 
words closely and distantly related to 
colors. 


METHOD 


Subjects—The 240 Ss in this experiment 
were obtained during the fall term from 
Grades 1, 2, 3, 5, 8 in a public school, and 
a college freshman class. Each grade was 
divided into four groups of 10 Ss. All Ss 
had normal color vision as assessed by iso- 
chromatic plates. 

Procedure.—The Ss were tested individ- 
ually in one experimental session. They 
were first given an 83 X 11 in. white card on 
which were printed 40 color words in black. 
The four color words, each appearing 10 
times in random order, were red, blue, green, 
and yellow. The words were in large pica 
lower-case type with one space between letters 
and six spaces between words; this made the 
reading task possible for the youngest Ss. 
The reading of this card was followed by a 
card containing 40 color strips of the four 
colors. The length and width of these color 
strips corresponded to that of the color 
words, The Ss were asked to name the 
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eo word reading 


—+ color noming 


mecn reading time in seconds 


college 
GRADES 
Fic. l. Mean word-reading time on black 
color-word card and color-naming time on 
color-strip card. 


colors. On the basis of their speed of per- 
formance on this card, Ss in each grade were 
placed into four matched groups. Each of 
these groups received one of four different 
test cards; for each of these the task was to 
name the colors, On these cards words were 
printed in the colors red, blue, green, and 
yellow as follows: 

Card A: The words RED, BLUE, GREEN, and 
YELLOW were printed in incongruent com- 
binations (ie. the word red was printed in 
green color, etc.). 

Card B: The words TAN, PURPLE, GRAY, 
and BLACK. 

Card C: The words LEMON, FIRE, SKY, and 
GRASS in incongruent color combinations (ie., 
lemon was never printed in yellow). 

Card D: The consonant syllables Hyn, 
CVGJC, BHDR, GSXRQ. 

Thus each S received three cards; the 
black color-word card, the color-strip card, 
and one of the four test cards. Cards A, B, 
C, and D correspond to Cond. F, E, D, and 
A in Klein's experiment. 

Prior to performance on each of the cards 
Ss were given a practice line of five items 
corresponding to the contents of that card. 
After reading the practice line correctly S's 
were instructed on each card to name colors 
or read as quickly and accurately as possible. 
The total reading time on each card was 
measured by means of a stopwatch. 


RESULTS 


The mean reading times on the black 
color-word card and on the color-strip 
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card are shown in Fig. 1 for the six 
grades. The analysis of variance for 
these data shows significant differences 
between the cards (F = 2110, p< 
001), among the grades (F = 7754, 
p < .001), and a significant interaction 
(F = 41.7, p < .001). Time perform- 
ance on both of the cards decreases 
with increasing age. Color naming is 
faster than word reading in Grade 1, 
but this is reversed by Grade 2. After 
Grade 2, the two curves run parallel. 
This suggests that the significant in- 
teraction is due primarily to Ss in 
Grade 1. In order to test this, a second 
analysis of variance was carried out on 
these data, but with Grade 1 omitted, 
This analysis shows significant differ- 
ences between cards (F = 342.3, p< 
001), and among grades (F = 1962, 
P < .001), but no significant interac- 
tion (F = 5.22, p > .05). 

The mean time scores on the Test 
Cards A-D are presented in Table 1. 
The analysis of variance for these data 
shows a significant difference among 
the grades (F = 87.22, p < .001) and 
among the four test cards (F = 25.92, 
P «.001). The interaction between 
cards and grades was not significant 


(F = 1.62, p > 05). 


TABLE 1 
MEAN COLOR-NAMING TIME ON 
Carns A-D 
Grades 
Test LEE 
1 2 3 5 8 | FR 
(Sal bebe 
A 
M | 64.2 | 89.6 | 82.7 | 62.0 | 48.1 33.0 
gsp 11.1 | 14.4 | 14.9 | 12.8 | 9.8] 8 
M | 61.6 | 73.9 | 76.6 | 51.2 | 39.2 30.0 
SD | 13.4 | 12.6 | 16.3 | 68] 9.6} 5: 
M | 65.6 | 66.2 | 63.3 | 48.5 | 36.3 A 
Doe. 8.5 | 13.7 | 142 | 1.3 | 10.3 | 5 
M_| 58.0 | 59.1 | 56.3 | 39.2 | 32.6 ue 
SD | 21.2 | 14.9 | 13.7 | 65| 53| 4 


AGE AND COLOR-WORD INTERFERENCE 


In order to compare the interference 
effects for the various grades, it is 
necessary to have measures which are 
independent of reading speed, since the 
latter increases with age. For this rea- 
son, a ratio score was obtained for each 
S between performance on the color 
strip naming card and the interference 
cards. Thus, a score of 1.00 indicates 
that performance on a test card took 
as long as performance on the color- 
strip card; a score of 2.00 indicates 
that a test card took twice as long as 
the color-strip card. These data are 
presented in Fig. 2. As can be seen, 
there is practically no interference on 
any of the test cards in Grade 1. The 
greatest interference occurs in Grades 
2 and 3 and then gradually decreases. 
The differences between the four test 
cards are minimal in Grade 1. After 
the third grade, the curves for the 
four test cards are parallel. The analy- 
sis or variance for these ratio scores 
showed significant differences among 
the grades (F—418, p<.001), 
among the cards (F = 51.4, p < .001), 
and a significant interaction between 
grades and cards (F = 2.13, p < .05). 


Discussion 


The results for the black color word 
and for the color-strip cards show that 
performance on both of these tasks im- 
proves with age. Except for the first 
grade, words are read faster than colors 
are named, Beyond Grade 1, the rate of 
improvement is similar on these two 
tasks as shown by the parallel curves in 
Table 1. Similar findings were reported 
by Comalli, Wapner, and Werner (1962) 
who studied time performance on the 
color-word test from age 7 to 80. Their 
results showed word reading consistently 
faster than color-strip naming; the curves 
ran parallel over the entire age range. 

Why is word reading faster than color 
naming? A frequently held view (Klein, 
1964) is that differential learning is the 
prime factor which accounts for the faster 
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Fic. 2. Time-ratio scores for the four test 
cards (test card/color-strip card). 


performance on the word-reading task. 
Reading is a highly learned and practiced 
skill; by contrast, color-naming is a much 
less practiced activity. 

The data in this experiment cast some 
doubt on this view. Were learning the 
prime factor in the differential speed 
performance on these tasks, one would 
expect the rate of improvement on word 
reading to be greater with increasing age 
than on color naming. Since, with the 
exception of Grade 1, the differences be- 
tween these two tasks are constant, differ- 
ential practice alone cannot be the reason 
for faster word reading. What then could 
account for this difference? One possi- 
bility is that words provide a more effi- 
cient medium for coding than colors, 
This might be so because printed words 
have greater redundancy. Each letter, 
parts of letters, and parts of words can 
provide the same or related information, 
The number of letters in a word might 
also serve as an important selective cue. 
Simultaneous processing of these cues 
could result in eliciting the correct re- 
sponse in minimal time, A color strip, 
by contrast, carries less information and 
thus takes a longer time to process. 

The data on the four test cards show 
that in the first grade performance on 
these cards is as fast as on the color-strip 
card. In the second and third grades, 
interference is maximal with the color 
words and minimal with nonsense sylla- 
bles. In the second grade, the differences 
among Test Cards 2, 3, and 4 are small, 
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suggesting that any word or letter stimu- 
lus—other than the same color-words— 
may provide interference. After Grade 
2, the differences among the four test 
cards becomes pronounced and the extent 
of the interference decreases gradually. 
Comalli, Wapner, and Werner (1962) 
employed one interference card (similar 
to Card A in this study, but without the 
word and color “yellow”); they found 
that interference declined rapidly from 
age 7 to 17-19 and then rose again. The 
rate of decline on this card was signifi- 
cantly faster than on the color-strip nam- 
ing and word-reading cards. These find- 
ings differ from those of the present study 
in that they did not find an initial increase 
in interference. This difference is most 
likely due to the fact that the first group 
of children tested in the present study did 
not read as proficiently and were younger 
than those in Comalli’s study. 

One possible explanation of the finding 
that interference is greatest in Grades 2 
and 3 in this study is that, at this age 
level, the two competing stimuli are of 
more equal strength, and thus have ap- 
proximately equal probabilities of evoking 
a response. Performance at this level 
thus reflects undifferentiated response 
tendencies. When the two response tend- 
encies become more differentiated, it be- 
comes possible to produce the correct 
response (even when it is the weaker of 
the two) with greater speed and less 
effort. 

The data in this study, as well as in 
the study of Comalli, Wapner, and Wer- 
ner (1962), are based on time scores. 
Some of the factors which underlie the 
increased time scores on the color-word 
interference test have been investigated 
by Rand, Wapner, Werner, and McFar- 
land (1963). They discerned several 
kinds of deviant verbal responses on the 
color-word test. Inappropriate color re- 
sponses, contaminated responses (such as 
“breen,” “gred,” etc.), and omissions 
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were found to decrease with age. Non- 
linguistic utterances, such as “uh,” “ah,” 
etc., were found to increase with age. 

The deviant verbal responses described 
in the above study represent one of the 
behavioral indexes of the interference 
on the color-word test. A more striking 
factor, perhaps, has to do with the physi- 
cal manifestations of strain as seen in 
laughter, twitching, rapid movement of 
the extremities, etc, These factors, which 
are so difficult to study quantitatively, 
seem to be more pronounced in children, 
although there is considerable individual 
variation, Gaining control of these fac- 
tors may be instrumental in reducing the 
interference on the color-word test with 
development. How this comes about is 
probably related to how language exer- 
cises a regulatory function in behavior 
which has been extensively studied by 
Luria (1960) and has also been investi- 
gated by Lovaas (1964). 
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12 groups of 6 rats each were trained to make a lever-pressing response 
using different combinations of continuous- and partial-reinforcement 
schedules and were then extinguished. In general continuous reinforce- 
ment before partial reinforcement was found to lead to a large reduction 


in resistance to extinction. 


One of the difficulties facing E who 
uses partial-reinforcement schedules is 
that of training a naive 5 to respond 
when only a certain proportion of the 
responses are actually reinforced. As 
Jenkins and Stanley (1950) pointed 
out, many investigators in overcoming 
this difficulty have adopted the pro- 
cedure of using continuous reinforce- 
ment at the beginning of acquisition, 
and continuing to reinforce every re- 
sponse until the response tendency at- 
tains some minimal strength. At this 
stage the continuous reinforcement of 
the response is discontinued and the 
partial schedule brought into effect for 
the first time. 

Although the convenience and ex- 
pediency of this procedure cannot be 
questioned, experimental results re- 
ported by Sutherland, Mackintosh, and 
Wolfe (1965) provide a basis for a 
critical assessment of its validity. In 
this experiment continuous reinforce- 
ment before partial reinforcement was 
found to lead to a large reduction in 
the resistance to extinction of a run- 
ning response. It might be argued, on 
the basis of these results, that in any 
experiment in which the effects of dif- 
ferent reinforcement schedules upon re- 
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sistance to extinction are being com- 
pared and assessed, the procedure of 
giving the partial-reinforcement groups 
a number of continuously reinforced 
trials at the beginning of acquisition, 
should not be adopted. An experi- 
ment was consequently performed to 
assess the effects upon resistance to 
extinction of different numbers of con- 
tinuous-reinforcements before partial 
reinforcement. 


METHOD 


Subjects —The Ss were 72 experimentally 
naive, Wistar strain, male, albino rats from 
the closed colony maintained by the Depart- 
ment of Psychology of the University of the 
Witwatersrand. Their mean age at the be- 
ginning of the experiment was 85 days. The 
Ss were randomly assigned to the different 
experimental groups. 

Apparatus—Two Gerbrands operant con- 
ditioning units were used as the apparatus. 
Lever-pressing responses were recorded on 
cumulative recorders and on digital counters. 
The reinforcement used in acquisition was 
one Noyes 45-mg. rat tablet. 

Experimental design.—Twelve groups of 
six Ss each were used in this experiment. 
The conditions which obtained in acquisition 
for these groups are shown in Table 1. 
Groups 1, 2, and 3 might be regarded as the 
control groups, since these groups were not 
given any continuously reinforced trials, but 
were trained from the beginning of acquisi- 
tion on the partial-reinforcement schedule. 
Each of the remaining cells in Table 1 de- 
scribes the acquisition conditions for one of 
the other groups—Group 4, for example, was 
given 10 continuous reinforcements at the 
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TABLE 1 


CONDITIONS OBTAINING IN ACQUISITION 
a EEE T E ETEA 


Continuous Reinforcements 


50 Group 1 | Group 4 | Group 7 | Group 10 
100 Group 2 | Group 5 | Group 8 Group 11 
200 Group 3 | Group 6 | Group 9 | Group 12 


beginning of acquisition, before being given 
50 partial reinforcements of the response. A 
VR 4 schedule of partial reinforcement was 
used in this experiment, the only rationale 
for the use of this particular schedule being 
that it had previously been found to lead to 
a clear and consistent partial-reinforcement 
effect. The numbers in Column 2 of Table 
1 consequently refer to the numbers of rein- 
forcements on this schedule given to the dif- 
ferent groups. 

For all groups the acquisition trials were 
followed by a series of extinction trials. For 
all Ss these trials were continued to an 
extinction criterion of one 5-min. period in 
the box without a single response. 

Pretraining—The Ss were adapted to a 
23-hr. food-deprivation Schedule for 14 days 
prior to the start of the experiment. On 
each of the last four of these days Ss were 
fed individually a number of the food tablets. 

Acquisition and. extinction —In both ac- 
quisition and extinction Ss were given one 
5-min. Skinner-box session per day. The Ss 
were randomly assigned to one of the two 
Skinner boxes and were placed in the same 
box throughout the experiment. In acquisi- 
tion lever-pressing responses were reinforced 
with tablets of food according to the specifi- 
cations of the relevant reinforcement sched- 
ule. If the completion of acquisition training 
occurred prior to the end of a 5-min. period, 
S was removed from the apparatus and the 
extinction series of trials begun on the fol- 
lowing day. In extinction lever-pressing re- 
sponses were not reinforced. Once an $ at- 
tained the criterion of extinction no further 
trials were given. 


RESULTS 


Acquisition —The results of the ac- 
quisition series of trials will not be 
discussed in any detail, but it might 
be mentioned that the acquisition of 
the response was much slower in the 
groups which started acquisition on the 
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partial-reinforcement schedule, than it 
was in the groups given continuous re- 
inforcement. Towards the end of the 
acquisition trials all the groups were 
responding at approximately the same 
rate. This result might have been ex- 
pected since all the groups were being 
reinforced on the same schedule of re- 
inforcement during the later acquisition 
trials, 

Extinction—The group mean num- 
bers of extinction responses (M) and 
the group mean numbers of extinction 
sessions (M’) are shown in Table 2. 
Analysis of variance performed on the 
group mean numbers of extinction re- 
sponses gave a highly significant F 
value, F (11, 60) = 60.61, p < .001. 
Duncan's multiple-range test (Ed- 
wards, 1960) was used to determine 
which of the differences between the 
group means were significant. This 
test showed the differences between the 
mean numbers of extinction responses 
made by the three partial-reinforcement 
control groups—Groups 1, 2, and 3— 
to be highly significant, p< .001. 
Table 2 shows that each of the means 
for the three groups given 10 continu- 
ous reinforcements before partial re- 
inforcement—Groups 4, 5, and 6—was 


TABLE 2 


Group MraN NUMBERS or EXTINCTION 
Responses (M) AND Group MEAN 
Š NUMBERS oF EXTINCTION 


Sessrons ( M1) 

Å M 

Group M SD M SD 
Me 

1 353 32 28 11 

2 481 40 29 9 

3 581 41 32 9 

4 326 27 25 12 

5 445 39 28 5 

6 521 37 29 5 

7 200 13 13 3 

8 216 28 13 3 

9 292 28 14 4 

10 223 23 14 4 

11 241 24 14 4 

12 323 30 15 4 


CONTINUOUS FOLLOWED BY PARTIAL REINFORCEMENT 


Groups 147410 o- —o— o 
Groups 25,841} @---0--» 
Groups 369412 s—s—s 


responses 


o 20 40 60 80 100 
Number of — CRFs 


Fic. 1. Group mean numbers of 
extinction responses. 


smaller than the mean for the appropri- 
ate control group. However Dun- 
can's test showed the differences be- 
tween the means for Groups 1 and 4, 
Groups 2 and 5, and Groups 3 and 6 
to be nonsignificant. Highly signifi- 
cant differences, p < .001, were found 
between Groups 1 and 7, Groups 2 and 
8, and Groups 3 and 9, indicating that 
for each of the three different numbers 
of partial reinforcements the mean 
numbers of extinction responses were 
significantly smaller in the groups 
given 50 continuous reinforcements at 
the beginning of acquisition, than they 
were for the partial-reinforcement con- 
trol groups. An unexpected feature of 
the results of this experiment was the 
small increase in the mean numbers of 
extinction responses made by each of 
the groups given 100 continuous rein- 
forcements before partial reinforce- 
ment, as compared with the groups 
given 50 of these continuous reinforce- 
ments. The upward trend between the 
last two points on the three curves in 
Fig. 1 illustrates this result. Dun- 
can's test however showed the differ- 
ences in the mean numbers of re- 
sponses made by Groups 7 and 10, 
Groups 8 and 11, and Groups 9 and 
12 to be nonsignificant. 
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As can be seen from Table 2, close 
agreement was found in the results of 
this experiment between the two meas- 
ures of resistance to extinction: Both 
the group mean numbers of extinction 
responses and the group mean numbers 
of extinction sessions showed the re- 
sistance to extinction of the response 
to be greatest in the partial-reinforce- 
ment control groups, and smallest in 
the groups given 50 continuous rein- 
forcements at the beginning of acqui- 
sition, before being changed to the 
partial-reinforcement schedule. 


Discussion 


The results of this experiment confirm 
and extend the results reported by Suther- 
land, Mackintosh, and Wolfe (1965), and 
show that giving continuous reinforce- 
ment prior to partial reinforcement leads 
to a reduction in resistance to extinction 
which is highly significant when rela- 
tively large numbers of continuous rein- 
forcements are used. Due to the limited 
range of different numbers of reinforce- 
ments used in these experiments, the gen- 
erality of this effect cannot be assessed. 
It might be proposed, however, on the 
basis of these results, that the procedure 
of giving a number of continuously 
reinforced trials at the beginning of ac- 
quisition should not be adopted in experi- 
ments in which the effects of different 
reinforcement schedules upon resistance 
to extinction are being compared and 
assessed. 

As Sutherland et al. (1965) pointed out, 
results showing a decreased resistance to 
extinction when continuous reinforcement 
is given before partial reinforcement, do 
not support predictions made by the dis- 
crimination theory (Mowrer & Jones, 
1945) and by the theory of Lawrence and 
Festinger (1962)*; on the other hand, 
these results confirm a prediction derived 
from the two-stage model of the learning 


3 The results of this experiment might also 
be regarded as incompatible with Amsel’s 
frustration interpretation of the partial- 
reinforcement effect (Amsel, 1958). 
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process currently being developed by 
Sutherland (1959, 1964). According to 
this model continuous reinforcement leads 
to the switching in of a small number of 
analyzers whose outputs are consistently 
correlated with reinforcement; if partial 
reinforcement is given after continuous 
reinforcement these analyzers interfere 
with the switching in of new analyzers, 
and so the response is conditioned to the 
outputs of a relatively small number of 
analyzers. If, on the other hand, partial 
reinforcement is used from the beginning 
of acquisition, the response is conditioned 
to the outputs of a large number of ana- 
lyzers, since no one analyzer has its out- 
put consistently correlated with rein- 
forcement, and so a large number of 
analyzers are tried out. Since extinction 
of a response is held to depend upon the 
extinction of the outputs of these ana- 
lyzers, groups trained from the beginning 
of acquisition under conditions of partial 
reinforcement would be expected to show 
a greater resistance to extinction than 
would groups given continuous reinforce- 
ment before partially reinforced trials. 
This prediction was confirmed by the 
results of the present experiment. 
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EFFECTS OF NONREWARDED FORCED RESPONDING 


ON ACQUISITION AND REVERSAL OF A 
POSITION DISCRIMINATION 


CHARLES N. UHL: 
University of Utah 


Rat Ss were trained on a T maze position discrimination. Within 
blocks of 5 acquisition trials, 4 forced trials were followed by 1 free 
trial. The 5X 2 design incorporated: (a) 0—, 1—, 2—, 3—, or 4— 
forced responses to the incorrect arm (S—) per block of 4 forced trials, 
and (b) 1 or 6 prediscrimination rewarded-placements in a duplicate of 
the maze arms. 2 control groups were included which were treated 
like Groups 1— and 3— except that all forced responses were rewarded. 
Following discrimination acquisition, all Ss were given reversal train- 
ing. The speed of discrimination acquisition increased with the number 
of forcings to S— per block. Control groups learned more slowly than 
comparable experimental groups. Rewarded placements had no effect 
on acquisition or reversal. The function relating forcing to S— in 
acquisition to trials to learn the reversal was an inverse U with Group 
2— being slowest to learn the reversal. These findings were interpreted 
to support theories of discrimination learning making use of the concept 


of frustrative nonreward. 


The present study was conducted to 
assess the relative contribution of re- 
inforcement and nonreinforcement in 
the acquisition and reversal of a dis- 
crimination. In recent years, several 
theoretical examinations have stressed 
the aversive consequences of nonrein- 
forcement, likening nonreinforcement 
to punishment (Amsel, 1958, 1962; 
Mowrer, 1960; Spence, 1960; Wagner, 
1966). As applied to discrimination 
learning, this conception considers the 
emerging discriminative performance 
to be a product of learning to actively 
avoid S— and of learning to approach 
S+. The mechanisms underlying both 
approach and avoidance are assumed 
to be anticipatory responses conditioned 
to the discriminative stimuli, anticipa- 
tory frustration (r;) in the case of S— 
and anticipatory consummation (r,) in 
the case of S+. It follows that pro- 


1The writer wishes to express his appre- 
ciation for the assistance of D. Mark Larsen, 
a National Science Foundation undergradu- 
ate research participation fellow. 


cedures which increase S’s opportunity 
to acquire ry to S— or to learn to avoid 
S— should affect discrimination acqui- 
sition. Discrimination should be facili- 
tated if S— remains negative following 
such training, and it should be im- 
peded if S— becomes positive, 
Relatively little evidence exists on 
this point. Birch (1955) and Fitz- 
water (1952) gave their Ss prediscrim- 
ination training in which the proportion 
of responses forced to S— and to S+ 
was varied. They both found that sub- 
sequent discrimination performance 
was impeded if all forced responses 
were to S+, but an increasing propor- 
tion of forcings to S— had little effect. 
Denny and Dunham (1951) found that 
forced responses to S— interspersed 
with free trials produced more rapid 
discrimination. Lachman (1961) var- 
ied the proportion of responses forced 
to S— and found that free trial per- 
formance improved with an increase in 
proportion of responses forced to S—. 
A study by Warren and Kimball 
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(1959) suggests that S— may exert 
more control over discrimination per- 
formance than S+. Cats in a learning- 
set procedure displayed greater trans- 
fer effects when S— was retained in 
the transfer test than when S+ was 
retained. 

A forcing procedure was used in the 
present study which permitted con- 
trol over the number and sequence of 
responses forced to S— and to S+ 
throughout the course of acquisition. 
In order to control for any inhibitory 
effects of response repetition, two con- 
trol groups were included all of whose 
forced responses were rewarded. The 
number of rewarded placements in a 
goal box prior to discrimination train- 
ing was varied in order to establish dif- 
ferent degrees of conditioning of an 
anticipatory consummatory response 
(rg) to the goal boxes used in discrimi- 
nation training. If learning to avoid 
S— is based on a frustrative reaction 
to nonreward, and if the frustrative re- 
action does not take place until after 
f, has been conditioned as Amsel 
(1958) claims, then discrimination per- 
formance should be a positive function 
of the number of prediscrimination re- 
warded placements. Information about 
the effects of these variables on dis- 
crimination reversal also was obtained. 


METHOD 


Subjects —The Ss were 56 male albino 
rats (Sprague-Dawley) each of which was 
between 60 and 90 days old upon arrival at 
the laboratory. An additional 5 rats were 
dropped from the experiment due to illness 
or failure to perform. 

Apparatus.—Training took place in an en- 
closed T maze. This consisted of a start 
box 11 in. long, a stem 184 in. long, and 
arms 22 in. long. The interior dimensions 
of all parts of the maze were 4 in. wide and 
5 in. high. Guillotine doors were placed 
between the start box and stem and 3 in. 
from the choice point into each arm. Bar- 
riers 13 in. high extended the width of each 
arm 4 in. from the end of the arm. Their 
purpose was to prevent S from seeing the 
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food reward located in the alcove behind the 
barrier until he inspected the alcove. The 
entire maze was constructed of 3-in. plywood 
painted a uniform gray. 

Design—A block of trials in acquisition 
consisted of four forced trials followed by 
one free trial. Forcing was accomplished by 
lowering one of the retrace doors before S 
was placed in the start box. The basic 
design was a 5 X 2 factorial incorporating: 
(a) 0—, 1—, 2—, 3—, or 4— forced responses 
to S— per block of four forced trials, and 
(b) one or six rewarded placements in a 
duplicate of the maze arms before discrimi- 
nation training began. Four Ss were as- 
signed to each cell of the factorial. 

To control for any effects of the sequence 
of forced rewarded and unrewarded trials, 
each S was run on a random permutation 
of the sequences of forcings to S+ and S- 
possible for his group. Such permutations 
were arranged so that no two Ss within a 
group had the same sequence on the same 
ordinal block of forced trials (except for 
Ss receiving all rewarded or all unreward 
forced trials). 

Two additional groups of eight Ss each 
were run as controls for Groups 1— and 3-. 
The control for Group 1— is referred to as 
1C and the control for Group 3— is referr 
to as 3C. Control Ss were treated exactly 
like Ss in the corresponding experimental 
group except that all forced trials, whether 
to S+ or to S—, were rewarded for control 
Ss. Thus 1C was forced three times to St 
and once to S— in each block of forced 
trials, and vice versa for Group 3C. In the 
case of the control groups, S— refers to the 
maze arm which did not contain reward 0n 
free trials. The purpose of the control pro- 
cedure was to permit a determination of the 
effects of forced nonreward on discrimination 
performance unconfounded by forced re- 
sponse repetition. Any superiority in dis- 
crimination of Groups 1— and 3— over theif 
corresponding controls groups can be attrib- 
uted to the fact that the controls did not 
experience nonreward on forced trials to S~: 
Half of each control group received one 
rewarded maze arm placement and half re- 
ceived six such placements before discrimi- 
nation training. 

Procedure—Upon arrival at the labora 
tory, Ss were housed in individual cages an 
maintained on ad libitum food and water 
for 7 days. On the eighth day Ss were 
placed on a feeding schedule consisting © 
15 gm. Purina chow each day for the T€ 
mainder of the experiment; feeding too! 
place about 15 min. after S had been worke' 
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with. The Ss were handled in pairs 15 min. 
per day for a week. On the sixteenth day, 
Ss were permitted free exploration of the 
maze for 15 min. in pairs. On the seven- 
teenth and eighteenth days, Ss explored the 
maze individually for 15 min. Ten free trials 
without reinforcement were run on the nine- 
teenth day to determine position preferences. 
Immediately after a measure of position pref- 
erence had been obtained for S, this measure 
was used in determining his group assign- 
ment. The Ss were assigned to groups in 
such a way that the mean preference for the 
S+ maze arm was .50+.10, eg., if one S 
in a group made eight responses to the arm 
which was to be S+ for him, a second S 
was assigned to the group who made two 
responses to what was to be his S+ arm. 
The left arm was S+ for half the Ss in each 
group and the right arm was S+ for the 
other half. 

After S had been assigned to a group on 
the day of the position preference test, he 
was placed in the duplicate of the maze arm 
with a clear plastic cup containing 2 Noyes 
45-mg. pellets located in the alcove. The 
S remained in the duplicate maze arm until 
he had consumed the food pellets. If S was 
in one of the groups receiving six rewarded 
placements, he was given five more place- 
ments, 

During training, Ss were run in squads of 
four in rotation within a squad. Each daily 
session consisted of four forced trials, a 
free trial, four forced trials, and a free trial, 
in that order. At the start of each trial, 
S was placed in the start box and the door 
leading to the stem was raised when S faced 
it. A stopwatch was started when the start 
door was raised and was stopped as soon as 
the full length of S’s body passed under a 
retrace door which was lowered behind S. 
The S was removed from the arm after 15 
sec. whether the trial was rewarded or not. 
Reward consisted of 2 Noyes 45-mg. pellets 
located in the alcove. A response to S+, 
whether free or forced, always was rewarded. 
The criterion of choice was that S proceeded 
far enough into an arm to permit closure 
of the retrace door behind him. The inter- 
trial interval was determined by the speed 
with which the remaining members of a 
squad ran; it ranged from 4 min. to 10 min. 
Acquisition continued until S reached a 
criterion of 10 consecutive correct responses 
on free trials. 

After reaching the acquisition criterion, S 
was placed on reversal training which con- 
sisted of 10 free trials per day. Except for 
the fact that all trials were free, reversal 
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training duplicated the acquisition procedure. 
Reversal continued until S reached a cri- 
terion of 10 consecutive correct responses. 


RrsurTS 


Acquisition.—The number of errors 
and the number of trials to criterion 
were tabulated for each S. Since both 
measures provided the same results, 
only the results for errors will be re- 
ported, 

Figure 1 shows the mean number of 
errors to criterion as a function of 
forced trials to S— and rewarded place- 
ments. A two-way analysis of variance 
was conducted, excluding the control 
groups, using a square-root transforma- 
tion of the errors measure. A sum- 
mary of the analysis of variance is pre- 
sented in Table 1. The only significant 
F ratio was that of the main effect of 
forcing (p « .001). Orthogonal poly- 
nomial comparisons of the significant 
forcing effect revealed that errors to 
criterion decreased as a linear function 
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MEAN ERRORS TO ACQUISITION CRITERION 


Or MEA 
NUMBER OF TRIALS / BLOCK FORCED TO S- 

Fic. l Mean errors to the acquisition 
criterion as a function of the number of 
forced responses to S— per block of four 
forced trials and of the number of rewarded 
placements in the duplicate of the maze arms. 
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TABLE 1 


ANALYSIS OF VARIANCE OF TRANSFORMED 
Errors TO ACQUISITION 


CRITERION 
Source af F 
Placements (P) 1 0.79 
Forcing (F) 4 10.84*** 
Linear 1 31.92*** 
uadratic 1 4,58* 
ubic 1 0.57 
PXF 4 0.60 
Within (MS) 30 (0.94) 
$i. 


of the number of forced unrewarded 
trials per block. The significant quad- 
ratic function reflects the diminishing 
effect of forcing to S— with increasing 
numbers of forced responses to S—. 
Since rewarded placements did not en- 
ter into a significant F, all groups, in- 
cluding controls, were collapsed with 
respect to that variable for a Duncan 
multiple-range test of the differences 
between individual groups. This test 
showed (5 < .05) that Groups 3C, 1C, 
and 0— did not differ from each other, 
and that all three made more errors 
than Groups 3— and 4— which, in 
turn, did not differ from each other. 
Also, Group 3C made more errors than 
3—, and 1C made more errors than 
1— (p «.05). 

A derived measure based upon run- 
ning speed on forced trials was ob- 
tained for Groups 3C, 1C, 3—, and 
1— in an attempt to assess the emer- 
gence of any preference for one or the 
other of the arms early in acquisition. 
This preference measure consisted of 
the running speed (10/elapsed time) 
to the arm into which S was forced 3 
of the time (2 arm) minus the running 
speed to the arm into which S was 
forced 4 of the time. The preference 
measure was obtained on each of the 
first 5 days of acquisition. These data 
are shown in Fig. 2. A 2 X 2 factorial, 
incorporating control vs. experimental 
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conditions (C-E) and forcing 1 vs. $ 
to S— (F), was subjected to an analy- 
sis of variance with repeated measures 
over days. Both the main effects of 
C-E and F were significant at the .01 
level or less, F (1, 28) — 8.69 and 
46.55, respectively. The significant (p 
< .001) C-E x F interaction, F (1, 
28) = 24.70, was due to the large dif- 
ference in preference for the } arm be- 
tween experimental groups relative to 
the difference between control groups. 
That is, Ss of Group 1— preferred the 
ł arm which was S+ for them and Ss 
of Group 3— strongly preferred the 1 
arm which was S+ for them, but Ss 
of Control Groups 1C and 3C showed 
no preference for either arm presum- 
ably because they were rewarded in 
both arms on forced trials. A signifi- 
cant (p< .01) Days X F Interaction, 
F (4, 112) — 4.89, reflects the increas- 
ing difference in preference over days 
for the 3 arm between the groups for 
whom that arm was S+ as compared 
to those for whom it was S—. 
Reversal—The number of errors 
and trials to criterion, and the number 
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Fic. 2. Mean preference measure (10/ 
mean elapsed time to the arm into which 
forced 2 of the time minus 10/mean elapse 
time to the 4 forced arm) for comparable 
experimental and control groups during the 
first 5 days of acquisition training. 
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MEAN ERRORS TO REVERSAL CRITERION 


of trials to the first reversal response 
were tabulated for each $. Since the 
results for all three measures were 
equivalent, only the results for errors 
will be reported. 

Figure 3 shows the mean number of 
errors to criterion as a function of non- 
rewarded forced trials and rewarded 
placements. A two-way analysis of 
variance, excluding the control groups, 
was applied to a square-root transfor- 
mation of the errors measure and is 
summarized in Table 2. As in the 
acquisition analysis, only the main ef- 
fect of forcing was significant (p < 
.001). The quadratic component of 
the forcing effect was highly signifi- 
cant (accounting for 58% of the vari- 
ance), while the cubic and quartic 
components were marginally signifi- 
cant. The quadratic component of the 
function reflects the fact that errors to 
reversal rose to a maximum when 
forced acquisition trials were divided 
equally between S+ and S—. The 
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marginally significant cubic and quartic 
components probably are due largely 
to the irregularities in the forcing 
function and to the upturn in errors 
between Groups 3— and 4—. A Dun- 
can multiple-range test was performed 
to compare (p < .05) all groups after 
they had been collapsed across the 
two levels of rewarded placements. 
Group 1— made more errors than 1C, 
Group 3— made more errors than 3C, 
and Group 2— made significantly more 
errors than any group except 1—. 


DISCUSSION 


Acquisition—The facilitation of posi- 
tion discrimination learning by an in- 
crease in the number of responses forced 
to S— attests to the importance in selec- 
tive learning of nonreinforcement of the 
incorrect response. Several factors which 
might underlie this effect must be exam- 
ined before an interpretation of the effect 
can be offered. Accumulation of stimulus 
satiation, not response-produced, cannot 
account for the results since the arms of 
the maze were identical in all respects 
except spatial location. Response repeti- 
tion produces reactive inhibition (Jp) 
which can affect subsequent responding 
with the intertrial interval used in the 
present experiment (Hull, 1943), For- 
tunately, it is possible to make compari- 
sons between pairs of experimental and 
control groups which were equated with 
respect to J, but which differed in expe- 


TABLE 2 


ANALYSIS OF VARIANCE OF TRANSFORMED 
Errors TO REVERSAL CRITERION 


Source df F 
Placements (P) 1 1.52 
Forcing (F) 4 TA 

Linear 1 1.87 
Quadrate 1 1/.15*55 
ubic 1 6.05* 
Quartic 1 4.77* 
POCR. 4 1.16 
Within (MS) 30 (0.61) 
* 5 <.05 
=p $i 
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rience of nonreinforcement of the incor- 
rect response on forced trials. Experi- 
mental Groups 1— and 3— learned the 
discrimination more rapidly than their 
respective control groups, 1C and 3C. 
Groups 1C and 3C ran with about equal 
speed to both maze arms on forced trials 
even though ł of their responses were 
forced to one arm, and Group 3C was 
running ł of the time to the arm in which 
nonreward occurred on free trials. Group 
1— ran more slowly to S— on forced 
trials even though less Jp had accumu- 
lated to inhibit responding to S— as 
compared to S+. These results argue 
for a strong effect of nonreinforcement 
outweighing any effects of Ip. 

The present data are consistent with 
Amsel’s (1958, 1962) and Spence's 
(1960) position that anticipatory forms 
of the frustrative reaction to nonreward 
become conditioned to cues associated 
with the nonrewarded response. It is 
assumed that the anticipatory frustration 
response (rz) produces mediating stimu- 
lation (sj) which tends to evoke behavior 
incompatible with the nonrewarded re- 
sponse to S—. Since diminution of s, is 
assumed to be reinforcing, whatever com- 
peting behavior effects s, diminution is 
strengthened. The slow running speeds 
of Groups 1— and 3— into S— on forced 
trials can be interpreted to reflect com- 
peting behavior evoked by sy. 

The improvement in free choice per- 
formance as a function of increasing 
ambunts of forcing to S— without rein- 
forcement is consistent with a frustration 
interpretation. Such forcing provides for 
the conditioning of r; to S—, and the 
production of s, on a free trial evokes 
responses incompatible with a choice of 
S—. On a free trial, a response incom- 
patible with a choice of S— is reinforced 
through s, diminution, thus tending to 
promote a choice of S+ on subsequent 
free trials. 

The variable of number of rewarded 
goal-box placements prior to discrimina- 
tion training failed to have a significant 
effect upon learning, probably due to an 
insufficient difference in the number of 
placements. It is regrettable that a group 


CHARLES N. UHL 


was not given a larger number of pre- 
discrimination rewarded placements. If 
a strong generalized r,-s, to the maze 
arms had been established prior to dis- 
crimination training, a frustrative reac- 
tion to nonreward should have occurred 
at the outset of discrimination training. 
Had this been the case, the conditioning 
of r; to S— and the consequent develop- 
ment of tendencies to avoid S— should 
have been established more rapidly. 

Reversal.—lInterpretations of the re- 
versal data are compromised by the fact 
that treatment groups differed apprecia- 
bly in the amount of training they re- 
quired to reach the acquisition criterion. 
However, since the intergroup correla- 
tion between amount of acquisition train- 
ing and errors to the reversal criterion 
was far less than perfect, the conditions 
of training can be related to reversal 
learning apart from the effect of amount 
of training on reversal learning. 

The function relating speed of reversal 
learning to amount of forcing to S- 
in acquisition closely approximated am 
inverse U. The proposed explication of 
this function must be recognized as highly 
tentative and ad hoc. If reversal is con- 
ceived to involve extinction of tendencies 
to approach the previously S+ and to 
avoid the previously S— followed by 
acquisition of approach and avoidance 
tendencies to the discriminanda which 
become S+ and S— in reversal, the tê- 
versal function becomes explicable, Re- 
versal should be slowest when both strong 
approach S+ and avoid S— tendencies 
must be extinguished before new ap- 
proach and avoidance tendencies can be 
acquired. Reversal should be faster thé 
more the tendency to be extinguished 15 
Preponderantly approach S+ or avo! 
S—. The present reversal learning T€ 
sults agree with this analysis if it 5 
assumed that the acquisition of such aP- 
proach and avoidance tendencies takes 
place equally well on forced and free 
trials. Group 2— which had equal forc- 
ing experience to S+ and to S— # 
acquisition was slowest to reverse 
Groups 1— and 3— were intermediate i? 
Speed of reversal learning. Groups 0 
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and 4— which had all of their forced 
trials in acquisition to S+ and to S—, 
respectively, were fastest to learn the 
reversal. 

The only groups in the experiment 
which did not experience nonreward on 
forced acquisition trials, Groups 0—, 1C, 
and 3C, were not only the three slowest 
to acquire the discrimination but were 
the three fastest to learn discrimination 
reversal In addition to supporting the 
interpretation offered for the acquisition 
results, this result is consistent with the 
foregoing interpretation of the reversal 
function. Groups 0—, 1C, and 3C, as 
compared to the remaining groups which 
did experience nonrewarded forcing to 
S— in acquisition, presumably did not 
develop strong avoidance of S—. Also, 
these three groups ceased to experience 
nonreward for response to S— on free 
trials when they began the criterion run 
(which consumed 50 trials before reversal 
training began) whereas the remaining 
groups continued to experience non- 
reward when forced to S— right up to 
the time they met the acquisition cri- 
terion. D’Amato and Jagoda (1961) 
reported a similar result. They failed 
to obtain the overlearning reversal effect 
when 20% of the overtraining trials were 
forced to S— without reward. Appar- 
ently, reversal learning is more difficult 
when strong tendencies to avoid the pre- 
viously S— must be extinguished in addi- 
tion to the extinction of approach to the 
previously S+. 
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FIRST-ORDER RESPONSE DEPENDENCIES AT A 


DIFFERENTIAL BRIGHTNESS THRESHOLD 


R. G. LATHROP 
Chico State College 


Prior studies have shown the existence of significant sequential de- 
pendencies at the visual absolute threshold. Results of the current 
study indicate no such relationship at a differential brightness threshold. 
A tentative hypothesis advanced to account for these results assumes 
that, with initially subthreshold stimuli, reticular activity is low and 
excitability of cells in the visual cortex increases. With a now supra- 
threshold input, reticular activity facilitates further visual stimuli until 
adaptation occurs. With adaptation, stimuli are again subthreshold 
and the cycle begins again. This hypothesis may account for contra- 
dictory findings concerning intersensory stimulation. 


The current paper is an extension 
of a series of studies by Verplanck 
and his associates which demonstrated 
that successive responses to stimuli 
at the visual threshold are non- 
independent. The first of these 
studies (Verplanck, Collier, & Cotton, 
1952) effectively ruled out the possi- 
bility of retinal events and ganglionic 
summation as possible explanations 
of this response dependency. Al- 
though one class of central events, 
that of prior training, was shown by 
Verplanck, Cotton, and Collier (1953) 
to influence this relationship, it was 
not the causative agent. Finally, in 
a study by Collier (1954), essentially 
all peripheral events were eliminated 
as the source of this sequential re- 
sponse dependency. 

Since all of these studies were con- 
ducted at the absolute visual thresh- 
old, there remained the possibility 
that the nonrandom response pattern 
demonstrated was a function of spe- 
cific factors peculiar to the method of 
investigation. The current study, 
consequently, analyzes successive re- 
Sponses at a differential threshold 
(DL) as determined by the method of 
adjustment. The existence or non- 
existence of significant sequential 
response relationships at the DL 


should have specific implications for 
the locus of these relationships oc- 
curring at the absolute threshold. 


METHOD 


Subjects. —Eighteen students enrolled in an 
introductory psychology class at Chico State 
College served as Ss for the current study. 
The data of one S were eliminated from the 
analysis due to a change in adaptation condi- 
tions occurring during the experiment. 

Apparatus.—All determinations of the DL 
were made on the Model 701 brightness- 
discrimination apparatus produced by E 
Lafayette Instrument Company. Althoug 
this instrument is not accurate in an absolute 
sense, it is accurate for the relative settings 
necessary to determine response relationships. 
For the current study, the constant stimulus 
was set at a numerical reading of 150 (cor- 
responding to approximately 0.382 foot- 
candles (ftc.) 1$ in. from the source). T 
variable stimulus when set at numerical 
readings of 100 and 200 corresponded to 
levels of approximately 0.163 ftc. and 1.1 
ftc., respectively, 1$ in. from the source. 

Procedure. —All settings of the DL w 
taken in a lightproof room with the overhea 
lighting reduced to the minimum neces d 
to permit S to see the apparatus and control 
knob. As each S entered this experimental 
room, the time at which adaptation began 
wasrecorded. During the 15-min. adaptation 
period, the classical method of adjustmen 
was demonstrated and special emphasis wa’ 
Placed on the fact that retracing was not 
Permitted. Each S was instructed to adjust 
as precisely as possible, the variable stimulus 


120 


RESPONSE DEPENDENCIES AT DIFFERENTIAL THRESHOLD 


light by means of the control knob until its 
intensity matched that of the standard 
stimulus. These instructions were followed 
by a series of 10 practice trials, In the first 
5 trials, the variable stimulus was initially 
brighter than the constant stimulus and in the 
second 5 trials the variable stimulus was 
initially dimmer. No knowledge of results 
was given except that in any trial where re- 
tracing was apparent, that trial was repeated 
following a repetition of the basic instructions. 

At the conclusion of the 15-min. adaptation 
period, one series of 10 ascending and one 
series of 10 descending trials were adminis- 
tered. The order of series presentation was 
systematically varied between Ss to control 
for order effects. Each setting was recorded 
to the nearest numerical scale unit. 


RESULTS 


Measure of sequential dependencies.— 
Since the basic data for the current 
study were distributed (range = 125- 
195) rather than dichotomous, the 
runs technique used by Verplanck and 
his associates for their analyses was 
inappropriate. The method of anal- 
ysis utilized for the current study is 
that given by Lathrop (1965). This 
is the à statistic, a measure of the 
average absolute slope of successive 
response settings. When used for 
normally distributed data, the com- 
putational formula is: 


v Egg 
Where: 


dy = value of the statistic for the jth 
sequence (typically S no. j). 
Xi XG40j = Successive responses in the jth 
sequence. 
c = within j's ø (across i's). 


A Ixen 
The term x N-1 
age absolute slope of the members of 
the jth sequence assuming that the 
scores are taken at equal intervals 
(this interval given a unit value). 
The division by ø transforms this to 


is theaver- 
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standard score form and the square 
root is taken for the purpose of 
normalizing the resulting distributions 
when },’s are computed for N-length 
sequences of random normal numbers. 
The parameters of such distributions 
are: 


= 1.00 


These parameters have been em- 
pirically validated at the Rutgers 
University Information Processing 
Center against distributions of A 
values computed on a basic distribu- 
tion of 40,000 random normal devi- 
ates. These parameter values are 
based on a zero correlation between 
successive responses and define the 
expected population values. For em- 
pirically drawn values, a positive 
relationship between successive meas- 
ures yields distributions of à lower 
than 1.00 and, conversely, a negative 
relationship yields values greater than 
1.00. Thus, stable sequences are in 
the lower tail of the theoretical dis- 
tribution and alternating sequences 
are in the upper tail. The midpoint 
represents those sequences where no 
significant first-order sequential de- 
pendencies are apparent. 

Two items are worthy of particular 
note, First, since the « value included 
in the computational formula is the 
within-/s standard deviation, the 
typical analysis may directly follow an 
analysis of variance by use of the 
within-Ss error term. Secondly, in 
such an analysis, it is not required 
that the S variable be nonsignificant. 
The mean for a given S is, in effect, 
subtracted during the determination 
of the value for that S’s sequence of 
responses. Note that this relaxes 
somewhat the requirement for nor- 
mally distributed data. All that is 
assumed is that the within-Ss residuals 
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TABLE 1 
ANALYSIS OF VARIANCE ON SETTINGS 
or THE DL 
Source MS af F 
Systematic Variance] 722.90 | 32 | 14.56* 
Within Ss 49,65 | 307 
*p «01. 


are normally distributed—in most 
cases, a fairly easy assumption to 
make. 

Data analysis.—The results of an 
analysis of variance completed on the 
340 settings of the DL are shown in 
Table 1. Although the systematic 
variance (that attributable to ascend- 
ing-descending presentation, differ- 
ences between Ss, and the interaction 
of these two factors) is highly signifi- 
cant, it is of only minor interest to the 
current study and, as noted above, 
does not influence computations of 
within presentation, within S values. 
The c required for the analysis of 
sequential dependencies, based on the 
MS within Ss listed in Table 1 was 
7.117. Using this value in the com- 
putational formula previously given, 
^ values were computed separately 
for the ascending and descending 
series of each S. 

"Thus 34 ^ values (2 values each for 
17 Ss) were available for analysis. 
To determine whether there were 
differences due to ascending-descend- 
ing presentation, a / test for correlated 
scores was completed on these two 
setsofA values. Here, ¢ (16) = 0.732. 
Since there were no significant differ- 
ences due to presentation, the 34 À 
values were grouped into a single 
distribution for comparison with the 
theoretical A values based on random 
sampling. This comparison is shown 
in Table 2. As noted there, neither 
the overall mean A value nor the 
variability is significantly different 
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TABLE 2 { 


COMPARISON OF EMPIRICAL AND THEORETICA 
RANDOM À VALUES 


Theoretical |. Empirical 
Parameter Random ct df t 
M 1.000 1.028 33 | 0,74 
SD 0.158 0.193 32 | 1,491 
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from what might be expected froi 
sampling from a distribution with mt 
relationship between successive 
sponses. It appears that there are m 
significant first-order sequential de 


pendencies existing between succe 
sive responses at a visual DL. 


DISCUSSION 


The current findings of no signific 
response dependencies at a visual DI 
have direct relevance to the determim 
tion of the locus of these relationshif 
occurring at the absolute threshold. 1 
would appear that, due to the constam 
exposure to suprathreshold stimulatio 
many of the possible peripheral explant 
tions of the Verplanck group's result 
have been eliminated. For example, 
effects of both ganglionic and pos 
ganglionic summation are relatively co 
stant throughout the current stud! 
Consequently, central factors woul 
have been suspected as a causati 
factor had significant response depenc 
encies been found. ‘ 

The inability to find these effe 
however, leads to the tentative ci 
clusion that general central factors (i-€ 
associative functions) are not in them 
selves responsible for the sequentlé 
aspects of the Verplanck results. Th 
critical factor is assumed to be a functio 
of the method of stimulation. 

A reasonable hypothesis to accom 
for the presence of sequential depen¢ 
encies at the absolute threshold but nO 
at the DL is concerned with the arousé 
function of the reticular formation. 4 
is assumed that the absence over time 
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a perceived visual stimulus (a “No” 
response to stimulation) leads to dimin- 
ished reticular activity, and in turn, to 
increased long-term excitability of cells 
in the primary visual cortex. Such 
activity has been demonstrated by 
Robertson and Evans (1965) at the 
visual cortex in a cerveau isolé preparation 
of a cat. 

As a consequence of this increased 
excitability, stimuli previously sub- 
threshold are able to evoke a specific 
response in the cortex and subsequently 
a positive verbal report (a ‘Yes’ 
response). This positive response is 
assumed to function in the same fashion 
as a novel stimulus to activate the non- 
specific projection network within the 
reticular formation in a manner similar 
to that reported by Lindsley (1957). 
This reticular activity then facilitates 
further positive responses to incoming 
visual stimulation. After a few positive 
responses, however, adaptation to in- 
coming stimulation occurs and the 
facilitative effect of the reticular activity 
is diminished. Incoming stimulation is 
thus incapable of evoking a positive 
response and the cyclical pattern begins 
again. 

With the constant suprathreshold stim- 
ulation (preceded by an adaptation 
period) provided in the current study, 
this particular interactive effect of the 
reticular system is not present. With 
this combination of adaptation and 
constant stimulation, reticular activity is 
relatively constant, permitting neither 
the increase in cortical excitability due to 
low levels of activation nor the facilatory 
effect of the novel stimulus. Asa result, 
variations in responses at a DL appear 
to be random. 

The hypothesis advanced here would 
appear to relate what seem to be other- 
wise incompatible findings in the area of 
intersensory stimulation effects upon 
determination of the absolute visual 
threshold. Sokolov’s (1960) finding of a 
lowered visual threshold due to discrete 
presentation of a suprathreshold auditory 
stimulus may be related to the “novel 
stimulus” portion of the proposed hy- 
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pothesis. This novel stimulus causes 
reticular activity which facilitates the 
response to previously subthreshold 
visual stimulation. 

As a contrast, Chapanis, Rouse, and 
Schachter (1949) found no such inter- 
sensory facilitation of the visual thresh- 
old. Their study, however, involved 
auditory, olfactory, and kinesthetic stim- 
ulation over a 40-min. period of dark 
adaptation. It is assumed that reticular 
adaptation to these extraneous stimuli 
occurred rather soon in the course of the 
experiment and that no facilitative effect 
was present. One might predict that the 
thresholds may actually have been 
slightly higher than normal due to 
elimination of the increased cortical 
excitability hypothesized under more 
normal viewing conditions. 

The hypothesis advanced here is 
amenable to direct experimental test. 
It may be predicted that some measure 
of reticular activity is directly correlated 
to the presence or absence of a positive 
response to threshold stimulation. More 
specifically, it is predicted that the first 
positive response following a series of 
negative responses will occur at the low 
point of reticular activity. The second 
positive response in a series should follow 
the high point of reticular activity due to 
the novel stimulus element of the first 
positive response. With modern record- 
ing techniques, such a study is currently 
feasible. 
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EFFECT OF PUNISHMENT DURATION AND INTENSITY ON 
THE EXTINCTION OF AN INSTRUMENTAL RESPONSE* 


ERLING E. BOE 


University of Victoria 


2 factorial experiments were carried out in which a learned barrier- 
crossing response by rats was punished with shocks of various dura- 
tions and intensities during extinction. Each response during extinc- 


tion was punished in Experiment 
attained; only the 1st response du 


I until a criterion of extinction was 
ring extinction was punished in Ex- 


periment II. In general, the degree of depression in response strength 
produced by punishment was directly related to its intensity and to its 
duration. Amount of postpunishment recovery of response strength 


appeared to be directly related to 
punishment. 


Punishment refers to the operation 
in which the onset of an aversive stimu- 
lus is response contingent. The major 
parameters of punishment are delay, 
intensity, duration, schedule, and fre- 
quency. Although the delay and in- 
tensity parameters have been subjected 
to considerable experimental research 
(Church, 1963), it was not until re- 
cently (Ferster & Appel, 1961) that 
the duration parameter was first inves- 
tigated systematically. Several para- 
metric experiments have since been 
conducted which suggest that the de- 
pressing effect of punishment on the 
strength of a punished response is di- 
rectly related to punishment duration. 
This generalization is applicable to the 
punishment (a) of erroneous responses 
during visual discrimination learning 
in pigeons and rats (Ferster & Appel, 
1961, Exp. II; Wischner & Fowler, 
1964), and (b) of a learned operant 
response in rats when positive rein- 
forcement is continued during punish- 
ment sessions (Boroczi, Storms, & 
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the degree of depression induced by 


Broen, 1964; Storms, Boroczi, & 
Broen, 1963). 

No experiments have yet been re- 
ported concerning the effects of punish- 
ment on responses in different cate- 
gories than the two mentioned above. 
For example, the effect of punishment 
duration on learned responses under- 
going extinction has not been investi- 
gated, even though the effects of many 
other parameters of punishment have 
been explored (e.g., Boe, 1964; Estes, 
1944). The present experiment was 
designed to provide a parametric analy- 
sis of the depression in strength of an 
instrumental response undergoing ex- 
tinction produced by several durations 
of the punishing stimulus. Because 
much past research has demonstrated 
that the behavioral effects of punish- 
ment are determined to a large extent 
by its intensity, punishment intensity 
was simultaneously varied with dura- 
tion in a factorial design to test for 
an interaction. 

As Boe (1964) has previously ob- 
served, the common conception (e.g., 
Hilgard, 1956) that the reduced 
strength of a punished response under- 
going extinction will recover upon dis- 
continuation of punishment is sup- 
ported by only a few experiments, and 
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is inconsistent with the results of many 
more. In view of these results, it is 
apparent that such recovery is a special 
instead of a general punishment phe- 
nomenon. Because little is known 
about the relationships between degree 
of recovery and parameters of punish- 
ment, postpunishment trials were also 
given in the present experiment so that 
the effects of punishment duration and 
intensity on the recovery phenomenon 
could be observed. 


EXPERIMENT I 


A learned instrumental response 
undergoing extinction was punished 
until a criterion of extinction was 
reached in a factorial experiment in- 
corporating five levels of punishment 
duration and two levels of punishment 
intensity. 


Method 


Subjects—The Ss were 55 albino rats 
purchased from the University of British 
Columbia Animal Depot. All Ss were male, 
experimentally naive, and approximately 90 
days old at the beginning of training. 

Apparatus—A two-compartment shuttle 
box with hinged, grid floors was used, Each 
compartment was 6 in. wide, 18 in. long, and 
24 in. high. A bronze barrier, 2 in. high, 
separated the two compartments. A vertical 
sliding door above the barrier was available 
to completely isolate one compartment from 
the other, and prevented spontaneous (ie, 
between-trial) barrier-crossing responses. 
Except for the grid assembly, the barrier, 
and a 5-in. strip of clear Plexiglas at grid 
level, the apparatus was constructed of wood. 
It was painted flat white, with the exception 
of the sliding door, which was painted flat 
black to facilitate the door-orienting re- 
sponse. Each compartment was constantly 
illuminated by a 15-w. light bulb installed 
in its ceiling. The experimental room was 
dark. 

The two identical grids (one for each com- 
partment) were constructed of 2-in. brass 
welding rods mounted 3 in. apart (center to 
center) in a clear Plexiglas frame, and were 
hinged at the end opposite the barrier sepa- 
rating them. The unhinged end of each grid 
was held in place by an electromagnet which 


would release upon receiving the weight of 
an S, thereby allowing that end of the grid 
to drop 4 in. This grid movement closed 
two microswitches operating the shock and 
timing circuits. Each grid was wired sepa- 
rately, but included the bronze barrier. Elec- 
trical stimulation was provided by a Grason- 
Stadler Model E1064GS shock generator 
which was calibrated in short-circuit current. 
This stimulator provided for a series of 
discrete shock duration and intensity settings. 
Although the scrambler was intentionally 
bypassed because most of the shocks were 
very brief, shock duration was never altered 
by feces lodging between two bars. 

When the sliding door was raised, a micro- 
switch automatically closed a timing circuit 
thereby starting a y$y-sec. Standard Electric 
timer. Upon S crossing the barrier, a grid 
microswitch opened the timing circuit, thus 
automatically furnishing a measure of the 
latency of the barrier-crossing response. 

Design.—Five Ss were assigned by means 
of a table of random numbers to each of 10 
treatment groups in a 2 X 5 factorial design, 
and to an eleventh no-punishment control 
group (Control Group I). Two levels of 
punishment intensity (.25 ma. and 2.00 ma. 
as measured in short-circuit current) and 
five levels of punishment duration (.05 sec., 
10 sec, .30 sec, 1.00 sec., and 3.00 sec.) 
were used. Pretesting with different Ss 
indicated that the least severe punishment 
which would reliably produce behavioral evi- 
dence of stimulation was the .25 ma/.05 sec 
shock, while the most severe shock (2.00 
ma/3.00 sec shock) used would elicit vigor- 
ous motor reaction, vocalization, and elimi- 
nation. 

The two specific dependent variables meas- 
ured automatically by the apparatus were: 
(a) the number of barrier-crossing re- 
sponses, and (b) the latency of each barrier- 
crossing response. 

Procedure.—The Ss were removed from 
an ad lib drinking schedule and placed on a 
24-hr. water-deprivation schedule 7-8 days 
before the pretraining day. For the remain- 
der of the experiment, they were given 
access to water bottles for 10 min. each day 
in their individual home cages. Food was 
available at all times except when Ss were 
actually in the experimental apparatus. 

Pretraining consisted of placing a single 
S in the apparatus (with the sliding door 
raised) for 10 min, and allowing explora- 
tion of both compartments without restric- 
tion. No primary reinforcement was pro- 
vided. On the day following pretraining, 
an S was placed in one of the shuttle-box 
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compartments (the starting compartment 
was alternated on successive days), and two 
drops of water were placed in a cup at the 
end of the opposite compartment. After 
1 min., the sliding door was raised until S 
crossed the barrier and entered the other 
compartment. It was lowered following the 
crossing response to prevent retracing. One 
minute following the barrier-crossing re- 
sponse (during which E loaded the water 
cup in the opposite compartment), the sliding 
door was again raised and the next trial 
commenced. Five trials per day were given 
all Ss throughout 6 training and 6 extinction 
days. Once pretraining commenced, all Ss 
underwent daily experimental sessions until 
completion. 

A barrier-crossing response was defined 
by the release of the electromagnet holding 
the opposite grid upon receiving the weight 
of an S crossing the barrier. If an S failed 
to cross the barrier within 1 min. following 
the raising of the sliding door, the door was 
lowered and the trial terminated. If, how- 
ever, an S was in the act of crossing the 
barrier when 1 min. had expired, the door 
was not lowered until the response was 
complete, or until S had retreated back to 
the original compartment. One minute after 
the fifth trial of each day, S was removed 
from the apparatus, returned to his home 
cage, and, after a 10-min. delay, watered 
for 10 min. 

In all, five Es were involved in running 
Ss, However, no biasing effects due to E 
differences should have occurred because Es 
were assigned to Ss by means of a table of 
random numbers (and always ran an S 
through all phases of the experiment), and 
because the apparatus was largely auto- 
mated. Thus, both S and E differences were 
randomly distributed across the 10 experi- 
mental treatments and the control condition. 

Following the sixth day (or thirtieth 
trial) of training, extinction began (ie., 
the cups were not loaded with water). In 
addition, all barrier-crossing responses were 
automatically punished with the appropriate 
duration and intensity when the hinged grid 
dropped until the criterion of 1 day without 
a barrier-crossing response was attained. 
When an S reached this criterion, punish- 
ment was discontinued for the remainder of 
the experiment. As noted in the apparatus 
section, the grid would drop + in. when it 
received the full weight of an S. The door 
was lowered just as S completed the barrier- 
crossing response and shock commenced. 

hus, it was impossible for him to retreat 
back across the barrier to escape shock 


(which would have altered its programmed 
duration) or become entrapped in a closing 
door. Upon completion of 30 extinction 
trials (5 trials per day for 6 days), experi- 
mentation was discontinued. 


Results 


The results were analyzed in terms 
of the mean number of barrier-crossing 
responses during extinction before the 
criterion of 1 day (five trials) with 
no barrier-crossing was attained. Be- 
cause each response was punished, this 
mean is also the mean number of pun- 
ishers received. Since the experimental 
schedule required that the experiment 
be discontinued after 6 days of extinc- 
tion, some of the Ss did not reach this 
criterion. Fortunately, this criterion 
was reached by all Ss in the 2.00-ma. 
groups, by all Ss in the .25 ma/1.00 
sec and .25 ma/3.00 sec groups. These 
results are reported in the lower sec- 
tion of Fig. 1. For all Ss in the .05-, 
.10-, and .30-sec. groups with .25 ma. 
and Control Group I (.00 sec.), it was 


DURING THIRTY TRIALS. 


MEAN RESPONSES 


TO CRITERION 


PUNISHMENT DURATION (SEC) 


Fic. 1, Mean responses during extinction 
as a function of punishment duration for two 
levels of punishment intensity. (The curve 
in the upper section for .25 ma. is based on 
the mean number of barrier-crossing re- 
sponses during 30 extinction trials. The 
highest point—near 30—is for the control 
group—or .00-sec. duration. For the other 
three groups connected by the upper curve, 
all responses were punished. The curves 
in the lower section are based on the mean 
number of punished responses to the criterion 
of 1 day—five trials—with no barrier-cross- 
ing response.) 
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possible to determine the number of 
punished responses during the 30 ex- 
tinction trials. This dependent varia- 
ble is plotted in the upper section of 
Fig. 1 for all four groups. While it 
is regrettable that the same dependent 
variable could not be used for all 
groups, the fact that the shorter dura- 
tion punishment groups with .25-ma. 
intensity did not reach the criterion of 
extinction is evidence that longer dura- 
tion punishments have a stronger de- 
pressing effect on response strength. 
It is also apparent that brief, weak 
punishers have relatively little effect, 
while the effect of brief, strong pun- 
ishers is much more comparable to the 
effect of prolonged, strong punishers. 

A simple analysis of variance of the 
mean responses to the criterion of one 
no-response day for the five 2.00-ma. 
punishment groups indicated that they 
were significantly different at the .05 
level of confidence, while another sim- 
ple analysis of variance of the number 
of punished responses during the 30 
extinction trials indicated that the three 
brief (ie, .05, .10, and .30 sec.) 
.25-ma. punishment groups and Con- 
trol Group I were significantly differ- 
ent at the 01 leve. A2x2 analysis 
of variance of the mean responses to 
the criterion of extinction for the two 
1.00-sec. and two 3.00-sec. punishment 
groups showed that the intensity factor 
was significant at the .01 level, while 
the duration factor and interaction did 
not approach significance at the .05 
level. 

With one exception, all Ss in the 
2.00-ma. groups reached the criterion 
of extinction by the third day, thus 
providing an opportunity to observe 
recovery of the barrier-crossing re- 
sponse after punishment was discon- 
tinued for 3 full days, or 15 trials. 
A trend analysis across the 3 post- 
punishment days for the five different 


duration groups (with 2.00 ma.) re- 
vealed a recovery effect significant at 
the .01 level, but a nonsignificant pun- 
ishment duration effect and Day x 
Duration interaction. The mean num- 
ber of barrier-crossing responses, based 
on the 25 Ss of the five groups, for the 
3 postpunishment days were, respec- 
tively, .40, .92, and 1.16. 


EXPERIMENT II 


If recovery of response strength fol- 
lowing the discontinuation of punish- 
ment had been significantly related to 
the duration variable in Exp. I, it 
would not have been possible to con- 
clude that different durations produced 
different rates of recovery because pun- 
ishment duration was confounded with 
the frequency, or number, of punishing 
stimuli presented. Experiment II was 
designed to vary punishment duration 
and intensity in a factorial experiment 
which eliminated the possibility of 
either being confounded with fre- 
quency. Apparently no previous re- 
search has been published in which the 
frequency variable was intentionally 
controlled while parameters of the pun- 
ishing stimulus were varied. Such 
control was accomplished by punishing 
only the first response during extinc- 
tion, thereby limiting the frequency of 
punishment to 1 for each S. 


Method 


The four changes described below were 
made in the method reported for Exp. I. 
In other respects, such as number of Ss per 
group, apparatus, and procedure, the experi- 
mental methods were identical. 


1. The briefer punishment durations were 
eliminated and an additional intensity (4.00 
ma.) was added. The resulting design was 
a 3X3 factorial experiment with three in- 
tensities (25 ma, 200 ma, and 4.00 ma.) 
and three durations (.30 sec., 1.00 sec. and 
3.00 sec.). Five Ss each were assigned to 
the nine treatment groups and two additional 
control groups (Control Group II and Con- 
trol Group III) by means of a table of 
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random numbers. Actually, Ss were ran- 
domly assigned to all conditions of both 
experiments. Thus, cross-experimental com- 
parisons can be made without concern for 
systematic S bias. 

2. Control Group II was given the normal 
training procedure for 6 days in the absence 
of primary reinforcement. The purpose of 
this control group was to ascertain the 
operant level of the barrier-crossing re- 
sponse. 

$. Control Group III was given the same 
training and extinction procedure as the 
4.00 ma/3.00 sec experimental group, with 
the exception that the punishing stimulus 
was administered to an S 1 min. after being 
placed in the apparatus on the first trial of 
the first day of extinction. The purpose of 
this control group was to examine the effect 
of aversive stimulation on the strength of 
the barrier-crossing response when it was 
not response contingent (ie, did not imme- 
diately follow the barrier-crossing response). 
One of the issues in punishment research has 
been the degree to which a response-con- 
tingent shock produces a greater effect than 
a noncontingent shock that is otherwise 
equivalent. 

4. Only the first-barrier crossing response 
of the first day of extinction (or the thirty- 
first trials of the experiment) was punished, 
in contrast with the punishment of all re- 
sponses to a criterion of extinction in 
Exp. I. 


Results 


The results were analyzed in terms 
of the mean log latency of the barrier- 
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crossing response during extinction. 
The latency measure was used instead 
of the number of barrier-crossing re- 
sponses, as in Exp. I, because the 
barrier was usually crossed under the 
conditions of Exp. II. Two techniques 
were used to remove extreme positive 
skewness of the original time scores. 
First, the median latency during five 
trials of each day was computed for 
each S. An exception to this procedure 
was the first day of extinction in which 
only the latencies of the four post- 
punishment trials were used to com- 
pute the median latency. Next, the 
median latencies were transformed to 
logarithms. 

In the absence of a significant inter- 
action between duration and intensity, 
the main effect on each variable is 
presented separately as a function of 
extinction day in Fig. 2. Since the 
points plotted for Extinction Day 1 
are based on response latencies during 
the four trials following the first trial 
which entailed the single punishment, 
all points report postpunishment per- 
formance. The extinction curve of 
Control Group I was included in Fig. 
2 for comparison purposes. A trend 
analysis of these data (exclusive of the 
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Fic. 2. Mean log latency of the barrier-crossing response by extinction day for three 
levels of punishment intensity—left—and three levels of punishment duration—right. (The 
punishment curves are each based on the performance of 15 Ss representing main effects 
in a 3X3 factorial design with five Ss per treatment group. The no-punishment control 
curve represents the performance of five additional Ss.) 
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Fic. 3. Mean log latency of the barrier- 
crossing response by extinction day for one 
experimental group and three different con- 
trol groups. (Each group was composed of 
five different Ss.) 


control group) indicated that the fol- 
lowing effects were significant: main 
effect of intensity, F (2) =3.75, p< 
05; main effect of day, F (5) = 9.26, 
p « 01; Intensity x Day interaction, 
E (2,5) = 6.26, p < .01; Duration x 
Day interaction, F (2, 5) = 4.04, p< 
.01; Intensity x Duration x Day inter- 
action, F (2, 2, 5) = 7.17, p < 01. 

In view of Fig. 2 and the highly sig- 
nificant Intensity x Day and Duration 
X Day interactions, it is apparent that 
a single shock of suitable intensity or 
duration was sufficient to immediately 
and permanently increase response la- 
tency to a level reached only by the 
no-punishment extinction group only 
after 25 extinction trials, Further- 
more, there was no evidence of sub- 
stantial decrease in response latency 
(ie, recovery) following the initial 
increase produced by punishment. Al- 
though the 1.00-sec. duration curve is 
higher than the 3.00-sec. curve during 
Days 1 and 2, the difference is not 
significant, 

The performances of the three con- 
trol groups and the 4.00 ma/3.00 sec 
shock groups are plotted in Fig. 3 as 
a function of experimental day. A 
simple analysis of variance of the four 
groups on Day 1 indicated that they 
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were significantly different at the .01 
level of confidence, while the F for a 
similar analysis on Day 6 was less 
than unity. It is noteworthy that a 
single strong shock was sufficient to 
increase latencies to operant level 
(Control Group II). Although the 
mean latency on Day 1 of the non- 
response contingent shock group (Con- 
trol Group III) is somewhat lower 
than the mean of the 4.00 ma/3.00 sec 
shock group, the difference was not 
significant at the .05 level (1 test). 


Discussion 


The results of Exp. I are consistent 
with the main trend of previous findings, 
viz., that depression of response strength 
by punishment is directly related to both 
its duration and intensity. The inter- 
action between duration and intensity that 
is obvious in Fig. 1 indicates that high 
levels of either minimize the effects of 
variations in the other. That is, varia- 
tions in duration have relatively little 
effect at a high level of intensity, while 
variations in intensity seem to have rela- 
tively little effect when duration is long. 
Although Boroczi et al. (1964) observed 
the same main effects of the two parame- 
ters (each at four levels), they did not 
find an interaction between intensity and 
duration with respect to the dependent 
variable of number of shocks to the point 
where no bar pressing occurred during a 
full I-hr. daily session. The difference 
between the present results and those of 
Boroczi et al. with regard to the Inten- 
sity X Duration interaction is probably 
due to the narrower range of these pa- 
rameters (.3-1.1 sec.; .4-1,0 ma.) used 
in the latter experiment, 

The strong effect of a single punisher 
of suitable intensity and duration in per- 
manently reducing the strength of a 
learned response undergoing extinction 
to operant level latency on one trial is 
particularly noteworthy. When the ex- 
tinction procedure was not combined with 
punishment, the learned response attained 
operant level latency only after 25 trials. 


Bi 
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Apparently no other Es have specifically 
studied the effect of a single punisher on 
a learned response although Jarvik and 
Essman (1960) have found that a single 
punisher will significantly depress the 
strength of an unlearned response. 

The data on recovery of response 
strength following discontinuation of pun- 
ishment suggest that amount of recovery 
may depend upon the degree of punish- 
ment-produced depression in response 
strength. Evidence of some recovery was 
observed in Exp. I in which response 
strength was depressed far below operant 
level by punishment. The amount of 
recovery occurring over 15 postpunish- 
ment trials was 25% of operant level 
strength. No recovery was found in Exp. 
II where response strength was not re- 
duced below operant level. Perhaps spe- 
cific manipulation of amount of punish- 
ment-produced response depression in 
future experiments would be useful in 
providing a better understanding of the 
recovery phenomenon, 
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SQUIRREL MONKEYS AND DISCRIMINATION LEARNING: 
FIGURAL INTERACTIONS, REDUNDANCIES, AND 
RANDOM SHAPES* 
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The experiment investigated the extent to which contextual variables 
which arise from a multiplicity of forms in the visual field can influence 
the discrimination of random shapes. 10 squirrel monkeys performed a 
series of 2-choice discrimination tasks. Each problem was classified by 
the number of shapes present, the redundancy among the shapes, their 
relative spatial positions, and by their individual complexity. Analysis 
of the number of correct responses to the different problem types 
showed complex interactions of the variables. It was concluded that 
a comprehensive psychophysics of shape, in addition to isolating the 
stimulus characteristics of individual shapes, must specify the collective 
and interactive properties of a plurality of shapes as they typically 


appear in natural visual experience. 


Some recent attempts to quantify 
visual experience have dealt specifically 
with relationships between the physical 
properties of random shapes and vari- 
ous measures of perceptual judgment, 
recognition, and discriminability. For 
example, figural complexity (number 
of sides) can be reliably related to per- 
ceptual phenomena in humans and sub- 
humans (Attneave, 1957; Michels, 
Pittman, Hitchcock, & Brown, 1962; 
Vanderplas & Garvin, 1959) as can 
axial rotation and areal asymmetry 
(Brown, Hitchcock, & Michels, 1962; 
Hitchcock, Michels, & Brown, 1963). 
A number of other physical dimensions 
have been examined and the interested 
reader is referred to a recent review 
by Michels and Zusne (1965). 

To date, the majority of studies in 
this area have been concerned with the 
relationship between the physical char- 
acteristics of individual shapes and per- 


1 This investigation was supported in part 
by Research Grant HD-1454 from the Na- 
tional Institute of Child Health and Human 
Development and in part by the Purdue Re- 
search Foundation and the Bolton Fund of 
Temple University. 


ception, and little, if any, quantitative 
data are available regarding perception 
when there are groups of shapes in the 
field. 

Seldom are higher organisms in their 
natural environments confronted by 
visual experiences involving only a 
single shape. The visual world typi- 
cally involves the display of a number 
of contiguous shapes in some context 
which collectively define additional di- 
mensions of the visual stimulus. The 
present work evaluated some potential 
dimensions of discrimination between 
groups of shapes. The variables se- 
lected for examination included: the 
number of shapes making up each 
group; the extent to which the shapes 
within each group differed from one 
another; and the structure of the spa- 
tial relationships among the shapes 
within each group. Shape complexity, 
an internal property of individual 
shapes, was also studied since this di- 
mension has been extensively used by 
other researchers and it was felt that 
the potential interactions of complexity 
with the other contextual variables 
would be of broadest interest. 
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SHAPE DISCRIMINATION IN SQUIRREL MONKEYS 


MeErHop 


Subjects—Ten experimentally naive squir- 
rel monkeys were used in this study. At the 
time of testing, the monkeys had been in 
the laboratory for 2 mo. and it was estimated 
that all were preadult. 

Apparatus.—Figure 1 shows a model of 
the Purdue General Test Apparatus which 
was used to present the discrimination prob- 
lems. This model consisted of a transport 
cage separated from a main test chamber by 
two Plexiglas guillotine screens, one clear 
and the other opaque, which could be inde- 
pendently raised or lowered. The test cham- 
ber was divided by a central partition into 
two compartments, each large enough to 
permit entry by the monkey. At the rear of 
each compartment was a transparent plastic 
door, hinged at the top, which could be 
pushed open to expose a small foodwell 
when not latched. Each door was slotted at 
the top to permit the insertion of a 31 X 4- 
in. stimulus plate; the doors were back- 
illuminated by a 40-w. white fluorescent 
tube. Both doors could be viewed simul- 
taneously from the transport cage when the 
opaque screen was raised. A red error 
light, not visible to the animal, lit up when- 
ever the monkey attempted to push open a 
locked door. 

Stimuli.—Each discrimination problem was 
defined by two stimulus plates containing 
random shapes. These shapes were gener- 
ated by a procedure comparable to the 
Method I described by Attneave and Arnoult 


Purdue General Test Apparatus. 


Fic. 1. 
(A—transport cage, B:—clear restraining 
screen, Bs—opaque restraining screen, C— 
transparent doors, D—stimulus plate par- 
tially inserted into a door, E—foodwells, 
F—fluorescent light.) 
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‘A-Munber d Figures per Ground (2er 4) 
B -Nonter of Sides per Fire (40r6) 


C- Redondan (low or Nigh) 
D- Figure Position (systematic or rondom) 


Fic. 2. Complete factorial design with 
illustrations of sample problems. 


(1956) and detailed by Brown et al. (1962). 
Random samples of 32 four-sided and 32 six- 
sided shapes were selected and equated to a 
constant .80-cm* area. Photographic nega- 
tives, showing solid black shapes in a trans- 
parent ground, were obtained through a 
fine-grain development process and were 
mounted between two 31 X 4-in. glass lantern 
slide plates. These stimulus plates were 
then combined in pairs according to the 
experimental problem types. 

Experimental design.—The experimental 
design consisted of a 2X 2X 2X 2 factorial 
experiment in two incomplete blocks of eight 
treatments each. The 10 monkeys were ran- 
domly divided into two groups of five and 
assigned to the two blocks. All 16 treatment 
combinations were represented in the experi- 
ment but each monkey was tested under only 
half of the treatment conditions. The four- 
factor interaction was confounded with dif- 
Íerences between blocks. Each factor was 
investigated at two levels and all treatment 
combinations referred to problem types. The 
first factor identified the number of shapes 
(two or four) appearing on each stimulus 
plate of a given problem. The second factor, 
complexity, described the number of sides 
(four or six) of the shapes. The third fac- 
tor, redundancy (low or high) was defined 
as the extent to which identical shapes 
appeared on the same plate. Low redun- 
dancy meant that no two shapes on any 
given plate were identical. High redundancy 
meant that all shapes on a given plate were 
identical. No shape appeared on more than 
one plate. The fourth factor, relative posi- 
tion (systematically or randomly deter- 
mined), referred to the physical arrangement 
of the shapes on the plate. For purposes of 
positioning, the "center" of each shape was 
determined by placing it in a coordinate 
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TABLE 1 


PERCENTAGE OF CORRECT RESPONSES FOR 
THE 16 TREATMENT COMBINATIONS 


At 
Bo Bi 
Co | Do d 75.6 | 820 | 68.4 
Di | 64.2 | 940 | 754 | 67.4 
C; | Do | 82.8 | 65.2 | 880 | 842 
Di | 92.4 | 87.4 | 84.6 | 90.8 


Note,—A—number of figures 
four), B—number of sides 
C—redundancy (low or high), 
(systematic or random). 


ground (two or 
re (four or six), 
D—figure position 


system and enclosing it in a rectangle whose 
sides were defined by the maximum hori- 
zontal and vertical extents of the shape. 
The center of a shape was then defined as 
the intersection of the diagonals of the en- 
closing rectangle. Shapes were positioned 
by regarding the viewing area on each door 
as a grid, divided into four equal quadrants 
by a set of horizontal and vertical axes. 
Four shapes were systematically positioned 
by assigning their center points at random 
to the center points of the four quadrants. 
Two shapes were systematically positioned 
by randomly assigning them to positions 
125 cm. to the right or left of the origin 
on the horizontal axis. Random positioning 
for either two or four shapes was accom- 
plished by placing the center point of each 
shape on a point of the grid whose coordi- 
nates had been determined by reference to 
a table of random numbers. This procedure 
restricted the acceptable points to those 
which permitted the entire shape to appear 
on the grid without touching the grid edge 
or without overlapping any previously posi- 
tioned shape. A schematic representation of 
the complete factorial including sample prob- 
lems (not used in the study) is given in 
Fig. 2. 

One problem per treatment condition was 
used. Under each of these 16 problem types, 
the correct (reinforced) stimulus plate was 
randomly determined. The order in which 
the treatments were administered to the 
monkeys was randomized and the number of 
correct responses in 100 consecutive trials 
served as the dependent variable. 

Procedure —NWith both Plexiglas screens 
lowered, the foodwells were baited, the door 
locks set, and the stimulus slides were in- 
serted into the appropriate doors. The posi- 
tion (right or left) of the correct plate 
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yaried from trial to trial according to a 
random sequence. A trial commenced when 
the opaque screen was raised. The clear 
Plexiglas screen allowed the animal to view 
the shapes on both doors but prevented him 
from making an immediate response. After 
a forced inspection period of 7 sec, the 
clear screen was raised and the monkey 
responded by entering one compartment and 
trying to push open the door. A noncorrec- 
tion procedure was used. For a correct 
response, the door opened and the animal 
was rewarded with an apple chip. If the 
response was incorrect, the error light 
alerted E and the opaque screen was lowered, 
forcing the animal back into the transport 
cage to await the next trial. Each animal 
received 50 trials daily and was fed his 
normal food ration immediately after the 
day's testing. 


RESULTS 


Since the dependent variable was 
the number of correct responses over 
100 trials, it was convenient to sum- 
marize this information in the form of 
percentages. Table 1 shows the per- 
centage of correct responses for each 
treatment combination. Performance 
under each of the 16 problem types ex- 
ceeded chance expectancy (p < 01). 
To evaluate the effects of the treat- 


TABLE 2 


SuMMARY OF THE ANALYSIS OF VARIANCE ON 
Squrrret-Monxey Data ror A Two- 
Cnuorck DISCRIMINATION TASK 


Source of Variation df MS F 

Between Ss 9 
Groups (ABCD) 1 | 234.612 = 
Ss within Groups 8 817.375 

Within Ss 70 
Number of Shapes (A) 1 1.512 ral 
Number of Sides (B) 1 99.012 Re. 
Redundancy (C) 1 | 1402.812 | 10.53 
Position (D) 1 | 255.612 | 192 
AXB 1| 132.612 m 
AXC 1 | 546012 | 4.10* 
AXD 1 | 437.112 | 3.28 
BXC 1| 159.612 | 120. 
BXD 1 |1272.012 | 9.55 
CXD 1 | 535.612 | 4.02* 
AXBXC 1 | 1557.612 | 11.69** 
AXBXD 1 | 332.112 | 249 
AXCXD 1 | 122.512 = 
a | gh ea 

u: 133.22. 
Total 79 
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ment factors themselves, the data were 
subjected to an analysis of variance and 
the results are presented in Table 2. 

It can be seen in Table 2 that the 
only significant main effect was the de- 
gree of redundancy among the shapes 
(p< .01). In general, problems of 
high redundancy were easier than prob- 
lems of low redundancy. The num- 
ber of shapes appearing together on a 
stimulus plate (two or four), the com- 
plexity of the component shapes (four- 
sided or six-sided), and the relative 
spatial positioning of the shapes (sys- 
tematic or random) did not yield sig- 
nificant main effects at the .05 criterion 
level. The complex character of the 
treatment conditions is seen, however, 
by an examination of the interaction ef- 
fects. The two-factor interactions, 
Number of Shapes X Redundancy (p 
< 05), Redundancy x Position (p < 
.05), and Complexity (number of sides) 
X Position (p < .01), indicated sig- 
nificant effects which could be attrib- 
uted to certain unique combinations of 
the stimulus and contextual variables. 
The three-factor interaction of Num- 
ber X Complexity x Redundancy was 
also found significant (p < .01). 

To facilitate the interpretation of the 
significant interaction effects, an analy- 


TABLE 3 


SUMMARY OF THE ANALYSIS OF 
SIMPLE EFFECTS 


Variation | 4 MS F 

B at Do 1 | 1040400 | 7.81** 
Bat Di 1 | 330625 | 248 
D at C 1 25.600 Tu 
Dat C, 1 | 76595 | &T75* 
ACatBo | 1 | 129.600 e 
ACatB, | 1 | 1971020 | 14.82** 
Error 56 133.225 

*p «.05. 
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PERCENT CORRECT RESPONSES 
A 
[] 


Systematic (D; ) 
POSITION 


Random (D, ) 


Fic. 3. Percentage of correct responses 
for four-sided and six-sided shapes under 
two types of relative positioning. 


sis of simple effects was run on the 
Redundancy x Position and on the 
Complexity X Position interactions. 
The Number X Redundancy interac- 
tion was included under the more in- 
clusive effect, the Number x Com- 
plexity X  Redundancy interaction. 
These tests are summarized in Table 
3. The analysis of simple effects is 
best interpreted in conjunction with 
the graphical representations of the in- 
teractions which are being probed. 

Figure 3 graphically portrays the 
Complexity X Position interaction, It 
is noted that four-sided shapes repre- 
sented significantly easier discrimina- 
tion problems than did six-sided shapes 
when they were both positioned sys- 
tematically, but the apparent superior- 
ity of six-sided shapes over four-sided 
shapes under random positioning was 
not significant. 

The interaction of Redundancy x 
Position is illustrated in Fig. 4. The 
two types of positioning did not differ 
when the component shapes were of 
low redundancy but random position- 
ing yielded significantly higher per- 
formance than systematic positioning 
(p < .05) when redundancy was high. 

The three-factor interaction of Num- 
ber X Complexity X Redundancy was 


136 ALLAN J. NASH AND KENNETH M. MICHELS 


O-——-O Systematic (Dg) 


do €— ——9 Random (D,) 
2 
P 
ul 
[3 
5 80) 
ui 
[4 
[4 
g 75| 
z 
ù 70) 
o 
© 
ul 
a 
Low(Co) High (C) 
REDUNDANCY 


Fic. 4. Percentage of correct responses 
for systematically and randomly positioned 
shapes under two levels of redundancy. 


interpreted by an analysis of simple in- 
teraction effects which is summarized 
in Table 3 and graphically represented 
in Fig. 5, 

Figure 5 probes the Number of 
Shapes x Redundancy interaction sep- 
arately for problems involving four- 
Sided shapes and problems involving 
six-sided shapes. These simple two- 
factor interaction effects were signifi- 
cant only for problems involving six- 
sided shapes. 


Discussion 


The highly interactive nature of the 
Present data cautions against simple gen- 
eralizations about the effects of the num- 
ber of shapes presented, their complexity, 
relative positioning, and degree of re- 
dundancy. When the unique two-, and 
three-factor combinations of these varia- 
bles did produce significant effects, the 
general findings showed performance to 
be highest with problems consisting of 
two four-sided shapes, high in redun- 
dancy, and randomly positioned. Great- 
est difficulty in discrimination learning 
was generally experienced with problems 
consisting of four six-sided shapes, low 
in redundancy, and systematically posi- 
tioned. 

The present study generally supports 
the findings of earlier researchers who 


concluded that the discriminability of ran- 
dom shapes is affected by their complexity, 
Michels et al. (1962), working with tree 
Squirrels, reported that four-sided shapes 
characterized the most difficult discrimi- 
nation problems and that six-sided shapes 
characterized the easiest. This effect was 
also suggested by Brown et al. (1962) 
for human Ss when it was observed that 
the time required to correctly perform a 
discrimination task was greatest for four- 
sided shapes and least for eight-sided 
shapes (six-sided shapes were not stud- 
ied). The present study, while confirm- 
ing the importance of the complexity 
dimension, found that the specific effect 
of shape complexity is not independent 
of other contextual dimensions when the 
discrimination task involves groups of 
shapes. The present finding that groups 
of four-sided shapes are more easily dis- 
criminated than groups of six-sided 
shapes when they are systematically 
positioned would not have been predicted 
from the relationships noted in the earlier 
studies. The significant three-factor in- 
teraction reported in the present study 
warns that the effects of shape complexity 
become even less consistent when other 
contextual dimensions such as the number 
of shapes, and their redundancy, are con- 
sidered. While substantial differences 
between the present and earlier works 
exist with respect to methodology, species 
studied, and dependent variables, it is also 
possible that psychophysical relationships 
established for single-shape discrimina- 


ee in fatty t) 
99 High min (C) 
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39460) 4 Shapes (A) 
A-SIDED SHAPES (B;) 


4 Shapes A) 


2 Shapes Q) 
6-SIDED SHAPES B) 


Fic. 5. Interaction between the number 
of shapes and the levels of redundancy pre- 
sented separately for four-sided and six-sided 
shapes. 
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tions do not generalize to more complex 
visual stimuli. 

In interpreting the results of the pres- 
ent study, it should also be noted that the 
variable, spatial position, potentially in- 
volves a basis for discrimination that is 
quite different from the others. When 
shapes within a group are positioned at 
random, it is possible for S to perform 
the discrimination without reference to 
the form properties of the individual 
shapes. Random positioning permits the 
discrimination problem to be solved solely 
on the basis of location cues. This may 
mean a shift in cue utilization from 
“What shapes are correct?” under sys- 
tematic positioning to “What and/or 
where are the correct shapes?" under 
random positioning. 

It appears that the multiplicity of 
shapes which characterize natural visual 
experience define dimensions, or contexts, 
which a comprehensive psychophysics of 
shape must include. It is unlikely that 
a simple proliferation of the parameters 
of individual shapes will lead to stable 
principles of the visual discriminant re- 
sponse. For earlier researchers to ap- 
proach the problem, it may have been 
essential to reduce the visual world to 
the simple basis of a single shape. Hav- 
ing established a number of basic rela- 
tionships between the properties of single 
shapes and certain aspects of perception, 
it now appears advantageous to add the 
more complex dimensions defined by a 
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multiplicity of shapes in an attempt 
to achieve greater ecological validity. 
Jointly, the two approaches to the quan- 
tification of shape can be mutually clari- 
fying and help to expose the psycho- 
physical changes which occur on the way 
from the laboratory to the world outside. 
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RECOVERY IN THE ABSENCE OF RECALL: AN INVESTI- 
GATION OF COLOR-WORD INTERFERENCE' 
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The color-word (CW) task requires S to name the colors in which 
words are printed. The subsequent conflict over saying the words may 
lead to their persistent activation, and thus increase their occurrence 
as associates on an association test (“priming”) more effectively than 
would a less conflictful yowel-crossing (VC) task. 4 separate experi- 
ments (N = 10, 12, 40, 40) compared priming and recall of words ex- 
posed in these two tasks, Paradoxically, CW priming was consistently 
superior to VC priming, while CW recall was significantly lower than 
VC recall. CW priming was always greater than CW recall, while VC 
recall and priming were approximately equal. An explanation was of- 
fered based on the theory that conflict, and its concomitant arousal, 


leads to better consolidation. 


The Stroop color-word task, in its 
standard form, consists of color names 
printed in incongruous colored inks; 
Ss must name the colors, ignoring the 
words in which they are embedded. 
The Ss experience a great deal of diffi- 
culty doing this and their performance 
is characterized by stumbling, response 
errors, and slowness of response, all 
typically associated with conflict. Héer- 
mann (1960) found that color-word 
conflict is related to difficulty in deci- 
sion making. Berlyne (1960) has re- 
ported, and recently Johnson (1963) 
has demonstrated that difficult conflict- 
producing decision processes are ac- 
companied by high levels of physiologi- 
cal arousal, suggesting that high levels 
of arousal probably accompany per- 
formance of this task. If so, this 
conflict and arousal should have de- 
monstrable effects on the registration 
and recovery of words, other than color 
names, which were embedded in the 
colored inks. 


1 We would like to express our gratitude 
to George S. Klein for his many helpful 
suggestions during all phases of these ex- 
periments, and for his comments on the 
paper. 


Variations in the color-word text 
have been shown to produce conflict 
with the color-naming response (Klein, 
1964), suggesting that these words do 
register. Recovery of these text words, 
however, has been meager as measured 
by recall or recognition (Klein, 1954). 
A recent finding by Kleinsmith and 
Kaplan (1963, 1964) indicated that 
immediate recall of high-arousal items 
is poor, but delayed recall (reminis- 
cence) of these high-arousal words is 
significantly better. This suggested 
that high arousal may be antagonistic 
to immediate direct recovery measures, 
and might require a more sensitive and 
indirect recovery procedure, less ad- 
versely affected by the conflict than 
recall or recognition. 

A word-association technique of 
“priming” or “recency” (Segal & 
Cofer, 1960; Storms, 1958) was se- 
lected for this purpose. In this method, 
a word-association test is administered 
which includes “cue” stimuli that are 
known to elicit certain associates. 
When these associates have been acti- 
vated in an immediately preceding task, 
the cue stimuli elicit them more fre- 
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TABL 
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E1 


Kent-Rosanorr Cuz WORDS AND THEIR PRIMED ASSOCIATES 


List A List B 
Cue Word Associate Frequency % Cue Word Associate Frequency % 
Exp. II and 1V 

THIEF STEAL* 27.3 JUSTICE PEACE 27.0 
SMOOTH ROUGH* 25.0 COMFORT EASE 20.7 
DOCTOR NURSE^ 20.3 STOVE COOK* 19.0 
WISH DESIRE* 20.0 HAND FINGER 17.7 
STEM LEAF 17.3 CITY TOWN 11.3 
MOON SHINE* 16.0 RIVER STREAM 16.7 
MOUNTAIN HILL^ 16.0 SHEEP WOOL 16.3 
MUSIC SONG 140 BIBLE BOOK 16.0 
SOLDIER ARMY 13.3 BEAUTIFUL PRETTY^ 13.3 
WINDOW GLASS* 12.3 CARPET FLOOR 13.3 
CHEESE BREAD 11.3 HEALTH SICK 13.3 
LION ROAR® 11.3 WHISTLE BLOW 12.7 
SWEET SUGAR* 113 EARTH GROUND 12.0 
BUTTERFLY MOTH* 10.7 MEMORY FORGET 11.0 
jov SORROW* 10.3 SQUARE CIRCLE 10.3 

Cue Word Associate Frequency % 

Exp. I and III 

PRIEST CHURCH 22.0 

SLEEP DREAM 15.3 

SHEEP LAMB 15.0 

QUIET LOUD 14.7 

COTTAGE CHEESE 14.0 


^ The list for Exp. I and III includes these items also, 


quently than usual. This increased 
frequency provides a sensitive measure 
of the words’ prior activation. 

Prior activation of words via the 
C-W task was compared with the acti- 
vation produced by a less conflictful 
vowel-crossing task, by giving Ss a 
subsequent word-association task con- 
taining cut stimuli known to elicit the 
exposed words. The increase in asso- 
ciative frequency of these words was 
considered a measure of the activation 


effect. 


EXPERIMENT I 


Method and procedure—The Ss were 20 
New York University undergraduate stu- 
dents solicited in the student lounge and 


tested individually. Five males and five 
females were randomly assigned to the color- 
word group; five males and five females, to 
the vowel-crossing group. 

The color-word Ss were given a 9 X 11 
in. card on which were 15 associates, each 
repeated three times in a random arrange- 
ment, printed on five horizontal lines of 9 
words each. The 45 words were printed 
in four different colored inks (RED, 
GREEN, YELLOW, BLUE), varied ran- 
domly. The Ss were told to name the colors 
quickly and accurately and to ignore the 
words. The reading of each line was timed 
with a stopwatch. Following this task, Ss 
were given an oral word-association test 
containing the 15 cue stimuli, known to 
elicit the color associates, and 15 buffer 
items. 

Vowel-crossing Ss were given a mimeo- 
graphed sheet containing the identical 45 
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TABLE 2 


MEAN NUMBER PRIMED ASSOCIATES AND THE 
COMMONALITY RATING FOR THE COLOR- 
WORD AND THE VOWEL-MARKING 
GRours, Exp, I 


Mean 
Mean 
N | Number of |Commonality 


i Associates | Rating 
Color-Word 10 4.5** 11.4 
Vowel-Marking | 10 25: 10.4 


** Significant at p « .01. 


items, but printed in black. They were asked 
to cross out all vowels quickly and accu- 
rately. Following this, they were given the 
word-association test. 

The 15 key associates, according to nor- 
mative data collected with the Kent-Rosanoft 
list at Brooklyn College (Segal, 1960), were 
elicited with an average frequency of 17%, 
a range from 10 to 27% (cf. Table 1). The 
15 buffer words, also selected from the Kent- 
Rosanoff list (Russell & Jenkins, 1954), 
measured the commonality of the groups 
tested, that is, the number of popular asso- 
ciates given, Each of the buffer items 
selected had a primary response with a fre- 
quency of 50% or greater. 


Results.—Table 2 lists the mean 
number of primed words which oc- 
curred as associates in each condition, 
and the commonality score, the mean 
number of primary responses to the 15 
buffer stimuli. The color-word group, 
had a significantly higher number of 
primed associates, t(18)= 3.38, p< 
01, and also slightly but not signif- 
icantly higher commonality scores, t 
(18) = 1.03, p < .30, 

The mean number of responses ex- 
pected according to the Brooklyn Col- 
lege norms, is 2.6 (or 17% of the pos- 
sible total of 15). The vowel-crossing 
task mean of 2.5 showed no priming 
effect, but the color-word task yielded 
a significant priming mean of 4.5 re- 
sponses, #(28) = 190, p< .05, one- 
tailed, 


Discussion.—The results confirmed our 
expectation that the color-word task 
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would produce a heightened activation of 
the embedded words that would persist 
into the subsequent word-association task, 
By contrast, the vowel-crossing task 
showed no priming whatever. Possibly 
the vowel-crossing task broke up the 
words so they could not register as com- 
plete units? This would imply that our 
finding was not so much determined by 
high priming with the color-word task 
as by low priming following the vowel- 
crossing task. Experiment II tested this 
possibility. 

If the vowel-crossing task broke up 
the words into component letters there 
Should be poor recall of these words 
following this task. But if the words did 
register as complete units, they should 
be recalled as easily as the color-word 
items, and possibly better, since recall 
of color-word items is generally poor 
(Klein, 1954). 


EXPERIMENT II 


Method and  procedure.—Twelve New 
York University undergraduate | students 
were tested as before. K 
greatly among Ss, each S was used as his 
own control. A second set of associates 
(List B) was therefore selected for this 
experiment, containing 15 associates equated 
with List A for response frequency to their 
cue stimuli (cf. Table 1). Order of task 
and list was systematically varied. All Ss 
were asked to recall the words from the 
first task immediately following its presen- 
tation; they were then given the second task, 
and asked to recall its words, 


Results—The difference between 


Lists A and B was not significant, 
t(11) = 46, p< -60, so the lists were 


*We are indebted to Donald P. Spence 
for this suggestion. 


TABLE 3 


MEAN Numer Wonps RECALLED FROM THE 
CoLon-Wonp AND VOWEL-MARKING 
Tasks, Exp, II 


Task 


=- 
Color-Word 
Vowel-Marking 


As First Task | As Second Task 
Ee. Ifl saggi 


1.33 


2.50 4.33 


As recall varies ` 
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combined in the data analyses. Recall 
was indeed poor after the color-word 
task, averaging only about one and one- 
half words; recall following the vowel- 
crossing task was significantly better 
(cf. Table 3). An analysis of variance 
for repeated measurements (Winer, 
1962) yielded a significant condition 
effect, F (1, 10) =842, p< 025. 
Neither order nor interaction effects 
were significant. However, there was 
a strong trend for Ss to recall the 
vowel-task words better when this task 
occurred second. 


Discussion.—Crossing out vowels does 
not break up the structure of the words, 
for vowel crossing produced better recall 
than did the color-word task. This re- 
sult-highlighted the finding of Exp. I 
where vowel crossing had been less effec- 
tive in priming these words for associa- 
tion, In our next experiment, we com- 
pared priming and recall on the same Ss. 


ExPEnIMENT III 


Method and procedure—Forty New York 
University undergraduates were selected and 
tested as before; 10 men and 10 women were 
randomly assigned to the color-word group, 
10 men and 10 women to the vowel group. 
The color-word Ss were given the color- 
word card of Exp. I, then the appropriate 
word-association test; and finally, they were 
asked to recall the words from the color- 
word sheet. The vowel-crossing Ss were 
given the mimeographed page of Exp. I 
and asked to cross out all the vowels. Asso- 
ciation and recall tasks followed. 


Results.—As in Exp. II, significantly 
more words were recalled after the 
vowel-crossing task than after the color- 
word task, £(38) = 471, p < .01. Prim- 
ing was higher in the color-word group, 
but this difference fell short of the .05 
level of significance, t (38) = 1.69, p < 
.10. Table 4 presents these data. 

As in Exp. I, the mean number of 
responses following the color-word task 
was significantly greater than the mean 
according to the norms, ¢ (28) = 2.3, 
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TABLE 4 


Mean NuwBER Worps Prep AND RE- 
CALLED, AND THE COMMONALITY RATINGS 
IN THE COLOR-WORD AND VOWEL- 
MankiNG Tasks, Expr, III 


Mean 


Mean | Mean 
alae wy | Number | Number | Common- 
rimed | Words | “ality 
Amos | Recalled | Rating 
Color-Word 20 3.90 85 8.60 
Vowel-Marking| 20 | 3:20 | 3.30" | 7.55 


*** Significant at p < ,001. 


p > 025, one-tailed; but there was not 
significant priming after the vowel- 
crossing task, £ (28) =.93, p < .10, 
one-tailed. 

Commonality scores for these groups 
were lower than in Exp. I, but again 
Ss assigned to the two tasks did not 
differ significantly in commonality. 
However, there was a significantly 
greater variability in commonality 
scores for the vowel-crossings Ss, F 
(19, 19) = 241, p < .05. 


Discussion.—1n Exp. III, recall was 
better in the vowel-crossing task while 
priming was again superior in the color- 
word task. The extent of priming pro- 
duced by the color-word tasks was quite 
similar in Exp. I and III; the vowel- 
crossing task, however, yielded consid- 
erably more priming in this experiment 
than it had in Exp. I. The high varia- 
bility of the commonality scores for the 
vowel-crossing Ss suggest that the latter 
may have been a special group. There- 
fore, in the next experiment, we obtained 
both priming and recall data for both 
tasks from the same Ss. 


Experiment IV 


Method and procedure.—The Ss were 40 
New York University undergraduates. Half 
the Ss took the color-word task first, the 
other half began with the vowel-crossing 
task. At the end of the experiment, they 
were asked to recall the words from the 
second task, and then the words from the 
first task. Both recalls were placed at the 
end of the experiment to minimize contami- 
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nation of the priming data. The two 
matched lists of associates from Exp. II 
were systematically varied for task and 
order, 

Results—We combined Lists A and 
B, as the differences were not signif- 
icant. Again, there was more priming 
in the color-word condition than in the 
vowel-crossing condition, the F value 
reaching the .05 level of significance, 
F (1, 38) 24.00, p=.05. Recall 
again was clearly better for the vowel- 
crossing Ss, F (1, 28) = 28.90, p< 
005. These data are presented in 
Table 5, 

Priming in the color-word groups re- 
mained consistent whether the task was 
first or second, whereas priming for the 
vowel-crossing groups was considerably 
lower when it was the second task, 
Condition x Order interaction, F (1, 
38) = 4.09, p = .05. Recall following 
the color-word task was also stable, but 
recall following the vowel task was 
much higher when this task occurred 
second, Condition x Order interaction, 
F (1, 28) = 12.10, p <.005. The de- 
sign permitted immediate recall of 
words from the second task, while re- 
call of words from the first task was 
delayed, 

The priming effects were compared 
to an unprimed control group: 32 New 
York University undergraduates took 
Association Tests A and B only, with 
list order varied. All experimental 
groups showed a significant priming 


TABLE 5 


MEAN NuwBER Wonmps PRIMED AND RE- 
CALLED IN THE COLOR-WORD AND VOWEL- 
ManktNG Conpitions, Exp, IV 


Mean Number 


M 
Primed Associates oda dE 


We 


Color-Word 


4.25 
Vowel-Marking | 3.90 
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TABLE 6 


MEAN NUMBER CRITICAL RESPONSES IN THE 
CONTROL Group, COMPARED TO MEAN 
NUMBER OF ASSOCIATES AFTER 

PRIMING IN THE Ex 


Order of 


Task " 
Control 
Color- | Vowel- 
Word | Marking 
First 241 4.25 3,86** 
3.90 | 3,22** 
Second 1.91 3.90 5.08** 
2.80 | 2.35* 


* Experimental condition indicated, compared to 
the Jelevant control, 


5$ f 

effect compared to their appropriate 
control (cf. Table 6). It was also 
found that the frequency of critical as- 
sociates declined somewhat on the sec- 
Ond association test for the control 
groups as well as the experimental 
groups. 

Since Ss were used as their own con- 
trol on the priming tasks, the group 
differences in priming were not affected 
by commonality differences. The mean 
commonality score for the 40 Ss was 


DiscussioN 


As predicted, the exposed words did 
Occur as associates more frequently after 
priming with the color-word task than 
after the vowel-crossing task. The ex- 
tent of color-word priming was consistent 
in all the experiments, the means ranging 
from 4.5 to 3.9, However, priming was 
not so consistently obtained with the 
vowel-crossing task. Comparable mean 
figures for this task ranged from 3.9 to 
2.5; the lower figure is indistinguishable 
from the average frequency with which 
these words occurred in the normative 
data collected in Exp. IV, and at Brooklyn 
College (Segal, 1960). 
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As our experiments progressed, it be- 
came apparent that even though the asso- 
ciation-test results suggested that the 
words were strongly activated, Ss could 
remember very few of them. Typically, 
Ss were surprised when asked to recall 
these words, reporting that they had not 
even looked at them. The vowel-crossing 
task, however, produced no such surprise, 
and Ss could readily recall a large num- 
ber of words from this task. We were 
thus confronted with a paradox: effective 
priming (color word) is associated with 
poor direct recall; ineffective priming 
(vowel crossing) with good recall. 

The similarity of these findings to 
those of Kleinsmith and Kaplan (1963, 
1964) was striking, when it is recalled 
that the vowel-crossing task had been 
selected as a low-arousal, minimal con- 
flict task, while the color-word task was 
described as a conflict-producing, high- 
arousal task. However, there were also 
some rather striking differences. The 
Kleinsmith and Kaplan studies involved 
straightforward learning of items whose 
arousal value was measured physiologi- 
cally (with the GSR). They found poor 
immediate recall of high-arousal items, 
but a marked reminiscence effect. In the 
present study, however, all the effects 
of our experiment were observed within 
15 min., using an indirect but immediate 
recovery measure rather than the direct, 
but delayed recall. Furthermore, in the 
present study, there was incidental learn- 
ing of the same material under task con- 
ditions of high and low conflict, with 
arousal inferred. 

Both experiments could be explained 
by assuming that items learned under 
high-arousal conditions undergo a period 
of neural reverberation when they are 
unavailable for direct refiring (as in 
recall). These reverberating circuits, 
however, presumably result in better con- 
solidation, as inferred by the reminis- 
cence effect in Kleinsmith and Kaplan's 
work, the priming effect in our studies. 
It would be helpful to determine if an 
indirect recovery measure, such as prim- 
ing, is effective with high-arousal items 
in a focal learning task; also, if a delayed 


recovery measure would yield a reminis- 
cence effect after the color-word task. 
Further understanding of the influence 
of arousal on consolidation would be 
served by determining if the recovery 
measures are task specific or of greater 
generality. If these recovery measures 
can be used interchangeably to elicit 
material registered under high arousal, 
this would tend to support a Hebbian 
model, using neural reverberation and 
consequent consolidation as intervening 
variables (Hebb, 1949, 1958). 

However, the differences in the experi- 
ments are sufficiently great that alterna- 
tive explanations of color-word priming 
should be considered. One possibility is 
that, because of the incidental nature of 
the task, registration was not in full 
awareness, and so could only be tapped 
by measures which depend on appropriate 
types of organization. Spence (1962) 
has suggested that associations play an 
important role in the organization of 
structures that are not fully in awareness ; 
and by this token, priming may be effec- 
tive because it depends on associational 
organization. 
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ASSOCIATIVE MEMORY? 


ENDEL TULVING ann TANNIS Y. ARBUCKLE? 


University of Toronto 


A single paired-associate unit (critical unit) was presented in a fixed 
position in a series of other similar units and its recall tested either 
once or twice, following various combinations of interpolated inputs 
and outputs in the short retention intervals. Probability of recall of the 
critical unit on the Ist test was higher following 2 interpolated outputs 
than 2 interpolated inputs, while probability of recall on the 2nd test 
seemed to depend on the nature of interpolated events in a more com- 
plex fashion. The findings were interpreted in terms of differences in 
the ease of coding the critical unit with respect to its position in the 


input and output sequences. 


When a single paired-associate (PA) 
unit is presented in a series of similar 
units and tested for recall a very short 
time after its presentation, its probabil- 
ity of recall has been shown to depend 
on the number of other units presented 
for study and recall in the retention in- 
terval (e.g., Arbuckle, 1964; Murdock, 
1961, 1963a, 1963c; Peterson, Saltz- 
man, Hillner, & Land, 1962; Tulving & 
Arbuckle, 1963). In some of these ex- 
periments it is possible to compare the 
effects of interpolation of study units 
(input units) on the recall of a given 
unit with that of interpolation of recall 
units (output units). These compari- 
sons show that recall is higher follow- 
ing interpolated outputs than following 
the same number of interpolated inputs. 

Tulving and Arbuckle (1963) pre- 
sented to Ss series of 10 PA units con- 
sisting of single digits as stimulus items 
and common words as response items. 


1 This research was supported by the Na- 
tional Science Foundation under Grant No. 
GB-810. The paper was prepared for publi- 
cation while the senior author was a Na- 
tional Research Council of Canada Senior 
Research Fellow at the Institute of Hu- 
man Learning, University of California at 
Berkeley. 

?Now at Sir George Williams University. 


Under the condition that mimicked 
serial learning (referred to as ordered 
input condition in the experiment), re- 
call of a single PA unit was higher after 
n interpolated outputs than after n in- 
terpolated inputs, provided that the re- 
call data from the first few items in the 
input sequence, showing a pronounced 
primacy effect, were omitted from con- 
sideration. Thus, following 1, 2, 3, 4, 
5, and 6 interpolated outputs, probabil- 
ity of recall was .84, .66, .52, .50, .46, 
and .44, respectively, whereas following 
the same number of interpolated inputs 
the probability of recall was .76, .50, 
48, 44, .24, and .22, respectively. 
Similar data have been obtained by 
Arbuckle (1964), who used digit-word 
pairs in lists of 6, 9, and 12 units, and 
by Murdock (1963a, Exp. III), who 
used lists of 6 PA units consisting of 
words. In all these studies the length 
of the retention interval varied with n, 
but it was constant for corresponding 
numbers of interpolated inputs and out- 
puts. 

On the basis of these observations it 
might be concluded that when the 
length of retention interval is constant, 
input interference in short-term asso- 
ciative memory is greater than output 
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interference, but the evidence is not 
conclusive. In all the experiments 
cited, lists of a fixed length were used. 
This means that the effects of output 
interference are inferred from the recall 
scores of the last unit in the list, while 
the effects of input interference are in- 
ferred from the recall scores of other 
units in the list. Thus the nature of 
events in the intratrial retention in- 
terval, under these conditions, is con- 
founded with the position of a unit in 
the list. If S pays unequal attention 
to different units in the list, or for 
some other reason does not “learn” all 
units to the same degree, no comparison 
between input and output interference 
is justifiable. For example, if the last 
unit in the list is fixated more strongly 
than others, then the findings men- 
tioned above could at least partly be 
accounted for in terms of differential 
learning rather than in terms of differ- 
ences in input and output interference. 

The experiment reported in this pa- 
per was designed to compare the effects 
of input and output interference under 
conditions where differential learning 
effects were excluded. A single PA 
unit, the “critical unit,” always occupy- 
ing the same position in the input series 
of similar units, was tested for recall 
following various numbers of inter- 
polated inputs and outputs. Some of 
the critical units were tested for recall 
twice, in accordance with the RTT 
paradigm (Estes, 1960). Thus, the 
experiment provided data on recall 
probabilities of the critical unit on both 
the first and the second recall test as a 
function of number and nature of 
events interpolated in the single reten- 
tion interval or two successive intervals. 

The experiment was in many ways 
similar to a series of experiments re- 
ported by Peterson et al. (1962), ex- 
cept that in the present experiment the 
events in the intratrial retention in- 
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tervals were classified as to their nature 
(inputs or outputs). 


METHOD 


Design—The first recall test of the criti- 
cal unit (Test 1) followed Interval A filled 
with events described in Table 1. The 
interpolated events were defined in terms of 
all combinations of zero, one, or two interpo- 
lated inputs with zero, one, or two inter- 
polated outputs. To facilitate exposition, the 
nine conditions of interpolation in Interval A 
are numbered and will be referred to as 
shown in Table 1. Thus, for instance, Cond. 
1 refers to the testing of the critical unit 
immediately after its presentation, following 
no other inputs or outputs; under Cond. 3, 
recall of the critical unit was tested following 
two outputs of “dummy” items; under Cond. 
5, it was tested following one input and one 
output, or one output and one input; and 
so forth. 

The second recall test (Test 2) was given 
only for those units that were tested on 
Test 1 under Cond. 1, 3, 7, and 9. The 
second retention interval of these units (In- 
terval B) contained either two additional 
inputs (Cond. 1) or two additional outputs 
(Cond. 0). 

All Ss served under all experimental con- 
ditions, with the order of conditions counter- 
balanced among Ss. Each S was given two 
tests for each of the nine conditions with 
respect to Test 1, and one test for each of 
the eight conditions with respect to Test 2. 

Materials—The PA units consisted of 
common adjectives as stimulus items and 
single digits from 1 to 9 as response items. 
These items were printed in black India ink 
on white 3X5 index cards, with both the 
stimulus and response item on input cards 
and the stimulus item alone on output cards. 


TABLE 1 


DESIGN OF THE EXPERIMENT WITH RESPECT 
TO INTERVAL A BETWEEN THE 
PRESENTATION AND TEST 1 
OF THE CriticaL Unit 


Number of Interpolated Outputs 
dS 


Number of 
Interpolated 
Inputs 


INPUT AND OUTPUT INTERFERENCE IN STM 


The cards were arranged into series of 9-16 
units, 

Eighteen different series were used, two 
for each of the nine conditions in Interval A. 
Each series began with five inputs. The first 
four inputs served as dummies to eliminate 
possible primacy effects and provided mate- 
rials for interpolated outputs where required. 
The fifth input unit in each series was the 
critical unit. Eight of the 18 series, corre- 
sponding to Cond. 1, 3, 7, and 9 of Interval A, 
included a second recall test of the critical 
unit, In one of the two series corresponding 
to each of these four Interval A conditions, 
the’ critical unit was tested following two 
further inputs, in the other it was tested 
following two outputs of dummy units. 

Examples of some of the series used in the 
experiment are shown in Table 2. Series 
1-I yielded recall data for the critical unit 
(TIRED 8) on Test 1 immediately after its 
presentation. (Cond. 1 of Interval A), and 
on Test 2 following two interpolated inputs 
in Interval B (Cond. I). Series 5 included 
only Test 1 of the critical unit (GENTLE 
4) following one output and one input in 
Interval A. Series 7-O tested the critical 
unit (HEARTY 7) following two interpo- 
lated inputs in Interval A and two inter- 
polated outputs in Interval B. Different 
kinds of series, as well as additional tests 
of dummy units in each series after the final 
test of the critical unit, as shown in Table 
2, served to camouflage the critical unit in 
the series. In Table 2, the stimulus items 
of critical units are shown in capital letters, 
but in actual series, of course, they were 
not distinguished from the other units in 
any way. 

Subjects.—The Ss were 144 adults of both 
sexes enrolled as students in various psy- 
chology courses at the University of Toronto 
in the summer of 1963. Their median age 
was 27 yr. None of the Ss had previously 
participated in an experiment of this type, 
although some had served in other psycho- 
logical experiments. 

The first set of 72 Ss were tested with 
the 18 series as described above. For the 
second set of 72 Ss the critical units in the 
18 series were counterbalanced among Ss. 
The response item remained always the same 
in a given series, but the stimulus item 
varied. This arrangement eliminated the 
confounding between experimental conditions 
and specific critical pairs present in the first 
set. Since the overall pattern of results was 
very similar for both sets, no further dis- 
tinction will be made between the two sets 
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TABLE 2 


EXAMPLES OF SERIES USED IN 
THE EXPERIMENT 


Set 1-I Set 5 Set 7-O 
unknown 9 | upright 8 | polite 3 
divine 1 | hostile 6 | healthy 4 
silly 5 | deadly 2 | ready 1 
severe 4 | active 1 | formal 6 
TIRED 8 | GENTLE 4 | HEARTY 7 
TIRED hostile fiery 5 
wooden 3 | unseen 9 | sturdy 9 
remote 6 | GENTLE HEARTY 
TIRED upright formal 
remote unseen sturdy 

HEARTY 
polite 


of Ss. All the data reported below apply 
equally well to both arrangements, 

Procedure,.—All Ss were tested individu- 
ally. At the beginning of the session S 
received rather detailed instructions to fa- 
miliarize him with the task. He was told 
about the nature of the materials, about the 
total number of series he would be exposed 
to, the fact that series would differ from one 
another, and the fact that once he had finished 
with a series he would not be asked to recall 
any part of it again at a later time. More 
specifically, S was told to read out aloud 
both the adjective and the digit when the 
two appeared together on a card, and to say 
the correct digit whenever an adjective alone 
was presented. When he was not sure of 
the correct answer he was to respond with 
the first digit that came to mind. Thus 
guessing was not only encouraged but in- 
sisted upon, That these instructions were 
effective was shown by the fact that there 
were fewer than 2% response omissions, The 
S was also told that sometimes an adjective 
would be shown alone more than once in a 
given series and that his task was to say 
the right digit every time this occurred. The 
S was not informed, of course, that one of 
the pairs was of especial interest to E. 

Prior to the first experimental series, S 
was given preliminary practice on three prac- 
tice series in which both the stimulus and 
response items were monosyllabic words, 
but which otherwise resembled the experi- 
mental series in their composition. These 
practice lists were administered to acquaint 
S with the nature of the task, and to reduce 
initial warm-up effects. 

The cards with stimulus materials were 
presented by means of a manually operated 
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TABLE 3 


Proportion OF PA Units RECALLED ON 
Test 1 AS A FUNCTION OF NUMBER OF 
INTERPOLATED INPUTS AND 
Ourruts 


Number of Interpolated Outputs 


Number of 
Interpolated 
Inputs 0 1 1 
0 1.00 .83 48 
1 .80 49 29 
2 36 32 31 


card holder attached to a plywood screen 
that separated E from S. All cards belong- 
ing to a given series were stacked in the 
holder in the predetermined order and were 
exposed to S by successive removals of cards 
from the top of the stack. The first card 
was always a blank. Its removal constituted 
the beginning of the series. Cards were 
exposed at a constant rate of 2 sec/card, 
with no intervals between exposures. Thus 
the length of Interval A varied from 0 to 8 
sec, while the length of Interval B, when- 
ever it occurred, was always constant at 
4 sec. 

All responses given by S were recorded 
by E. No knowledge of results was pro- 
vided to S until the end of the whole session. 


RrsurrS 


Test 1.—Proportions of critical units 
recalled on Test 1 for various combina- 
tions of inputs and outputs interpolated 
in Interval A are shown in Table 3. 


Each entry in Table 3 is based on a 


total of 288 responses, two responses 
from each of 144 Ss, Of primary in- 
terest in Table 3 is the comparison be- 
tween the first row, showing probability 
of recall following zero, one, and two 
interpolated outputs, and the first col- 
umn, showing probability of recall fol- 
lowing zero, one, and two interpolated 
inputs. Immediately after the presen- 
tation, the recall of the critical unit was 
practically always successful. Only one 
error occurred in 288 tests under Cond. 
1. Probability of recall was approxi- 
mately equal following one interpolated 
input and one interpolated output, with 
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a small but clearly insignificant advan- 
tage for the recall following the output. 
Recall following two interpolated out- 
puts, however, was higher than it was 
following two interpolated inputs. The 
difference between the two proportions 
(48 and .36) was significant at the 
.01 level, on the basis of the test for cor- 
related proportions (Ferguson, 1959, 
p. 148). 

Table 3 also shows that when an in- 
terpolated output was added to two in- 
terpolated inputs in Interval A (Cond. 
8), the probability of recall decreased 
only slightly, from .36 to .32, whereas 
the addition of an interpolated input to 
two interpolated outputs (Cond. 6) 
produced a larger decrement, from .48 
to.29. The addition of yet another in- 
terpolated unit (Cond. 9) did not seem 
to have any further effect on probabil- 
ity of recall. It looks as if the recall 
probability might have reached the 
asymptote after three interpolated items. 
The level of recall in Cond. 6, 8, and 9 
agrees well with the asymptotic value 
of the short-term memory function 
from various other studies (Murdock, 
1963b). 


TABLE 4 


Proportion oF PA Units RECALLED ON 
Test 2, GIVEN THAT THE UNIT WAS RECALLED 
ON Test 1, FoR Born Correct (C,C:) AND 
Incorrect (lls) Test 1 RESPONSES, AS A 
FUNCTION OF NUMBER or INTERPOLATED 
INPUTS AND OUTPUTS IN THE First AND 
SECOND RETENTION INTERVALS 


Events in the 
Second Interval (B) 


OCDE .— -—- 


Events in the 
First Interval (A) 2Inputs | 2 Outputs 
eiie. 
CiC2| Iil: | Cic: | Tals 
0 Inputs, 0 Outputs | .51 | — | .68 | — 


0 Inputs, 2 Outputs | .71 | .48 | .50 | -36 
2 Inputs, 0 Outputs | .61 | .48 | .68 | .47 
2 Inputs, 2 Outputs | .56 | .36 | 156 | -32 
beider sel do. 1753 


INPUT AND OUTPUT INTERFERENCE IN STM 


Test 2—A summary of Test 2 data 
is presented in Table 4. The entries 
in Table 4 are conditional probabilities 
of two kinds of events, (a) probability 
that a correct response on Test 1 was 
again recalled correctly on Test 2 
(CiCz), and (b) probability that a 
given incorrect response on Test 1 was 
repeated on Test 2 (T,I5). 

As mentioned earlier, when the crit- 
ical unit was tested for recall on Test 1 
in absence of any interpolated activity, 
it was practically always recalled cor- 
rectly. Table 4 shows that the prob- 
ability of correct recall of these units 
on Test 2 was higher following two 
interpolated outputs in Interval B 
(.68) than following two interpolated 
inputs (.51). The difference between 
these two proportions was significant 
at the .01 level. 

Critical units tested on the first recall 
test following two interpolated outputs 
in Interval A yielded a higher probabil- 
ity of recall on Test 2 when Interval B 
was filled with two inputs (.71) than 
when it was filled with two outputs 
(.50). The same was true of incorrect 
responses given on Test 1. On Test 2 
the same incorrect response was given 
again by Ss more frequently following 
two interpolated inputs (.48) than two 
outputs (.36). Unfortunately, no ap- 
propriate statistical tests seem to be 
available for testing the significance of 
these differences. In view of the abso- 
lute size of these differences and in 
view of the fact that the data were very 
similar on both sets of 72 Ss, they can 
probably be regarded as quite reliable. 

The nature of units interpolated in 
Interval B did not seem to have any 
large or obvious systematic effect on 
probability of recall on Test 2 when 
the critical unit was first tested under 
Cond. 7 (2 inputs, 0 outputs) or Cond. 
9 (2 inputs, 2 outputs) of Interval A. 
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Discussion 


The major finding of the present ex- 
periment was that probability of recall 
of a single PA unit a few seconds after 
its presentation is higher following two 
interpolated outputs than following two 
interpolated inputs. Both Test 1 data 
under Cond, 3 and 7 and Test 2 data un- 
der Cond. 1-I and 1-O supported this 
conclusion. In the latter case the over- 
all level of recall was higher than in the 
former case, corroborating similar find- 
ings reported by Peterson et al. (1962, 
Exp. IV), but the differences in recall 
as a function of interpolated inputs and 
outputs were equally clear in both cases, 

Since the critical units occupied the 
same input position in all series, they 
must have been “learned” to the same 
extent under all experimental conditions, 
It would seem reasonable, therefore, to 
attribute the differences in their recall 
after a fixed interval to differences in 
interference effects produced by interpo- 
lated inputs and outputs. Input inter- 
ference in short-term associative memory 
is greater than output interference, 

This finding makes sense if we assume 
that short-term memory storage has a 
limited capacity. The addition of in- 
terpolated units to other units already in 
the storage under the conditions where 
interpolated events are inputs might re- 
sult in the displacement of some of the 
stored units, including the critical unit, 
with the consequence that the probability 
of recall of the critical unit decreases. 
But if we accept this interpretation, addi- 
tional mechanisms have to be postulated 
to account for the decrement in recall 
following interpolated outputs and for the 
findings on Test 2 under Cond, 3-I and 
3-O. Under the latter conditions, recall 
of the response item given by S on Test 
1, whether correct or incorrect, was 
greater following two interpolated inputs 
in Interval B than following two interpo- 
lated outputs. These findings cannot be 
easily reconciled with the hypothesis that 
interpolated inputs displace previously 
stored units. 
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A somewhat more promising hypothe- 
sis is that the critical unit can be recalled 
more readily if its initial presentation or 
first recall occurs in a position in the 
overall series of inputs and outputs that 
can be more efficiently coded by S. The 
critical units under Cond. 3 were in the 
final position of a “run” of other inputs, 
while those under Cond. 7 were more in 
the middle of the run. If S codes or 
"tags" (Yntema & Trask, 1963) both 
items of input units with respect to their 
position in the sequence, and retrieves 
some of the response items in terms of 
their position tags rather than in terms 
of a direct association between the stimu- 
lus and response item, recall would be 
higher for units that have been tagged 
more efficiently. The first recall test of 
the critical item under Cond, 3-I occurred 
in the final position of a run of three 
outputs, while under Cond. 3-O it oc- 
curred in the middle of a run of five 
outputs. Serial-position tagging of the 
stimulus item of the critical unit and the 
response item given by S to it in the 
first recall test, whether correct or in- 
correct by E's criteria, might have been 
facilitated by the distinctiveness of the 
position of the unit in the overall series, 
resulting in higher conditional recall 
probability of the critical unit under 
Cond, 3-I than Cond. 3-O. 

The serial-position coding hypothesis 
is admittedly tenuous. As more empiri- 
cal information becomes available about 
factors affecting short-term memory of 
single units of material, better interpreta- 
tions of the findings reported in this 
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study might suggest themselves. For the 
time being, however, the explanation of 
input and output interference effects in 
terms of coding operations performed by 
S on units to be retained would seem to 
be reasonable. 
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TWO REPLICATIONS OF AVOIDANCE CONDITIONING OF THE GSR1 


H. D. KIMMEL, H. S. STERNTHAL, anv H. STRUB 
University of Florida 


2 replications of an earlier study of avoidance conditioning of the GSR 
were done, The Ist involved 22 pairs of Ss who received a visual CS 
and the 2nd used 21 pairs of Ss who received an auditory CS. The 
CS-UCS interval was 5 sec. in both studies. Following a series of 
CS-only trials, paired acquisition trials were run in which the avoidance 
S of a pair could make a criterion GSR that resulted in both Ss of the 
pair avoiding the shock. Following acquisition all Ss were given 10 
extinction trials, The results of both studies showed that the avoidance 
Ss made larger GSRs than the control Ss during acquisition and extinc- 
tion, but the only significant effect was that of the combined extinction 
differences. It was concluded that these results provided further sup- 
port for the assertion that the GSR can be conditioned with an avoid- 


ance procedure. 


Kimmel and Baxter (1964) have recently 
reported positive results in an experiment 
on avoidance conditioning of the GSR. The 
present report describes two subsequent ex- 
periments which were conducted for the 
purpose of verifying the earlier findings. 

Method.—The procedures of the two repli- 
cations were identical with those of Kimmel 
and Baxter (1964) with the following excep- 
tions: (a) In Replication No. 1 the CS was 
a circular red light produced by a Grayson- 
Stadler multiple-stimulus projector. In Rep- 
lication No. 2 the CS was a 1,000-cps tone 
of 40 db., identical to that used by Kimmel 
and Baxter. In both replications the CS 
duration was 1.0 sec. and the CS-UCS inter- 
val was 5.0 sec. (b) An automatic elec- 
tronic device was used to make the decision 
whether or not a particular anticipatory re- 
sponse achieved criterion magnitude and, 
thus, led to omission of the shock. In the 
earlier study E made this decision as he 
monitored the GSR record visually. In the 
two replications, as in the earlier study, the 
criterion was determined prior to the begin- 
ning of acquisition and was maintained there- 
after. (c) Different numbers of acquisition 
trials were used. In the first replication 25 
trials were given. In the second replication 
16 trials were given. (d) In both replica- 
tions 10 extinction trials were run following 
acquisition. (e) The two replications were 

1Done under Grant_MH-06060-03 from the National 
Institute of Mental Health. The third author con- 
cutee Exp. I and the second author conducted 

Now at Ohio University. 
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run in a new GSR laboratory. The basic 
instrumentation of this laboratory has been 
described elsewhere (Kimmel & Kimmel, 
1965) and, for the present purposes, will be 
treated as essentially similar to the earlier 
one. (f) Forty-four Ss were run in yoked 
pairs in Replication No. 1 and 42 Ss were 
run in Replication No. 2. As in the earlier 
study, avoidance and yoked control Ss re- 
ceived shocks, except on Trial No. 1 when 
all Ss were shocked, only on trials on which 
the avoidance S failed to make a criterion 
GSR within the temporal interval from the 
offset of the CS to the onset of the UCS. 

Results.—In both replications the depend- 
ent variable was the magnitude of anticipa- 
tory GSRs, transformed to log conductance- 
change units, and expressed as difference 
Scores in relation to similar measures ob- 
tained on the last adaptation trial and the 
first acquisition trial (ie; the two trials 
immediately preceding the first occurrence of 
the shock). Figure 1 summarizes the results 
of both replications. 

As was observed by Kimmel and Baxter 
(1964), the avoidance group's average GSR 
during acquisition was generally higher than 
that of the control group. In the present 
two studies this difference was also observed 
during extinction. In neither of the acqui- 
sition contrasts was the observed difference 
statistically significant. When the inde- 
pendent probabilities associated with the two 
extinction contrasts were combined (Winer, 
1962, p. 44), the common null hypothesis 
could be rejected with p < .025. 
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Experiment No.2 
Auditory cs 
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Acquisition — Blocks of 3 trials Extinction ~ Blocks 
of 3 trials 


Fic. 1. GSR difference scores in acquisition and extinction in both experiments. 


Discussion.—The automatic reinforcement 
device was introduced to eliminate the po- 
tential bias of E, who was responsible in the 
previous experiment for deciding whether to 
deliver or omit the shock on each acquisition 
trial In accomplishing a reduction in bias, 
however, the use of this device may have led 
to increased variability in the application 
of the reinforcement criterion. Check of 
individual records in the present experiments 
Showed that responses that exceeded the 
reinforcement criterion occasionally failed to 
operate the device and responses that failed 
to meet the criterion Occasionally operated 
the device. Because of this the reduced 
precision of the two replications does not 
vitiate their contribution to the question. 


The approximate size of the effect in both 
replications is in accord with the earlier 
results and the combined probability of the 
extinction differences is at an acceptable 
fiducial level. It is our conclusion, thus, that 
the present findings support the earlier con- 
clusion that the GSR can be conditioned 
using an instrumental avoidance procedure. 
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RELATION BETWEEN STIMULUS PRESENTATION TIME, SERIAL 
LEARNING, AND THE SERIAL-POSITION EFFECT : 


GLORIA J. FISCHER ? 
University of South Florida 


Bugelski (1962) found total time in PA learning to be constant and 
hypothesized that stimulus presentation time (Tp) X number of trials 
to learn (N) =k. The present study tested the extension of this hy- 
pothesis to serial learning. The influence of stimulus presentation time 
on the serial-position effect was also tested. Ss learned to anticipate 
a list of 12 nonsense syllables of 33% association value, at 2, 4, 6, 8, or 


16 sec. presentation times. 


learning, at least for short presentation times. 


Results extended T,(N) =k to serial 


The latter were found 


to have no effect on the serial-position curve. 


Bugelski (1962) and Bugelski and Rick- 
wood (1963) found that increased stimulus 
presentation time (Tp) of the R terms in a 
paired-associate list decreased trials to crite- 
rion (N) in a curvilinear manner, but had a 
constant relation to total learning time. For 
at least the presentation times used (4, 6, 8, 
10, or 17 sec.) and for PA learning, Bugelski 
(1962) hypothesized that T,xN=k. The 
purpose of the present study was to test the 
generality of Bugelski’s findings to serial 
learning. The study additionally provided 
data to relate Tp to the serial-position effect. 

Method.—A. random sample of 140 stu- 
dents from a large introductory psychology 
class were required to participate in the ex- 
periment. The Ss were the first 120 of 
these students to sign up and appear for the 
experiment. 

Stimulus material was presented on a 
Lafayette memory drum which was modified 
so that presentation times could be controlled 
by a Hunter decade timer. 

A list of 12 nonsense syllables of 3396 
association value was constructed from 
Glaze's list, following the rules suggested 
by Hilgard (1951). A six-item practice 
list was similarly constructed from nonsense 
syllables of 10096 association value. 

Using the serial-anticipation method each 
S was required to learn the practice list 
and then the regular list to criteria of 6 and 
12 consecutive, correctly spelled syllables, 
respectively. Stimulus presentation times 
were 2, 4, 6, 8, or 16 sec. with no intertrial 
interval, Two Es ran half the Ss in each 
presentation time. Within the resulting 10 
cells each of 12 Ss received the two lists in 
a different rotation of the syllables. 

Results and discussion—Two Ss in the 

1The author wishes to thank J. Daniel Devine and 

ie. data. 


Eugene L. Norris, who collected the 
2 Now at Washington State University. 


2-sec. group quit and had to be replaced. 
Another S in the 6-sec. group took so many 
trials to criteria as to be considered a gross 
error (Cochran, 1947) and was, therefore, 
discarded and treated as a missing observa- 
tion. Analyses of variance and covariance 
were then performed on the two dependent 
variables: trials to criterion and total time 
to criterion, with trials and time to criterion 
on the practice list as the respective co- 
variates. As expected, stimulus presentation 
time significantly influenced the number of 
trials, but not the total time required to learn 
to spell the list of syllables serially. In addi- 
tion to presentation time the number of trials 
to learn was influenced by the number of 
trials taken to learn to spell the practice 
list (b=.55). Consequently, the mean num- 
ber of trials for the several presentation 
times were adjusted for covariation. The 
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Fic. 1. Adjusted mean trials to serially learn (W) 
at various stimulus presentation times (Ty). 
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resulting means are shown in Fig. 1, where 
the fitted parabola is a least-squares solu- 
tion. It is clear that increased stimulus 
presentation time decreased trials to crite- 
rion in a negatively decelerated manner. On 
the other hand, total learning time, T,(N), 
remained essentially constant. These results 
support the extension of Bugelski’s hypothe- 
sis from PA to serial learning. Though not 
statistically significant, deviation at 16 sec. 
from the least-squares parabola, T,(N) =k, 
suggests, however, that the equation may 
require modification for longer presentation 
times, 

The percentage of total errors that each 
S made at the 12 serial positions was trans- 
formed, using 2 arc sin Vp. These data 
were then subjected to an S-within Presen- 
tation Times (T;) X Positions (P), re- 
peated-measurements design analysis of vari- 
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ance. Results clearly indicated that variation 
in stimulus presentation time had no influ- 
ence on the serial-position effect. For T, 
X P, F (44, 1265) <1. Rather, all groups 
showed the usual serial-position curve, ex- 
cept that its origin was displaced from the 
first to the second syllable position as a 
result of no intertrial interval. 
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SERIAL-LIST ITEMS AS STIMULI IN PAIRED-ASSOCIATE LEARNING 


SHELDON M. EBENHOLTZ * 
Connecticut. College 


Previous experiments have shown a serial-position effect (SPE) in 
paired-associate (PA) learning where the pairs contained stimuli pre- 


viously learned in serial order. 
number of pairs from 10 to 14, 


The present experiment extended the 
Pairs containing stimuli from terminal 


serial positions were learned with significantly fewer errors than pairs 
whose stimuli derived from central positions. The latter produced a 
dip in the PA error distribution suggesting the presence of sequential 
associations in SL between items occupying central positions. 


Previous experiments (Ebenholtz, 1966) 
have shown a serial-position effect (SPE) 
in paired-associate (PA) learning as a re- 
sult of transfer from serial learning (SL) 
to a PA list. The pairs were composed of 
nonsense-syllable stimuli taken from a pre- 
viously learned serial list with common 
nouns as response terms. The SPE oc- 
curred when the mean percentage of errors 
in PA learning was plotted as a function 
of stimulus position in prior SL. A total of 
10 pairs were learned to a criterion of one 
perfect trial, however, PA learning was 
relatively rapid and resulted in a narrow 
distribution of errors with a relatively small 
SPE. The present experiment attempted to 
enhance the SPE by decreasing the rate of 
PA learning, thereby permitting a wider 
distribution of errors. 


2 The author wishes to thank Addison Woodward 
for his aid in the collection and analysis of the data. 


Method.—Group SL learn a 14-item serial 
list to a criterion of one errorless trial. 
Each S learned a different sequence in order 
to counterbalance items over positions. 
Group FR learned the same 14 items by the 
method of free recall with the order of pres- 
entation varied on each trial. Free recall 
continued until all 14 items were recall 
on one trial or for a maximum of 24 trials. 
Both groups then learned a 14-pair list 
composed as described above, by the method 
of anticipation to one errorless trial. Four- 
teen Ss were assigned to each group on an 
alternating basis according to their order 0 
appearance at the laboratory. Three differ- 
ent presentation sequences were used. Non- 
sense syllables for the first task (viz. SL 
or FR) were chosen from the Glaze (1928) 
list of 80-100% association values. The 
nouns used as response terms in PA learn- 
ing varied from 2 to over 50 per million 
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words according to the Thorndike-Lorge 
(1944) word count. The Ss were naive to 
the methods and purpose of the study. Items 
were presented at a 3:3-sec. rate in PA 
learning and at a 3-sec. rate in SL and FR. 
The Ss in the latter group were allowed 
30 sec. for recall after each recitation of 
the list. Intertrial intervals were 8 sec. and 
6 sec. in SL and PA, respectively. 

Results and discussion.—Two Ss failed to 
reach the first task criterion in Group FR, 
however, all Ss recalled each syllable at 
least once within 24 trials. The percentage 
of correct recalls corresponding to each syl- 
lable was determined for each S in Group 
FR and then averaged over Ss. The means 
were rank ordered and the latter were cor- 
related with the ranks of the mean percent- 
age of correct responses associated with 
these same stimuli on the PA list. The 
correlation was not significant, p (14) = .060, 
p>.05, indicating that under the present 
conditions degree of stimulus recall was not 
a determinant of PA learning. In contrast, 
there was a significant rank correlation be- 
tween the mean percentages of correct antici- 
pations at each position in SL and the mean 
percentages of correct responses in PA 
learning to stimuli deriving from these same 
positions, p (14) —.675, p < .01. This rep- 
licates previous results and supports the 
conclusion that the position of a stimulus 
term on a previously learned serial list will 
determine the distribution of errors asso- 
ciated with that term in subsequent PA 
learning. The Ss in Groups SL and FR 
reached the first task criterion in an average 
of 15.1 and 13.8 trials, respectively. Mean 
trials to learn the PA list was 6.4 and 8.0 
for groups SL and FR, respectively. The 
difference was not significant, p> .05 (Wil- 
coxon test). 

Figure 1 represents the mean percentage of 
errors in PA learning as a function of stimu- 
lus position in prior SL. The mean per- 
centage of errors at Positions 1, 2, 13, and 
14 (5.6) was compared with the mean at 
Positions 3 through 12 (7.7). The differ- 
ence was significant by the Wilcoxon test 
for paired replicates, p < .02. 

The relatively low percentage of errors 
at terminal positions is consistent with the 
previous study and supports the assumption 
that Ss respond in PA learning to the posi- 
tional properties of items previously learned 
in serial order. On the other hand, the 
U-shaped error distribution at Positions 3 


155 


MEAN PERCENT ERRORS 
o E] 


a 


12 3-4 5-6 78 — 940 "i2 i34 
STIMULUS POSITION 


Fic. 1. Mean percentage of errors in PA learning as 
a function of stimulus position in prior SL. 


through 12 provides evidence favorable to 
the assumption that SL entails the formation 
of sequential associations. This view per- 
mits the inference that associative interfer- 
ence should occur in PA learning due to the 
necessity to learn new responses to old 
stimuli (viz., an A-B, A-C paradigm). Thus 
serial associations corresponding to central 
serial positions should produce relatively less 
interference than associations corresponding 
to terminal positions as the latter presumably 
are stronger than the former. 

The particular distribution of errors rep- 
resented in Fig. 1 supports a dual process 
interpretation of serial learning (Ebenholtz, 
1963), to the effect that positional cues are 
used at the boundaries of the series and that 
sequential associations are formed between 
items at central positions where position 
discrimination is more difficult. On the 
further assumption that ease of position 
discrimination for central serial positions 
diminishes with increased list length, it fol- 
lows that one is more likely to find evidence 
of sequential associations with relatively 
longer and/or difficult serial lists. The 
present study also serves to confirm a similar 
conclusion by Battig, Brown, and Schild 
(1964). 
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STIMULUS INTERVALS, STIMULUS DURATIONS, AND DIFFICULTY 
LEVEL IN PAIRED-ASSOCIATES LEARNING? 


CALVIN F. NODINE AND BARBARA F. NODINE 
Carnegie Institute of Technology 


Presentation of paired associates using the anticipation method was 
investigated in a 3 X 3 X 2 factorial design as functions of (a) interval 
between stimulus term and St-R pair (ISI) ; 0, 1, or 2 sec., (b) inter- 
val between St-R pair and stimulus term of succeeding PA unit 
(IUI); 0, 1, or 2 sec, and (c) exposure duration of stimulus terms 
(St duration) and exposure duration of St-R pairs (Pair duration) ; 
Performance was found to increase directly with 
However, the data indicated that when 
Durations were 3:3 sec, IUI produced significantly larger incre- 
ments in correct responses than ISI from 0 to 2 sec. 


3:3 or 2:2 sec. 
ISI, IUI, and Durations. 


were 2:2 sec., the reverse was true. 


Nodine (1963) has investigated two tem- 
poral variables associated with the anticipa- 
tion method of presentation of paired associ- 
ates (PA). One class of variables which has 
been shown to influence acquisition of PA 
units is related to exposure time of stimulus 
units (St duration) and exposure time of 
stimulus and response units paired (St-R or 
Pair duration). The present experiment 
deals with a second class of temporal vari- 
ables associated with the anticipation method 
which are related to the "spacing interval" 
of PA units. One of these variables is the 
interval between the stimulus unit and the 
stimulus-response pair (designated interstim- 
ulus interval, ISI). A second variable is the 
interval between stimulus-response pair and 
the succeeding PA unit (designated interunit 
interval, IUI). The intertrial interval (ITI) 
which has been defined as the interval be- 
tween sets of n PA units may also be dis- 
tinguished; however, this interval will be 
held constant in the present study. 

The variables described above can best be 
illustrated by use of the following diagram 
which shows schematically the presentation 
of a two-unit PA list, GEL TUF and KAR BEZ 
using the anticipation method. The stimulus 
units are labeled above each block. The top 
line labeled “St” designates presentation of 
the stimulus unit; onset of each term is in- 
dicated by the upward excursion of the line, 
duration by length of the line, and offset by 
downward excursion of the line. The line 
labeled “R” represents response unit presenta- 
tion similarly. Time is indicated on the bot- 
tom line in seconds. 
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When Durations 


The specific aim of the present experim 
was the systematic exploration of the el 
of the several components of presentation. 
on performance. In order to assess the 
arate and joint effects of interval and d 
tion variables, ISI and IUI were ind 
ently manipulated at two exposure dura! 
ina 3X3X2 factorial design. Of intei 
also was the interaction of these three 
poral variables with difficulty level of the: 
units. } 
Method.—Data are reported on 144 unde 
graduates, 8 Ss in each of 18 conditions. 
apparatus components consisted of two 
projectors, six interval timers, and 
solenoid-operated shutters. One unit cons! 
ing of projector, shutter, and three ti 
controlled stimulus presentation, St dura! 
and ISI; an identical second unit contr 
St-R presentation, Pair duration, and 

All Ss acquired a mixed list of 16 
trigram pairs described previously by 
(1963). In the present experiment, e 
of Sim and M were not assessed dirt 
Instead, the 16 PA units were divided 
two levels of difficulty, hard and easy, 
termined empirically by the number of co 
responses obtained for each PA 
two previous experiments using the same 
(Nodine, 1963, 1965). Hard PA units 
represented by eight combinations of 
and M (eg., high Sim, low M for 
stimulus and response terms) and pr 
significantly fewer correct responses 
Easy PA units represented by the rema 
eight combinations of Sim and M (e. 
Sim, high M for both stimulus and re 
terms). In terms of proportions ba: 
total number of correct responses pO! 
hard PA units had an average value of # 
and easy PA units had an average 
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Fic. 1. Schematic presentation of a two-unit PA list, GEL TUF and KAR BEZ using the anticipation method. 


50. Serial learning was minimized by five 
semirandom orders of the list. 

Acquisition was investigated as a function 
of orthogonal combinations of 0, 1, and 2 sec. 
ISI:IUI at two exposure durations, 3:3 sec. 
and 2:2 sec. Because of the range of stim- 
ulus intervals employed, an interpolated re- 
call-test trial using a 4-sec. recall interval 
was systematically interspersed between each 
block of four acquisition trials. Intertrial in- 
tervals were 8 sec. All Ss received a total 
of 24 acquisition trials and 6 recall-test trials. 
At the completion of the task, Ss were ques- 
tioned by E regarding rehearsal activity dur- 
ing learning. 

Results and discussion—Analyses of vari- 
ance of correct responses and intralist errors 
are presented in Table 1. All main effects 
were significant for correct responses but 
only Trials and Difficulty were significant for 


intrusions. Correct responses increased di- 
rectly with increases in each of the three tem- 
poral variables; however, Durations produced 
a greater range of differences than either IST 
or IUI. In addition, IUI interacted signifi- 
cantly with Durations. This interaction re- 
sulted from the fact that performance for 
2:2 sec. durations was essentially invariant 
as IUI increased from 0 to 2 sec. In con- 
trast, performance for 3:3 sec. durations in- 
creased significantly (doubled) as IUI in- 
creased from 0 to 1 sec., then remained un- 
changed from 1 to 2 sec. Comparison of 
differences in mean number of correct re- 
sponses between ISI:IUI combinations 
(where the first number indicates ISI, and 
the second, IUI, respectively) using the 
Duncan test indicated 0:1> 1:0 (6.75) p< 
.01, and 0:2 > 2:0 (5.88) p < .01 when Dura- 
tions were 4:4 sec. The reverse was true 


TABLE 1 
ANALYSES OF VARIANCE OF CoRRECT RESPONSES AND INTRALIST INTRUSIONS FOR TRIALS 1-6 


Correct Responses 


Intralist Intrusions 


Source df 

MS F MS F 
Interstimulus Intervals (ISD | 2 175.10 114168 4.13 241 
Interunit Intervals (IUD 2 92.43 602** 0.78 = 
Durations (D) 1 353.89 23.07 1.82 1.06 
IUI XD 2 128.55 8.38 0.76 = 

error (b) 126 15.34 1.71 
Trials (T) 5 657.54 562.00 1.92 3.92 
ISI X 10 3.85 32999 0.69 1.41 
IUI XT 10 2:78 2.38 0.99 2:02* 
DXT 5 15:61 13.34 1.26 2.57% 
Difficult Covel (L) ^ 3763.02 853.29 55:62 66.219 
pL aly 1 7.79 1.77 407 4.84% 
W.: 12 Y i 
1 FLA à 48.62 stone 937 1.46 
ST L E 0 zx 2.6' . po 
DXL XT 5 243 2.70% 0.52 1.06 
Ist XIDI XL XT 20 0.99 2 0.85 1.73* 
IUIXD XL XT 10 2.86 3.186" 0.76 1.55 
ISI XIU X D XL XT 20 0.92 - 0.88 1.79% 
error (Wa) 630 0.90 0.49 
*p «.05. 
RD <.01 
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when Durations were 2:2 sec. and 1:0>0:1 
(7.76) p < .01, and 2:0 > 0:2 (13.74) p < .01. 

Learning curves for correct responses were 
negatively accelerated over trials for main 
effect and interactions. The curves increased 
directly with increases in ISI, IUI, and 
Durations in a diverging pattern. Intrusions 
increased over trials in an inverted U-shaped 
function, The inverted U-shaped function 
was also present in the significant interactions 
of Trials X IUI and Trials X Durations ex- 
cept for a direct increase in intrusions when 
IUI was 0 sec. in the former interaction and 
when Durations were 3:3 sec. in the latter 
interaction. 

The significant effect of Difficulty was the 
result of more than 2.5 times as many correct 
responses for easy as hard PA units. The 
several interactions with Difficulty were due 
primarily to faster learning rates for easy 
than hard PA units. Twice as many in- 
trusions occurred with Hard as Easy PA 
units, however, the interaction of Difficulty X 
Durations indicated that intrusions remained 
essentially unchanged for hard PA units as 
Durations increased from 3:3 to 2:2 sec. 
while the number of intrusions decreased for 
easy PA units as Durations increased from 
3:3 to 2:2 sec. 

Evidence from introspective reports by Ss 
suggested that lengthening either durations 
or intervals provided Ss with more rehearsal 
opportunities, — Eighty-nine percent of Ss 
reported rehearsal activity during the experi- 
ment. Half of these Ss reported multiple 
loci of rehearsal but this activity was con- 
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fined mainly to Pair durations, IUI and ITI. 
When opportunity to rehearse was limited by 
brief exposure durations to ISI or IUI, IUI 
was more important than ISI in influencing 
performance probably because Ss could re- 
hearse specific St-R pairs during IUI, but 
only stimulus terms during ISI. Lengthen- 
ing exposure durations resulted in larger 
increments in performance for ISI than IUI. 
However, lengthening exposure durations 
makes assessment of the "pure" effects of 
ISI and IUI difficult because of potential 
contamination due to increased rehearsal op- 
portunities during St and Pair durations. 
The direct relationships between these 
three temporal variables and numbers of cor- 
rect responses is viewed primarily as a func- 
tion of differential rehearsal opportunities 
by Ss which increase as stimulus intervals 
and stimulus durations lengthen thus provid- 
ing increased experience with PA units. In- 
tegration of stimulus and response terms are 
among the underlying processes which are 
presumed to vary directly with frequency of 
experiences with PA units. Lengthening 
stimulus intervals and durations hastens in- 
tegration thereby facilitating the formation 
of St-R associations necessary for learning. 
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ANALYSIS OF LEARNING RATE AND SAMPLING PROBABILITIES 
IN A CONTINGENT REINFORCEMENT SITUATION 1 


JOHN G. BORKOWSKI 2 
University of Iowa 


The relations between probabilities of rei 
of learning rates, sampling probabilities (0), 


AND 


GILBERT R. JOHNS 
Colorado College 


inforcement (r) and order 
and overall learning rates 


(a) were examined in a contingent 2-choice prediction situation. 


Rates, asymptotic levels, 6, and o values we: 
of m. The fastest rates of learning occur 


difference was large. 


‘ In the contingent two-choice reinforcement 
situation, the probabilities of reinforcement 


1 This paper is based on a thesis submitted in partial 
fulfillment for the MA degree at Ohio University, 
* Now at Oberlin College, Oberlin, Ohio. 


re found to be a function 
ted when the probability 


(x) are dependent upon S’s response so that 
reinforcing events have one set of probabil- 
ities, mu and 1 — mn, if the response is As, but 
a different set of probabilities, ma and 1 — 
Ta, if the response is As The probability 


A 


ee 
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of the A; response on Trial n is given by the 
formula 


ba = fa — (fe — pr) (1 — 0x)" 


where 6 is the stimulus sampling probability, 
`=] — rat a, and po (72/A) is the as- 
ymptotic probability of response A; The 
“overall” rate term, 0A, has been referred to 
as a (Estes, 1957b). 

According to theoretical assumptions of 
statistical learning theory, 0 is independent 
of m in the contingent as well as in the non- 
contingent reinforcement situation. How- 
ever, Estes and Straughan ( 1954) and Nei- 
mark (1956) have reported results indicating 
that @ was dependent on m in the noncon- 
tingent prediction situation. The relationship 
between @ and r in the contingent case has 
not been extensively investigated, though a 
study by Woods (1959) concluded that œ was 
dependent upon x while 6 was unaffected by 
variations in 7. 

This study was designed to determine the 
relationships between + and observed learn- 
ing rates, 0, and « in a contingent two-choice 
prediction situation. 

Method.—The Ss were 144 Ohio Univer- 
sity freshmen who were randomly assigned 
to one of six treatment groups. Groups I, II, 
and III had predicted asymptotes of .67 and 
à values of 1.5, 1.0, and .375, respectively. 
Groups IV, V, and VI had predicted asymp- 
totes of .40 and A values of 1.5, 1.0, and .375. 
Specific probabilities of reinforcement were 
determined by substituting fə and A values 
into the equation for asymptotic probabilities 
and then solving for z values. Each S pre- 
dicted which of two reinforcing lights, Ex or 
E» would occur by pressing one of two re- 
sponse keys, Ai or As A sequence of 300 
prediction trials was presented without in- 
terruption. The total time for one trial was 
6 sec. Durations of the intratrial events 
were: signal light, 2 sec.; interval between 
the signal and reinforcing lights, .8 sec.; re- 
inforcing light, 2 sec. ; interval between rein- 
forcing light and the following signal light, 
l2 sec. Each of the six treatment groups 
was divided into four subgroups, each receiv- 
ing a different sequence of E; and E; events. 
Reinforcing events were randomized in blocks 
of 100 trials with the restriction that the 
Proportions of E; and Es events would be the 
same for each of three 100 trial blocks. 

Results and discussion —The mean propor- 
tion of A; responses in 20-trial blocks for the 
six groups is shown in Fig. 1. Inspection of 
the curves with predicted asymptotes at .67 
indicates that the observed rates of learning 
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300 prediction trials. 


were ordered from Group I to III, with I 
having the fastest rate. The order of rates 
at the predicted asymptote of .40 was from 
Groups IV to VI. Performance for the six 
groups was compared by means of a 2X 3 
factorial analyses of variance on number of 
A: responses over 300 trials. A significant 
interaction was obtained between P» and A, 
F (2, 138) = 36.59, p < .01. Individual com- 
parisons of the mean number of A; responses 
for Groups I, II, and III showed no signif- 
icant difference between I and II, but the 
levels of responding for I and II were sig- 
nificantly higher than the level of III, £(46) 
= 7.39; (46) =6.72. Significant differences 
between Groups I and II occurred during the 
first 100 trials as these groups approached 
asymptote, #(46) —2.50. Though the levels 
of A; responding for Groups IV and V were 
not significantly different across conditioning 
trials, the mean number of A; responses for 
both groups was greater than for Group VI, 
1(46) =3.54; 1(46) =3.19. It was con- 
cluded that observed rates of approaching 
asymptotes of .67 and .40 were a function of 
probabilities of reinforcement as expressed 
by ^ values. 

Comparisons between the observed and pre- 
dicted number of A: responses during the last 
100 trials showed more deviation from the 
predicted asymptote of .67 in terms of "over- 
shooting" (Group I) and “undershooting” 
(Group III) than for groups with a pre- 
dicted asymptote of .40. Rates and final 
levels of learning for the noncontingent 
groups (II and V), in which mu equaled ma, 
were consistent wtih previous findings (Estes 
& Straughan, 1954; Straughan, 1956) and 
supported the “matching law" (Estes, 1957a). 

Perhaps the most salient aspect of the re- 


= 


160 


sults is the failure of Groups III and VI to 
show any appreciable learning. An explana- 
tion for the slow rates of Groups III and VI 
can be derived from a consideration of prob- 
ability differences, y = |ra — maļ. For Group 
III, where y was small (.125), the probabil- 
ity- of E; following A: was .875 while the 
probability of Es following As was .75. In 
order to approach the asymptote of .67, Ss 
were required to discriminate between the 
87% and 75% reinforced events as to the 
more. frequently rewarded response. The 
similarity between reinforcement probabilities 


(small y) resulted in a difficult discrimination 


situation and was responsible for the slow 
learning rates of Groups III and VI. 

Woods (1959) concluded that @ was inde- 
pendent of + in the contingent case. The 
parameter, 0, was estimated for each of the 


six groups in the present study from the 
F j 


formula: 
Rs — nba — (po — pr) [1 — (1 — 0r)"] (1/03) 


where Ra is the observed mean number of Ai 
responses," 1:—.50 and f. and are deter- 
mined by specific m values. 


` *An n of 300 trials resulted in negative estimated 0 
yalues; since such a parameter cannot be meaningfully 
interpreted, an » of 200 trials was used in @ estima- 
tions for each of the six groups. 
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For Groups I, II, and III estimated 0 


“values (035, .013, and .002, respectively) 
‘were found to be a function of m or X. Al- 


though the @ values for Groups IV and V 
were similar (.010 and .011), both were 
larger than the estimated @ for Group VI 
(.004):* It seems apparent that ð is depend- 
ent upon ~ in the contingent reinforcement 


-situation. 


Overall rate terms were found «to be a 
function of m. The largest a values occurred 
for Groups I (.052) and IV (.015) and the 
smallest for Groups III (.001) and VI 
(.002). 
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EFFECTS OF ADDING A STIMULUS DIMENSION 
PRIOR TO A NONREVERSAL SHIFT : 


DONALD E. GUY, FREDERICK M. VAN FLEET, AND LYLE E. BOURNE, Jn. 


University of Colorado 


In a concept-identification task, Ss attained a unidimensional solution 
(Stage 1), then used that solution to classify stimulus patterns which 
embodied an additional dimension of variation (Stage 2), and finally 
discovered a new solution determined by a nonreversal (NR) shift to 
the added dimension (Stage 3).. In Experiment I, the added dimen- 
sion was either irrelevant to" correct responding (Cond. I) or 
redundant with the initially relevant dimension (Cond. R) in Stage 2. 
Performance in Stage 3 was significantly better in Cond. R than in 
Cond. I. The results of Experiment II, including Cond. IR for 
which the added dimension was intermittently reinforced during Stage 
2, were compatible with those of Experiment I; performance in Cond. 
IR fell midway between Cond. I and R. The overall Outcome was 
interpreted within contrasting frameworks of cue-conditioning and 


hypothesis-testing models of concept identification. 


A nonreversal (NR) shift in dis- 
crimination and simple concept-learn- 
ing tasks entails an unannounced 
change in solution from one relev: 
dimension (or set of dimensions) to 
another. This paper describes two ex- 
periments designed to explore the effect 
of adding a novel dimension to the 
stimulus patterns after S had attained 
Solution to Problem 1 but before an 
NR shift was put into effect. The 
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added dimension was, in all cases, rele- 
vant after the shift, i.e., in Problem 2. 

The rationale for these experiments 
arises from the following considera- 
tions. Suppose the process of solving 
a concept-identification problem may be 
described as sampling, testing, and 
rejecting hypotheses until a correct or 
relevant one is discovered (Bower 
& Trabasso, 1963; Restle, 1962). 
Taking this assumption literally, one 
would anticipate no difference between 
conditions wherein a new (to-be-rele- 
vant) dimension is added to the stimuli 
before or on the first shift trial, for 
once S has sampled a correct hypothesis 
(Problem 1) he wilF not resample (nor 
presumably be influenced in any way 
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"by other dimensions) until he en- 
counters an error, i.e., on the first shift 
(Problem 2) trial. Further, there is 
the implication that neither the correla- 
tion between the levels of an added 
dimension and the response categories 
on preshift trials nor the number of 
preshift trials on which the dimension 
appears will be variables of conse- 
quence, 

On the other hand, if cue condition- 
ing (of relevant stimulus attributes to 
response categories)» and adaptation 
(of irrelevant attributes) underlie 
concept identification (Bourne & 
Restle, 1959), real differences would 
be expected to result from introducing 
a new dimension during a preshift trial 
sequence. If the new dimension is 
irrelevant on these trials (ie., uncor- 
related with the correct response se- 
quence), cues arising from it may 
adapt, thus becoming relatively inacces- 
sible for conditioning after the shift. 
If the new dimension is relevant (and, 
therefore, redundant with the initially 
relevant dimension [Bourne & Hay- 
good, 1959]), its cues may be partially 
conditioned in Problem 1, thus reduc- 
ing the relative difficulty of Problem 2. 
Theoretically, the amounts of condi- 
tioning or adapting for the newly intro- 
duced dimension, and, therefore, the 
change in shift difficulty relative to a 
condition wherein the new dimension 
is added on the first shift trial, should 
be direct functions of the number of 
preshift trials on which it is permitted 
to vary. 

There are two qualifications on the 
foregoing predictions, First, if the 
conditioning or adaptation of any 
single dimension depends on its validity 
(as a cue to correct responses) rela- 
tive to other variable dimensions and 
vanishes immediately upon any change 
in relative validities (Gormezano & 
Abraham, 1961; Restle, 1958), then 
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cue-conditioning theory generates much 
the same “null hypotheses” about the 
effects of introducing a new dimension 
as does a hypothesis-testing model. 


- When the shift is put into effect, all 


Problem 1 dimensions may decondition 
and deadapt, and learning may begin 
over, as if Problem 1 had»not been 
given. Therefore, failure of the manip- 
ulated variables to produce any effect 
cannot be taken as definitive evidence 
in favor of either model. Still, such a 
result would imply important restric- 
tions on a cue-conditioning theory of 
learning, 

Second, recent studies by Braley 
(1962; Braley & Johnson, 1963), 
which included some conditions similar 
to those of the present experiment, pro- 
vided evidence—not taken account of 
in present predictions—that cue novelty 
is an effective variable in classification 
tasks. Braley (1962) tested the notion 
(Bruner, Matter, & Papenek, 1955) 
that exposing Ss to certain stimulus at- 
tributes prior to an NR shift would 
facilitate the later use of these at- 
tributes when they became relevant 
(after the shift). Contrary to expecta- 
tions, however, prior exposure im- 
paired performance relative to a con- 
trol condition in which attributes were 
introduced on the first shift trial. 

ley proposed that when a new di- 
mension is first introduced it has maxi- 
mal attention value which diminishes 
over continued exposure. In his ex- 
periments, this factor would tend to 
facilitate the postshift performance of 
control Ss. Theoretically, it may be 
asserted that novel or salient stimulus 
attributes (a) are sampled with highet 
probability as hypotheses or (b) enter 
more readily into a conditioning proc 
ess. Regardless of the interpretation, 
if cue novelty is a significant factor !n 
the present experimental situation, it 
will reduce the difficulty of the NR 
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shift for conditions in which the dimen- 
sion is added on the first. shift trial. 
It is impossible, of course, to forecast 
the magnitude of the effect. 

To summarize, the purpose of these 
experiments was to explore the effects 
on performance in a concept-identifica- 
tion task that result from the introduc- 
tion, under various conditions, of a 
new stimulus dimension during a series 
of trials just preceding an NR shift. 
They were designed to provide data 
relevant to assumptions of two 
types of concept-identification theory. 
Whereas  hypothesis-testing ^ models 
predict no differences in postshift per- 
formance, cue-conditioning models may 
be interpreted to imply (a) facilitation 
when the added dimension is relevant 
(both before and after the shift), (b) 
interference when the added dimension 
is irrelevant—both effects relative to 
expected performance when no dimen- 
sion is introduced prior to a solution 
shift, and (c) that both effects will in- 
crease with number of preshift trials. 
Expectations from both theories ignore 
the probable influence of cue novelty 
which may facilitate performance when 
the new dimension is introduced on 


the first shift trial. 
Method Li] 


Task.—Each S was presented at the out- 
set with instructions describing (a) the type 
of solution to be discovered, (b) the re- 
Sponse requirements, and (c) the meaning 
of feedback signals. The S was required 
to respond to each of the serially presented 
Seometric stimulus patterns (projected on 
a viewing screen) by pressing one of two 
buttons. Immediately after each response, 
the pattern was removed and a feedback 
lamp was turned on over the correct button. 


EXPERIMENT I 


The S's problem was to learn the association , 


between these buttons and the two levels 
of a single relevant dimension. Several 
irrelevant dimensions, the levels of which 
Were uncorrelated with correct responses, 
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appeared in all problems. All conditions 

were self-paced in that S was allowed as 

much time as he needed to respond to any 

pattern. The presentation of stimulus pat- 

terns, the control of feedback signals (1-sec. 

duration) and of overall interstimulus in- 

terval (7 sec.), and the recording of re- 

sponses were accomplished automatically 

with an apparatus described by Bourne and - 
Haygood (1959). 

The task consisted of three stages. In 
Stage 1 (Problem 1), S solved a unidimen- 
sional concept problem—attaining a criterion 
of 10 correct responses in a row. In Stage 
2 (continuation of Problem 1), a new 
stimulus dimension, one which did not vary 
during Stage 1, was introduced. In all 
cases, the same dimension was relevant. in 
both Stages 1 and 2. Stage 3 (Problem 2) 
initiated the NR shift. In all conditions, a 
different dimension—the one introduced in 
Stage 2—became relevant and the formerly 
relevant dimension became irrelevant. The 
Ss were taken to a criterion of 10 con- 
secutively correct responses. The Ss were 
treated identically (except for problem dif- 
ferences) in Stages 1 and 3; groups dif- 
fered on the basis of treatments administered 
during Stage 2. While the task is divisible 
into three stages in these studies, trials were 
actually run-off without interruption or in- 
struction to S. Necessary changes in the 
programmed stimulus and feedback sequences 
between stages were accomplished by switch- 
ing silently between two projector-feedback 
transmitter couplings. 

Subjects—The Ss were 84 students in 
elementary psychology classes. They were 
assigned randomly, but in equal number to 
14 experimental conditions. 

Design.—Two problems, each given to half 
the Ss, were used in Stage 1: (Problem A) 
size (large-small) dimension and (Problem 
B) form (square-triangle) dimension rele- 
vant. Vertical position (top-bottom of view- 
ing screen), color (red-green), number (one- 
two figures), and form (Problem A) or size 
(Problem B) were the four irrelevant di- 
mensions. All patterns were presented 
against a white field. In Stage 2, the same 
dimension was relevant, but a new dimen- 
sion—Background (stippled cross hatched) — 
was introduced for some Ss? Background 


2The use of Background as the added 
dimension was suggested by the results of a 
preliminary experiment. In that study, 
which had essentially the same design and 
purpose as Exp. I, the added dimension 
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was irrelevant in Cond. I; relevant and 
redundant with the Stage 1 solution (i.e. 
its levels were perfectly correlated with the 
sequence of correct responses) in Cond. R; 
and not introduced until Stage 3 in Cond. N. 
For an equal number of Ss within each 
condition, the length of Stage 2 was 10 or 
20 trials. Stage 3 began with an NR shift 
to Background as the only relevant dimen- 
sion. The Ss in Cond. C were shifted to 
Stage 3 immediately upon attaining criterion 
in Stage 1. Thus the experiment may be 
visualized as a 3 (Conditions) X 2 (Length 
of Stage 2) X 2 (Problems) factorial with 2 
(Problems) control groups, shifted directly 
from Stage 1 to Stage 3. 


Results 


Inspection of the raw-score distribu- 
tions of errors and trials to last error 
in this and the following experiment 
suggested that normality could be bet- 
ter approximated with a square-root 
transformation. Identical results were 
obtained throughout, however, from 
analyses of variance on both measures 
in transformed and raw-score form. 
Therefore, only the untransformed 
trials measure will be considered in 
detail. 

Stages 1 and 2.—The analyses on 
Stage 1 performance were made to 
assess the equality of groups serving 
under comparable experimental condi- 
tions. No significant effect of any 
variable, except Problems (p < 01), 
was observed. Problem B was solved 
in reliably fewer trials than Problem 


was either size or form depending on which 
of the two was relevant in Stage 1. The 
outcome was inconclusive largely because 
one level of the added dimension was em- 
bodied in all Stage 1 stimulus patterns. 
Most 1 Ss (according to postexperimental 
interview) failed to notice the absence of 
the other level. Thus, the dimension failed 
to function as an added source of stimulus 
variability. The introduction, in Exp. I and 
IL of a more obvious dimension—Back- 
ground—at two previously unseen levels was 
expected to provide a more effective experi- 
mental manipulation. 
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A. No S made even one category 
error during Stage 2, which is not un- 
expected in view of the fact that prob- 
lem solution remained unchanged. 
Stage 3.—Mean numbers of trials to 
solution in Stage 3 as a function of 
the main variables (Length of Stage 2 
and the Condition of the added dimen- 
sion) are plotted in Fig. 1. While per- 
formance levels in the three conditions 
are clearly different—a result which 
would appear more consistent with ex- 
pectations based on cue-conditioning 
than hypothesis-testing theory—the 
manner in which they differ is difficult 
to resolve with either position. Trials 
and errors to solution increase with 
length of Stage 2 for Cond. I, reflecting 
possibly a  cue-adaptation process. 
However, there is (a) no significant 
change in Stage 3 performance for 
Cond. R and (b) systematic improve- 
ment in Stage 3 performance for Cond. 
N. The latter observation seems likely 
to be the combined result of a facilita- 
tive overlearning effect (Grant & Cost, 
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Fic. 1. Mean number of trials to solution 
in Stage 3 as a function of cue relevancy 
and trial length received during Stage ^ 
(Each plotted point represents 12 Ss m 
Exp. I.) 
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1954) and the novelty of the relevant 
dimension first introduced in Stage 3. 
Analyses of these data (omitting Cond. 
C) revealed a reliable Conditions term 
only, Frias (2, 60) = 19.00, p < .001. 
Performance in both Cond. I and R 
differed significantly from Cond. N, 
Fi vs, n- (1, 77) = 39.49, and Fg x 
(1, 77) = 1348, p « .001. 


Experiment II 


In the two-category concept-identifi- 
cation problem, any relevant dimension 
is accompanied by confirming informa- 
tive feedback or reinforcement on 
100% of trials. In other words, its 
levels are perfectly correlated with the 
correct response categories during the 
series of trials. Any irrelevant dimen- 
sion is reinforced at a chance (50%) 
rate; its levels are associated with both 
categories on an equal number of trials 
and are uncorrelated with the correct 
response sequence. It is possible to 
construct problems wherein one or 
more irrelevant dimensions are inter- 
mittently reinforced at some rate 
greater than chance, such that a non- 
zero correlation exists between their 
levels and correct responses. The 
existence of such a dimension has been 
shown to facilitate both the identifica- 

` tion of a (different) relevant dimension 
and subsequent performance on an N 
shift to the intermittently reinforce 
dimension (Bourne & Haygood, 1960). 

When a dimension was added in 
Stage 2 of Exp. I it was either com- 
pletely relevant (100% reinforced) or 
completely irrelevant (50% rein- 
forced). Experiment II was designed 
to explore the effects of adding an in- 
termittently reinforced dimension dur- 
ing Stage 2. Based on an incre- 
mental cue-conditioning model, one 
would expect an improvement in Stage 
3 performance as frequency of rein- 
forcement during Stage 2 increases. 
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On the other hand, no effect of this 
variable is anticipated if Ss rely during 
Stage 2 solely on the relevant hy- 
pothesis sampled during Stage 1. 


Method 


Subjects—The Ss were 48 students se- 
lected on a volunteer basis from introductory 
psychology courses. They were assigned 
randomly and in equal number to eight 
conditions, 

Design.—The general procedure was es- 
sentially the same as that used in Exp. I. 
Each S solved one of two problems— ( Prob- 
lem A) size relevant or (Problem B) form 
relevant—in Stage 1, each with four ir- 
relevant dimensions. The length of Stage 
2 was 16 trials for all groups Each S 
served in one of four main conditions during 
Stage 2. In Cond. R: the added dimension 
(Background) was redundant with the ini- 
tially relevant dimension and therefore re- 
inforced on 100% of Stage 2 trials; Cond. I: 
the added dimension was irrelevant and 
therefore reinforced at a chance rate (50%) ; 
Cond. IR: levels of the added dimension 
were partially correlated with those of the 
initially relevant dimension such that their 
rate of intermittent reinforcement was 75%; 
and Cond. N: no dimension was added dur- 
ing Stage 2 and Background variation was 
first introduced in Stage 3. All groups 
were required to make an NR shift to Back- 
ground as a relevant dimension in Stage 3. 


Results 


Groups performed equivalently dur- 
ing Stage 1, except for the reliable 
difference in problem difficulty, p < .01. 

Performance in Stage 3 was clearly 
affected by the condition under which 
the new dimension was added. Mean 
trials to criterion for Cond. N, R, IR, 
and I, respectively, were 8.0, 16.0, 30.8, 
and 37.0. The overall difference is 
reliable, F (3, 40) = 419, p< .03. 
Subsequent analyses showed: (a) the 
difference between Cond. N and the 
remaining groups was significant, F 
(1,40) = 7.76, p < .01, thus confirm- 
ing the observation in Exp. I that the 
saliency or novelty of an added dimen- 
sion facilitates its utilization even more 
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than the prior relevancy of that dimen- 
sion, and (5) the trend in performance 
as a function of rate of reinforcement 
for the added dimension, 50%, 75%, 
and 100%, is linear, F (1, 30) — 521, 
p<.01. 


GENERAL DISCUSSION 


The results of Exp. I and II are not 
entirely consistent with literal interpre- 
tations of either a cue-conditioning or a 
hypothesis-testing theory of concept 
identification. On the surface they seem 
more compatible with the former: (a) 
Exposure during Stage 2 to an irrelevant 
dimension results in a Stage 3 perform- 
ance decrement, the amount of which 
increases with number of Stage 2 trials; 
and (b) increasing the degree of inter- 
mittent reinforcement for a dimension 
added during Stage 2 results in a direct 
improvement in Stage 3 performance. 
Even the fact that Cond. N yields best 
overall performance in Stage 3 is ex- 
plainable if the theory is liberalized to 
allow for a factor of novelty or saliency 
of dimensions. None of these results, 
with the possible exception of the last, 
seems predictable from available state- 
ments of theories based on hypothesis 
testing. 

One outcome of Exp. I, however, is 
cause for suspicion about the adequacy 
of a cue-conditioning model. It was 
predicted that the introduction of a di- 
mension, redundant with the initially 
relevant dimension and itself relevant in 
Stage 3, would produce a facilitory effect 
on Stage 3 performance, the amount of 
which would increase with number of 
Stage 2 trials. On the contrary, there 
is no trend toward improved performance 
with length of Stage 2 in Cond. R. As 
Such, the results are inconsistent with 
the cue-conditioning model. 

Lack of complete certainty about the 
proper interpretation of this finding led 
to additional data analyses. Stationarity, 
Vincentized learning curves (Suppes & 
Ginsberg, 1963), and backward learning 
curves were constructed from the data 
of both studies, These functions pro- 
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vided no evidence of learning, i.e., of 
departures from a chance level (50%) 
of correct responses, prior to S’s last 
error. Because there were no obvious 
differences among conditions or experi- 
ments, a composite Vincentized learning 
curve was computed and is shown in 
Fig. 2. The trend appears to be more 
compatible with expectations based on an 
all-or-none, hypothesis-testing description 
of performance than with a model which 
supposes incremental learning. To test 
an assumption basic to all-or-none the- 
ories, Order and Stationarity statistics 
described by Suppes and Ginsberg were 
computed for the data of Exp. I and II. 
The Order test is presumably sensitive 
to departures from independence of S's 
successive category responses, while the 
Stationarity test determines the reliability 
of trends in the error rate over trials. 
the resulting xy?s for the various treat- 
ment conditions over both experiments 
were insignificant, with one exception 
identified with Problem B in Exp. I 
where high conditional probabilities in 
the stimulus sequence for the first 16 
trials produced a reliable effect. p 
The foregoing analyses led to an m- 
teresting, if uncomfortable dilemma. 
Whereas overall performance trends ap- 
pear to be more consistent with expecta- 
tions based on a theory of gradual 
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Fic. 2. Vincent learning curve in quartiles 
for the proportion of correct responses prio 
to last error in Exp. I and II. 
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learning, evidence from the response 
sequences suggests the greater appro- 
priateness of an all-or-none description. 
The latter results, of course, are hardly 
critical for a general incremental learn- 
ing theory. It is not unreasonable to 
expect rapid attainment of performance 
criterion in the tasks used here for (a) 
the stimulus dimensions were familiar, 
(b) the levels within dimensions were 
well articulated and discriminable and 
had common preestablished labels, (c) 
the responses were primitive and highly 
integrated, and (d) the solution was 
simple, involving the identification of a 
single relevant dimension. When any 
one or all of these conditions fail to hold, 
learning may not only be retarded but 
also shows obvious, gradual character- 
istics. 

Suppose, however, that solution to 
simple concept-identification problems is 
attained on a single trial. Then, it must 
also be true that the particular solution 
trial is affected by the conditions under 
which the Stage 3 relevant dimension is 
introduced. Specifically, the solution 
Should be attained earlier as the degree 
of prior reinforcement for the relevant 
dimension increases. Furthermore, the 
solution trial is predictably changed by 
(a) the length of Stage 2 and (b) the 
novelty of the relevant dimension. These 
observations suggest the need for certain 
elaborations in the basic structure of 
hypothesis-testing models of concept 
identification. While it is difficult 5 
specify in detail the nature of thes 
elaborations, it is reasonable to imagine 
Some modification in the manner in 
which hypotheses are selected for test 
in any problem. The present data sug- 
gest that, rather than sampling randomly 
from a pool of tenable hypotheses, S may 
select one or more according to a “notic- 
ing order” determined by the immediate 
history of each dimension within the task. 
Thus, if a dimension has been irrelevant, 
it may be relegated to a relatively low 
Position in the noticing order. If it has 
been partially or completely relevant, then 
its position is relatively high. Moreover, 
the actual position of any dimension will 
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be a function of the trial length of its 
history in the problem, Finally, it may 
be asserted that a novel dimension—one 
with no history—takes precedence over 
all others, save possibly the initially 
relevant dimension, as a source of tes- 
table hypotheses. The foregoing descrip- 
tion could be formalized by a change 
of rules for sampling such that the hypo- 
theses arising from any dimension would 
have a selection probability which de- 
pends on prior relevancy, novelty, and 
perhaps other variables. Such an elab- 
orated model would quite likely produce 
greater consistency not only with the 
present data but also with other well- 
known results in multistage discrimina- 
tion and conceptual tasks, such as those 
comparing reversal, NR, and intradimen- 
sional shifts in solution (Issacs & Dun- 
can, 1962; Kendler & Kendler, 1962). 
However, because the present interpre- 
tation is entirely ad hoc and because the 
alternative descriptions are likely to be 
just as plausible, a detailed reformula- 
tion of hypothesis-testing theory here 
would be premature. 
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MAGNITUDE ESTIMATION OF ANGULAR VELOCITY 
DURING PASSIVE ROTATION 


JAMES H. BROWN 
United States Army Medical Research Laboratory, Fort Knox, Kentucky 


Using an adaptation of the Stevens’ scaling technique, 25 Ss estimated 
subjective angular velocity during constant angular acceleration in 


darkness. Acceleration 


intensities varied from 3°/sec? to 24° /sec?, 
with stimulus durations ranging from 10 sec. to 80 


sec. The exponent 


of the power function relating the subjective and intensive dimensions 


is on the order of 1.0. 


Although Stevens' technique (Stev- 
ens, 1957, 1958) has been applied suc- 
cessfully to responses from various 
sensory modalities, it has not been 
utilized for scaling subjective responses 
to stimulation of the semicircular 
canals. In fact, when contrasted with 
the standard psychophysical data avail- 
able for other modalities, there is an 
almost complete dearth of psycho- 
physical information relative to the 
vestibular system. Although the 
limited number of investigators work- 
ing in the general vestibular area, and 
the obvious problems of stimulus con- 
trol have contributed to this discrep- 
ancy, it is reasonable to assume that an 
equally important deterrent may stem 
from a classical question as to the 
uniqueness of sensations resulting from 
stimulation of the vestibular apparatus, _ 

The primary technique used for 
evaluating subjective reactions to angu- 
lar acceleration consists of S estimating 
his subjective velocity by depressing a 
key every time he feels he has passed 
through 90° of arc (eg, Brown & 
Crampton, 1964; Collins, 1962; 
Guedry, 1965), In addition to the ob- 
vious limitation of how rapidly S can 
Press the key at high rpm’s, key press- 
ing leads S to make the stimulus error. 
That is, it maximizes his attention to 
Such undesirable external cues to rota- 
tion as noises from actual machinery 
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rotation. In order to attenuate some 
of these problems and more directly 
focus S’s attention on his subjective 
sensations of velocity rather than the 
actual rate of rotation, the present 
study made use of an adaptation of 
Stevens’ magnitude-estimation tech- 
nique to evaluate subjective reactions 
to a broad range of angular accelera- 
tions. 


METHOD 


The stimulus, angular acceleration, poses 
a somewhat unique problem for psychophysi- 
cal techniques. Unlike stimulation in other 
modalities where time is a critical dimension 
for only very brief periods, for the vestibular 
system, time contributes to the effective 
intensity of an acceleration over a much 
longer period. That is, even with long dura- 
tion, constant, angular accelerations there is 
a systematic increase in the sensed velocity. 
In order to evaluate the course of this sensa- 
tion, a variation of the basic Stevens' tech- 
nique was required. This adaptation con- 
sisted of instructing S to make a continuous 
evaluation of his sensation relative to a 
standard stimulus, and to give frequent nu- 
merical responses during the acceleration to 
identify the momentary magnitude of his 
sensation. y k 

Apparatus —The stimulator, described in 
detail by Brown and Crampton (1964), con- 
sists of a hydraulically driven rotating beam 
that supports S’s chair. A removable cap- 
sule entirely enclosed S, permitting control of 
visual stimulation and eliminating cues of 
angular velocity associated with air move- 
ment across the body. All trials were con- 
ducted in total darkness. The S's head was 
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maintained in a fixed position over the axis 
of rotation by means of an adjustable bite 
board. A ventilating fan for the capsule 
interior and a small “dither” motor provided 
low-level vibration and noise to prevent 
detection of extraneous cues. 

Recording.—Electronic voice communica- 
tions served not only to facilitate giving in- 
structions to S' but also to permit the accu- 
rate recording of S's magnitude responses 
with respect to time. The signal from the 
intercom was led through slip rings to an 
ink-writing recorder instrumented so that 
when S responded vocally, a pen deflected. 
This provided a permanent record of the 
occurrence of the response, and E simply 
noted the magnitude of the response directly 
on the record as S responded. 

Subjects.—Twenty-five soldier volunteers 
of average intelligence and ranging from 18 
to 24 yr. served as Ss. A medical exami- 
nation verified their physical health, and 
those Ss who reported any past history of 
inner-ear difficulty or head injury were 
eliminated. All S's were experienced to the 
extent that they had participated in two 
previous experiments during which they 
were subjected to a broad range of angular 
accelerations while performing a variety of 
tasks. 

Procedure.—The 25 Ss were given 12 ac- 
celeration trials during each of six experi- 
mental sessions. An initial or practice ses- 
sion, consisting of 10 trials, served to 
acquaint $ with the experimental procedure, 
and to familiarize him with the range of 
stimuli, All accelerations alternated in sign, 
first a positive then a negative acceleration. 
In order to decrease contamination from cues 
to rotation associated with the slight g 
forces exerted on those parts of the body not 
over the axis of rotation, all accelerations 
were presented symmetrically around zero 
velocity. A 5-min. interval of constant 
velocity was programmed between trials to 
avoid interactive effects of successive accel- 
erations. A standard trial of 9? /sec? was in- 
troduced on every third trial. The S was in- 
structed not to respond to the standard 
stimulus but only to concentrate on his 
sensation of subjective velocity. At the peak 
of the 9°/sec? standard acceleration, E called 
"S0" over the intercom. This was the 
modulus and provided S with a reference 
point to facilitate responding to the test 
stimuli. The test trials consisted of angular 
accelerations of 3.0, 5.0, 8.0, 12.0, 17.0, and 
24.0°/sec?, maintained for durations ranging 
from 10 sec. to 80 sec. such that each acceler- 
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ation involved a total velocity change of 
40 rpm. During the test trials Ss nu- 
merically described the growth and decline 
of his subjective velocity relative to the 
modulus. The ordering of test trials was 
randomized within blocks of the six test 
stimuli. 

The net result of this rather complicated 
design is that each S received eight repeti- 
tions of each of the six test stimuli, Since 
responses were not made in every second 
of a given stimulus, values were interpolated 
in order that responses might be averaged 
at systematic time intervals. The sampling 
intervals for averaging were: every other 
second for the 24.0 and 17.0°/sec? accelera- 
tions; every third second for 12.0 and 8.0°/ 
sec? accelerations; and every fourth second 
for the 5.0 and 3.0°/sec? accelerations. 


RESULTS 


Geometric means for the periodically 
sampled magnitude estimates were 
computed across replications and Ss for 
each of the six test stimuli (200 values 
contributing to each mean). These 
mean values are plotted against time 
in Fig. 1. The vertical bars on the 
curves define the termination of the 
acceleration, and it is of particular im 


Fic, 1. Geometric means of magnitude 
estimates averaged across Ss and repetitio 
plotted against time for the six test anı 


accelerations. (All accelerations started at 
time “0,” and the termination of the acceler 
tion and onset of constant velocity is sho 
by the vertical bars through the res! 
curves.) 
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terest to examine the form of the 
response in this region. For the 
two short-duration accelerations there 
is an overshooting evident in the re- 
sponse, i.e., there is not a sharp down- 
ward break at the end of the accelera- 
tion, but a short period when Ss 
reported a continuing increase in sub- 
jective velocity, On the other hand, 
responses to the 20-sec. stimulus 
(12.0*/sec?) peak at the termination 
of the acceleration, while responses to 
the longer duration accelerations show 
obvious adaptation. 

In Fig. 2 these data are plotted on 
log-log coordinates against a complex 
time X acceleration (ta) quantity. The 
overall exponent of the power function 
is 1.0, indicating a linear relationship 
between sensation and the complex ta 
quantity. However, the exponent of 
1.3 for t, required to fit these data, 
indicates that the growth of the sensa- 
tion with respect to time alone is not 
linear but exponential It should be 
noted that in arriving at the equation 
and curve shown in Fig. 2, only data 
prior to the onset of adaptation for 
each acceleration were used. The re- 
maining aspect of the equation that re- 
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Fic. 2. Geometric mean magnitude esti- 
Mates plotted logarithmically against the 
complex logarithmic, fa quantity. (Time, t, 
signifies time in seconds from the onset of 
the acceleration while acceleration, a, is €x- 
pressed in degrees/sec?.) 
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quires comment is the quantity a-2.0, 
which is directly comparable to a 
threshold, and, in fact, the absolute 
threshold for sensing angular accelera- 
tion is on the order of 2°/sec? (e.g., 
Clark & Stewart, 1962). 

Although no formal reliability meas- 
urements were made, individual trials 
were plotted and showed little variabil- 
ity either across repetitions or Ss. 
That is, the linear function adequately 
described both the individual responses 
to a single acceleration and the mean 
data. 

Discussion 


Magnitude estimation appears to be an 
eminently satisfactory means of evaluat- 
ing the subjective experience of angular 
velocity. The considerable amount of 
adaptation found for the longer duration 
stimuli, e.g., the responses to the 3°/sec® 
stimulus showed almost complete adapta- 
tion by the end of the acceleration period, 
clearly implies that the subjective set 
provided by the magnitude task at least 
partially avoids the stimulus error. Al- 
though Guedry (1965) has reported 
adaptation in key-press responses to pro- 
longed constant angular acceleration in 
highly trained Ss, no adaptation was evi- 
dent in key-press responses to a 45-sec. 
stimulus from Ss similar to those used 
in the present study. It may be concluded 
that training with the key-press response 
can overcome the stimulus error problems 
inherent to the task. However, the mag- 
nitude task has clear advantages in terms 
of both its independence of the necessity 
of such training, and the likelihood of 
individual problems with key pressing 
in attaining and maintaining a systematic 
concept of degrees of arc while being 
accelerated. 

It is interesting to note that while both 
sensation and  electrophysiological re- 
sponses to prolonged constant angular 
acceleration show adaptation, nystagmic 
responses do not (Guedry, 1965). Fur- 
ther, both the linear relation demonstrated 
in the present study, and the cupulometric 
evaluation of  electrophysiological re- 
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sponses from the vestibular system 
(Groen, Lowenstein, & Vendrik, 1952) 
are in general accord with the torsion 
pendulum analog, the mechanical model 
for the semicircular canals. However, 
nystagmic responses demonstrate system- 
atic discrepancies with reference to this 
model (Brown & Crampton, 1964). 
These general comparisons accentuate the 
necessity of further psychophysical work 
with this sense organ. 
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SPATIAL S-S PROXIMITY IN HUMAN DISCRIMINATION 
LEARNING! 


C. D. STANDISH 2 
University of Sydney 


The effect of proximity of cues was studied with 2 pairs of visual 
stimuli in transfer from spatial to nonspatial discrimination learning 
tasks, with 225 Ss. The prediction that proximity promotes positive 
transfer was confirmed with both successive and simultaneous viewing 
of stimuli, with temporal as well as spatial proximity, and with reduced 


viewing time. 


The hypothesis that the spatial proximity effect is 


mediated by background stimuli was confirmed in the further finding 
that variation in background gives the same result as spatial non- 
proximity. The temporal proximity effect was ascribed to temporal 


S-R contiguity as a variable. 


By means of the technique in which 
preliminary training with two pairs of 
stimuli in fixed spatial relations is fol- 
lowed by a test stage involving the two 
pairs and their transposes (nonspatial 
discrimination learning) it has been 
shown that the dominance of between- 
pair stimulus differences yields nega- 
tive transfer from training to test when 
there is spatial noncontiguity of stimu- 
lus and response, while positive trans- 
fer occurs both when within-pair dif- 
ferences dominate and with S-R conti- 
guity (Champion & Standish, 1960; 
Standish & Champion, 1962). These 
findings were interpreted in terms of 
an extension of Spence’s S-R theory of 
discrimination learning (Spence, 1960) 
with the hypothesis that the establish- 
ment of within-pair discriminations in 
training leads to positive transfer, 
whereas the lack of such discrimination 
causes negative transfer from training 
to test stages. The first aim of the 
following experiment was to test the 
further prediction that spatial prox- 

1Based on research done for the PhD 
degree at the University of Sydney under 
the supervision of R. A. Champion, to whom 
the author is indebted for assistance in the 
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2Now at Avondale College, Cooranbong, 
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imity of the stimulus cues, as well as 
within-pair differences and S-R conti- 
guity, produces positive transfer by 
promoting within-pair discriminations 
in training. This prediction having 
been confirmed, an attempt was made 
to delineate the effects of S-S prox- 
imity in discrimination learning more 
precisely. 

The degree of spatial separation of 
the members of the stimulus pairs un- 
der the initial nonproximity conditions 
forced S to view the two cues suc- 
cessively on trials where the response 
was not simply made to one cue. If 
both stimuli were to be effective on a 
given trial, S had to look back and 
forth to the extent allowed by the ex- 
posure time of 1 sec. Another group 
was therefore trained with the distance 
between the cues (degree of nonprox- 
imity) reduced to a point where there 
was no need for the head and eye 
movements demanded hitherto, S being 
able to view the two simultaneously, at 
least with peripheral vision. 

Besides head and eye movements, 
tests were made of the effects of tem- 
poral nonproximity of cues, reduced 
viewing time, and heterogeneity of vis- 
ual backgrounds as factors confounded 
with spatial nonproximity. In each 
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case the cues were presented in spatial 
proximity, and the results were com- 
pared with the initial data for spatial 
proximity vs. nonproximity. 


METHOD 


Subjects.—The Ss were 225 students from 
undergraduate psychology courses, allocated 
at random to 15 groups of 15 each. 

Apparatus—The S was seated 18 in. in 
front of a homogeneous opaque screen con- 
taining two 21 X24 in. pearl-perspex win- 
dows side by side at eye level, each of which. 
could be illuminated by a 15-w. lamp whose 
intensity was controlled by a variac. The 
location of the windows in the horizontal 
midline of the screen was adjustable, to 
allow for the proximity variable, but they 
were always placed symmetrically with re- 
spect to S. A response button was placed 
in the top left corner of the left window 
and the top right corner of the right win- 
dow; the buttons activated corresponding 
signal lamps behind the screen, where E sat. 
The intensity values of the stimuli in foot- 
lambert units were: Bi—125, Ba= 100, Di 
=2.5, Ds=3.2; these were measured with 
a photometer at the usual location of S’s 
head. For the test of visual background 
effects the screen containing the stimulus 
windows was bisected vertically to give left 
and right backgrounds, which were covered 
with white cardboard marked with horizon- 
tal or yertical black stripes 2 in. wide cen- 
tered 2 in. apart. 

Procedure.—The stimulus windows were 
placed together for proximity conditions and 
with centers 60 in. apart for the initial non- 
proximity condition. The stimuli were first 
presented simultaneously in pairs for 1 sec. 
on each trial, and the intertrial interval was 
approximately 15 sec. The $ was given a 
verbal "ready" signal approximately 2 sec. 
before each presentation and was simply 
instructed to press one of the two buttons as 
a response, which was immediately described 
verbally by E as correct or wrong. 

For each of these two degrees of stimulus 
proximity there were experimental and con- 
trol groups, the latter simply being tested 
(Stage 2) without preliminary training 
(Stage 1). The four groups were identified 
as Groups P (proximity experimental), PC 
(proximity control), NP (nonproximity ex- 
perimental), and NPC (nonproximity con- 


trol). In Stage 1 all experimental Ss were 
trained with B: (positive) and Ba (nega- 
tive), and with Da (negative), and Di 
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(positive) as the stimuli on the left and 
right, respectively, to be discriminated within 
pairs. In Stage 2 all Ss were tested with 
these two pairs and their transposes. The 
designation "positive" means that, for a cor- 
rect response, S was required to press the 
button corresponding spatially to that stimu- 
lus. The arrangement of stimuli is sum- 
marized in Table 1. 

It will be noted that the intensity dif- 
ference between pairs was greater than 
that within pairs. The stimulus pairs were 
presented in random order, with the re- 
striction that they appear with equal fre- 
quency, and the criterion of learning in both 
stages was 12 successive correct responses. 
Testing in Stage 2 was discontinued if $ 
could not achieve this criterion in 100 trials. 
The whole procedure to this point was repli- 
cated with four more groups of Ss, under 
conditions of spatial S-R noncontiguity; the 
response buttons in the stimulus windows 
were disconnected, and S was required to 
operate a pair of buttons placed together 
at the foot of the screen, at hand level. 

The transfer control groups were dis- 
carded in the remaining procedure, although 
the training-test method was preserved for 
the sake of continuity. In the test for the 
effect of head and eye movements, the stimu- 
lus windows were placed with their centers 
12 in. apart to give reduced nonproximity 
conditions (Group RNP).. In the test of 
temporal nonproximity effects the stimulus 
windows were placed together, but there 
was a 4-sec. interval between the offset of 
one stimulus and the onset of the other 
member of the pair on each trial (Group 
TNP); the first stimulus in this sequence 
was determined at random, with the restric- 
tion that all stimuli appear first with equal 
frequency. The effect of reduced viewing 


TABLE 1 


SPATIAL ARRANGEMENT OF STIMULUS 
CuEs IN PAIRS 


Stage Cus Control Groups 
1 Borbi = 
2—:Did- 
2 Bid-:Bj— B,+:Be— 
3—:Di4- Di—:Did 
Be—:Bi+ Bs—:Bi 
Dı+:D:— Di+:De— 


Note.—The spatial arrangement of the symbols in 
ch pair correspond ti -right arrange- 
ments of the stimuli. phe EA Ie right a 
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time was tested by reducing the exposure 
time of the two stimuli presented simul- 
taneously, from 1 sec. to 4 sec., with the 
stimulus windows together (Group RTP). 

The test of visual background effects in- 
volved four more groups of Ss, with the 
background arrangements distributed thus: 
Group D:—vertical black stripes on the left, 
horizontal on the right; Group D.—horizon- 
tal left, vertical right; Group S;—vertical 
left and right; Group S:—horizontal left 
and right. 


RESULTS 


The results in terms of trials to cri- 
terion for the test of proximity vs. non- 
proximity are summarized in Table 2. 
Analysis of variance in the Stage 2 
(test) data showed both the main effect 
of proximity and its interaction with 
the experimental-control variable to be 
statistically significant, F (1, 56) = 
167, p <.001, and 52, p < .05, re- 
spectively. Further ¢ tests showed a 
significant difference between Groups 
P and PC, : (56) = 4.96, p< .001, 
but not between Groups NP and NPC, 
t (56) = .33, p > .05. Thus it appears 
that positive transfer occurred only 
with proximity conditions, and that 
there was no negative transfer under 
nonproximity conditions. The same re- 
sults were obtained when this experi- 
ment was replicated under conditions 
of spatial S-R noncontiguity. 

The mean number of trials to cri- 
terion in Stage 2 (test) for Group 
RNP was 53.3 (SD = 34.2), a value 
lying between the comparable scores 
of 13.1 for Group P (proximity) and 
64.7 for Group NP (nonproximity). 
Analysis of variance in these three sets 
of scores showed a significant overall 
effect, F (2, 42) =11.0, p< 001. 
Separate f£ tests indicated a significant 
difference between Groups P and 
RNP, £ (42) = 3.50, p < .01, but not 
between Groups RNP and NP, t (42) 
=0.99, p 2.05. The same measure 
for Group TNP was 40.1 (SD = 
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TABLE 2 


TRIALS TO CRITERION IN STAGE 2 OF THE TEST 
or Proximity vs. NONPROXIMITY 


Conditions Proximity Nonproximity 
Experimental 

M 13.1 64.7 

SD 2.3 34.9 
Control 

M 47.9 62.4 

SD 32.8 31.4 


30.2), a value which also lay between 
those of Groups P and NP. The over- 
all effect in these three groups was 
significant, F (2, 42) = 112, p < .001, 
and there were significant differences 
between both Groups P and TNP, 
t (42) =2.48, p «.05, and Groups 
TNP and NP, : (42) — 2.47, p <.05. 

The mean number of trials to cri- 
terion for Group RTP in Stage 2, 
17.5 (SD — 7.0), once more lay be- 
tween Groups P and NP. The overall 
effect was significant, F (2, 42) — 
20.9, p < .001, but only the compari- 
son of Groups RTP and NP yielded a 
significant t value, t (42) = 5.55, p < 
.001, the difference between Groups P 
and RTP being not significant, t (42) 
= 0.52, p > .05. In the test of visual 
background effects, the mean trials to 
criterion in Stage 2 were: Group 
D,—321 (SD —254); D,—347 
(271); Si—17.5 (12.5); $,—184 
(6.1). A t test for same vs. different 
backgrounds showed S groups to be 
superior to D groups, t (58) = 2.80, 
p « 01. 


Discussion 


The fact that spatial S-S proximity 
acted as predicted suggests that prox- 
imity of cues, like S-R contiguity, facili- 
tates the within-pair discrimination essen- 
tial for the nonspatial learning required 
in Stage 2 of the present type of experi- 
ment. It may be inferred from the other 
results, furthermore, that a potent factor 


176 


mediating this facilitation is the similarity 
of background stimuli which occurs with 
proximity, for under these conditions S 
is forced to respond more to the stimulus 
cues as a basis for discrimination. 
Three specific features of the data de- 
serve comment. First, the results for 
Stage 1 (training) were similar to those 
for Stage 2 (test) in all cases, although 
the differences between groups were gen- 
erally less in training than testing. This 
parallel may seem surprising in view of 
the necessity for within-pair discrimini- 
nation only in Stage 2, but it must be 
allowed that some Ss made the discrimi- 
nations in Stage 1, even though they 
were unnecessary; the effects of inde- 
pendent variables in Stage 1 may have 
been the weaker simply because Ss did 
not always act this way. Secondly, al- 
though appeal has been made to the effect 
of visual background stimuli, with sup- 
porting evidence, the data also suggest 
that proprioceptive stimuli from head and 
eye movements may not play the same 
role under the present conditions, al- 
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though there is no obvious reason why 
they should not do so. Finally, it was 
found that temporal as well as spatial 
nonproximity acted to depress perform- 
ance, and an explanation for this effect 
may be found if it be recognized that the 
introduction of a time interval between 
the stimuli means less temporal S-R con- 
tiguity when the positive stimulus cue 
is presented first and S is forced to wait 
for the second cue before responding, as 
in the present experiments. 
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EFFECTS OF CS AND UCS RELATIONSHIPS ON ELECTRO- 
DERMAL RESPONSE AND HEART RATE* 


GEORGE H. ZIMNY,? JOHN A. STERN, AN» STANTON P. FJELD 
Washington University School of Medicine, St. Louis, Missouri 


The effects of 4 CS-UCS conditions upon physiological responses were 
studied. The conditions were forward conditioning, backward condi- 
tioning, UCS-alone, and unpaired CS and UCS. 6 test, 42 “condition- 
ing,” and 4 extinction trials were given. Analysis of variance was 
carried out for 3 measures of electrodermal response (EDR) and 4 
measures of heart rate. Significant group differences for EDR 
amplitude were found on test and extinction trials. The UCS-alone 
and forward-conditioning groups had higher amplitudes during condi- 
tioning. During extinction the forward-conditioning group continued 
to exhibit higher amplitude. The findings for 3 conditions were ac- 
counted for by orienting-response theory, but those for forward condi- 


tioning did not conform to this theory. 


In the classical conditioning of auto- 
nomic responses, both the CS and UCS 
initially elicit similar responses. The 
lack of a “neutral” CS makes it difficult 
to clearly identify the response to the 
CS alone on test or extinction trials 
as due to conditioning (Fjeld, Stern, & 
Zimny, 1964) rather than to factors 
such as sensitization (Prokasy & Ebel, 
1964), pseudoconditioning resulting 
from use of a noxious stimulus (Kim- 
ble, Mann, & Dufort, 1955), and the 
order of CS and UCS pairing other 
than the classical CS-UCS forward 
pairing (Champion & Jones, 1961). 
In addition, Stewart, Stern, Winokur, 
and Fredman (1961) have indicated 
that the initial electrodermal response 
(EDR) to a stimulus is an orienting 
response (OR), and they suggest that 
much previous work in autonomic con- 
ditioning has dealt primarily with the 
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adaptation and recovery of uncondi- 
tioned responses (ORs) rather than 
with conditioning of responses, Soko- 
lov (1963), too, describes the return 
of the OR upon any change in stimula- 
tion and the habituation of the OR with 
repeated stimulation. 

The purpose of the present study is 
to test certain predictions derived from 
OR theory and research about the ef- 
fects on autonomic responses of four 
stimulus conditions, namely, CS-UCS 
pairing, UCS-CS pairing, random CS 
and UCS, and UCS alone. In all con- 
ditions, interpolated CS-alone test trials 
are given, In the UCS-alone condi- 
tion, there is minimum habituation of 
the response to the CS and maximum 
return of the OR to the CS presented 
after a series of UCS presentations. 
Consequently, a maximal response is 
predicted on test trials. The random 
condition involves many more presenta- 
tions of the CS, resulting in increased 
habituation to the CS, and the frequent 
interpolation of the CS among UCS 
presentations results in a minimum re- 
turn of the OR to the CS. Conse- 
quently, a minimal response on the test 
trials is predicted. In forward and 
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backward conditioning, the CS becomes 
part of a single complex stimulus, 
thereby decreasing habituation of the 
OR to the CS and increasing the return 
of the OR to the CS. Consequently, 
the response on test trials for these two 
conditions is predicted to lie between 
that for the UCS-alone and random 
conditions. No predictions based on 
OR theory are made here about dif- 
ferences between the forward and back- 
ward conditions. 


METHOD 


The Ss consisted of four groups each with 
12 paid volunteer male, college sophomores 
randomly selected from students enrolled in 
introductory psychology. The Ss were ran- 
domly assigned to groups. 

EDR was recorded as a dc resistance 
change through 0.5-in. diameter zinc elec- 
trodes coated with electrode paste and at- 
tached to the cleansed volar surfaces of the 
second phalanx of the first and third fingers 
of the left hand. Shock electrodes were 
similarly attached to the right hand. Heart 
rate (HR) was recorded by a cardio- 
tachometer through a photoelectric cell on 
the lobe of the left ear. The stimuli were 
an ac shock of .5-sec. duration and a 1,000 
cps, 70-db. tone of .5-sec. duration. All 
stimulus presentations were initiated by tape- 
recorded signals with the durations con- 
trolled by timers. EDR, HR, and shock and 
tone durations were recorded on a Model 5 
Grass polygraph. 

Prior to any experimental presentation, S' 
was given shocks of increasing intensity to 
determine a level that was "definitely un- 
pleasant but not painful" in S's opinion. 
Subsequently, all Ss were told that they 
would be given some tones and shocks and 
were given a test trial consisting of a tone. 
Each S received one of four treatments: 
only shocks were presented (UCS-alone) ; 
tones preceded and were contiguous with 
shocks (forward conditioning) ; shocks pre- 
ceded and were contiguous with tones (back- 
ward conditioning); or tones and shocks 
were presented in random order with the 
tone preceding or following the shock by an 
interval of 2 to 15 sec. (random pairing). 
In the random-pairing condition, the number 
of tone and shock presentations were iden- 
tical to the number used for forward and 


backward conditioning. The time intervals 
for presentation of all shocks and of all test 
trials were identical for the four conditions. 
The intertrial interval varied from 30 to 50 
sec. in a random sequence except for the 
random-pairing condition. Forty-two acqui- 
sition trials were given. A test trial (tone- 
only) was presented before the first and 
after the second, eleventh, twenty-second, 
thirtieth, and forty-second acquisition trials. 
Four extinction trials (tone-only) were 
given after the last test trial. 

The three measures of EDR were: (a) 
amplitude, the difference in resistance be- 
tween the preresponse base level and the 
maximum level of a response occurring 
within 5 sec. of stimulus onset; (b) latency, 
the time between stimulus onset and response 
onset; and (c) recruitment, the time between 
response onset and the peak of the response. 
Base level of skin resistance was measured 
immediately prior to stimulus onset. The 
four measures of HR were: HR base, the 
mean of the 4 beats immediately prior to 
stimulus onset; HR change, the mean of 
the fastest beat and the higher adjacent 
beat among the first 12 poststimulus beats 
minus the HR base; HR acceleration, the 
mean of the first 2 poststimulus beats minus 
the HR base; and HR variability, the dif- 
ference between the fastest and slowest of 
the first 12 poststimulus beats. 


RESULTS 


A repeated-measures analysis of 
trends (Edwards, 1963) was carried 
out for the eight measures on the five 
test and on the four extinction trials. 
No significant effects were found for 
EDR recruitment, HR base, HR 
change, or HR acceleration-decelera- 
tion. HR variability increased over test 
trials, F (5, 220) = 2.82, p < .05, and 
over extinction trials, F (3, 132) — 
3.14, p < .01. Base skin resistance de- 
creased over test trials, F (5, 220) = 
27.20, p < .001, and increased over €x- 
tinction trials, F (3, 132) = 3.60, ^ < 
01. For EDR latency, only the extinc- 
tion interaction effect was significant, 
F (9, 132) = 2.57, p< 01; A trend 
analysis of this interaction indicated 
that only the linear components of the 
trends of the four groups differed sig- 
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nificantly, F (3, 132) = 3.39, p < .05. 
Examination of the data suggests that 
the latency of the UCS-alone group de- 
creased over trials whereas that of the 
other groups did not. 

The EDR amplitude measures are 
shown in Fig. 1. For the test trials, 
the trials effect was significant, F (5, 
220) = 2.74, p < .05. A trend analy- 
sis of the overall trial means indicated 
that the trend is essentially linear, F 
(1, 220) = 7.64, p « 01. Examina- 
tion of Fig. 1 suggests that it is the 
initial test trial which accounts for this 
effect. The interaction effect was also 
significant, F (15, 220) — 244, p< 
01. A trend analysis of this interac- 
tion indicated that the linear compo- 
nents of the trends of the four groups 
differed significantly, F (3, 220) — 
9.21, p < .001, and the Duncan's new 
multiple-range test (Edwards, 1963) 
was then carried out. It was found 
(for p = .05) that the UCS-alone group 
differed significantly from the back- 
ward-conditioning group on Trial 2 
and from the backward-conditioning 
and the random-pairing groups on Trial 
3, Trial 4, and Trial 5. The forward- 
conditioning group differed from the 
same two groups on Trial 4. Thus, 
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Fic. 1. EDR amplitude on test and ex- 
tinction trials for four conditions. (Differ- 
ences indicated are statistically significant— 
A-forward, B-backward, C-UCS alone, 
D-random.) 
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UCS alone which has the fewest tones 
produced greater EDR amplitude than 
either backward conditioning or ran- 
dom pairing but not greater than for- 
ward conditioning. It was only after 
30 conditioning trials (at Trial 4) that 
EDR amplitude in the forward-condi- 
tioning procedure differed significantly 
from that of any other condition. 

For the extinction trials, the condi- 
tions effect was significant, F (3, 44) = 
3.42, p < .01. Duncan’s test (for p = 
.05) showed that the forward-condi- 
tioning group had a significantly greater 
overall EDR amplitude than the back- 
ward-conditioning and random-pairing 
groups. The UCS-alone condition did 
not differ significantly from these two 
groups. The interaction effect was also 
significant and a trend analysis indi- 
cated that the quadratic components of 
the trends of the four groups differed 
significantly, F (3, 132) = 808, p< 
.001. Inspection of Fig. 1 indicates 
that it is the random-pairing group 
which accounts for this effect, 


Discussion 


The four experimental conditions pro- 
duced no differential effects upon cardiac 
activity. EDR amplitude demonstrated 
the greatest effects of the experimental 
conditions, The backward-conditioning 
and random-pairing groups did not 
differ and both showed a consistent de- 
crement in amplitude over test trials. 
This decrement may be interpreted as the 
habituation of the OR due to repeated 
stimulation suggested by Stewart et al. 
(1961) and Sokolov (1963). The find- 
ing that the backward-conditioning and 
random-pairing groups do not differ is 
in general agreement with the usual find- 
ings for backward conditioning (Kimble, 
1961, p. 158) but is not in agreement 
with the EDR findings of Champion and 
Jones (1961) which were, however, based 
on far fewer conditioning trials. 

The EDR amplitude of the UCS-alone 
group during acquisition was significantly 
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greater than that of the backward or 
random-pairing groups. One explanation 
of the superiority of this group is that 
the discrepancy between the acquisition 
stimulus (shock) and test stimulus (tone) 
is greatest for this group, thereby pro- 
ducing the greatest return of the OR on 
test trials. The bulk of research on gen- 
eralization gradients has demonstrated 
that the greater the stimulus change, the 
greater the EDR amplitude change 
(Grim & White, 1965; Kimmel, 1960; 
Sokolov, 1963). Another explanation is 
that the UCS-alone group received far 
fewer tone stimuli during acquisition than 
the other groups, thereby reducing the 
opportunity for habituation of the OR 
to the tone. 

The EDR amplitude of the forward- 
conditioning group was also greater than 
that of the backward-conditioning and 
random-pairing groups, but only after 
30 conditioning trials had been given, 
Since the same number of tone stimuli 
were given to the three groups, it is 
apparent that the CS-UCS pairing pro- 
duced effects that counteracted the ha- 
bituation effects which occurred in these 
two groups. Whether this superiority of 
the forward-conditioning group is due to 
conditioning or to a factor such as 
the discrepancy between the acquisition 
stimulus and test stimulus cannot be de- 
termined by a comparison among these 
three groups. The discrepancy, for ex- 
ample, between the CS-UCS stimulus and 
CS-alone (test) stimulus is different from 
that between the UCS-CS and CS-alone 
stimulus. Grings (1960) has discussed 
this factor within the context of “percep- 
tual-disparity response.” 

The most interesting finding is that the 
EDR amplitude for the UCS-alone group 
during acquisition was as great as that 
for the forward-conditioning group. On 
the basis of the two OR factors of dis- 
crepancy between acquisition and test 
stimuli (OR return) and of number of 
tone stimuli during acquisition (OR 
habituation), the amplitude for the UCS- 
alone group would be expected to be 
considerably greater than that for the 
forward-conditioning group. The equality 
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of the two groups strongly suggests the 
operation of another process, such as 
conditioning. Support for the operation 
of conditioning (learning) is provided 
by the fact that 30 trials were required 
before the forward-conditioning group 
differed significantly from the backward 
and random-pairing groups, which were 
showing pronounced habituation. The 
concept of the perceptual-disparity re- 
sponse is not sufficient to explain the 
equality since there appears to be greater 
disparity between the UCS-alone and CS- 
alone (test) stimuli than between the 
CS-UCS and CS-alone stimuli, 

In the later stages of acquisition, both 
the forward conditioning and UCS-alone 
groups differed írom the other two 
groups. In extinction, however, the for- 
ward-conditioning group continued to 
demonstrate a significantly greater EDR 
amplitude than the backward-condition- 
ing and the random-pairing groups while 
the UCS-alone group no longer differed 
from them. The UCS-alone and forward- 
conditioning groups, however, did not 
differ significantly. It may be inferred 
from these findings that the forward- 
conditioning procedure was the most pro- 
ductive of resistance to extinction. These 
findings for extinction, like those for 
acquisition, indicate the functioning of a 
conditioning process. As before, explana- 
tion based only on the two OR factors 
and the concept of the perceptual-dispar- 
ity response would favor superiority of 
the UCS-alone group for extinction trials. 

The results clearly indicate the impor- 
tance of the type of control procedure 
used in autonomic-conditioning studies. 
Using the UCS-alone group as a control, 
no conditioning effects could be ascribed 
to the forward-conditioning group during 
acquisition or extinction, whereas such 
effects would be suggested if either the 
random-pairing Or the backward-condi- 
tioning group were used as the control. 

It was predicted on the basis of OR 
theory that the order of groups from 
greatest to least response on test trials 
would be UCS-alone, forward and back- 
ward conditioning, and random pairing. 
As predicted, the UCS-alone group was 
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clearly superior to the backward-condi- 
tioning and  random-pairing groups. 
These findings clearly demonstrate the 
capacity of OR theory to account for the 
results obtained by procedures commonly 
employed as controls in conditioning 
studies. The failure of the backward- 
conditioning group to be differentiated 
from the random-pairing group indicates 
that the factor of stimulus complexity did 
not operate as predicted to reduce ha- 
bituation of the OR or to increase return 
of the OR to the CS alone. The equality 
of the UCS-alone and forward-condition- 
ing groups was contrary to the prediction 
based on OR theory, indicating that the 
classical conditioning of autonomic re- 
sponses cannot be accounted for solely 
on the basis of OR processes. 
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ON LEARNING SEVERAL SIMULTANEOUS 
PROBABILITY-LEARNING PROBLEMS * 
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Ss performed 4 simultaneous probability-learning tasks under either 
speed or accuracy instructions. When the 4 r values were the same, 
Ss tended to overshoot matching under accuracy instructions, but under 
speed instructions overshot only at higher m values. When the 4 r 
values were not the same, there was a tendency to overshoot for lower 
7 values, but not higher ones. Ss consistently responded faster at 
higher z values and the increase in speed was linear as m increased 
from .50 to 1.00. Ss’ subjective estimates of m suggested that they 
rather consistently overshot their m estimates by 5-10 percentage points 
(except when r= 1.00). These results were discussed with respect 
to stimulus-sampling theories of pr 


In probability-learning situations it 
is often found that the probability of 
a choice [p(4;)] is approximately 
equal to the probability of that choice 
being reinforced [zy  — p(E,)]. This 
has been found in numerous studies, 
e.g., Humphreys (1939) ; Grant, Hake, 
and Hornseth (1951); Estes and 
Straughan (1954); and has come to 
be known as the probability-matching 
law. 

This law is of interest because, on 
the one hand, it has been predicted 
by mathematical learning models (eg. 
Estes & Straughan, 1954), while on 
the other hand it seems illogical or at 
least nonoptimal. 

There are at least three situations 
in which matching is not observed. 
When Ss receive payoffs or penalties 
on the basis of their correct or in- 
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correct prediction—as when Ss are 
told they are in a gambling situation— 
the asymptotic Æ, probability over- 
shoots m (Edwards, 1956; Goodenow, 
1955; Siegel & Goldstein, 1959). 
When Ss are given information about 
the random mature of the schedules 
used, they also tend to overshoot v 
(McCracken, Osterhout, & Voss, 1962). 
Finally, when more than two alterna- 
tives are involved in each choice, Ss 
sometimes overshoot m for the alterna- 
tive with the highest probability of oc- 
currence (Gardner, 1957; Neimark, 
1956). These data have been sum- 
marized by Estes (1964). 

The first two of these situations in- 
volve variables which tend to make 
each prediction in the series an indi- 
vidual event (Goodenow, 1955). It 
would be of some interest if an opera- 
tion which emphasizes the independent 
nature of choice predictions, without 
Changing the nature of the task (as 
by introducing costs and rewards or 
emphasizing randomness), was suffi- 
cient to cause overshooting, since Estes 
(Estes, 1959; Estes & Straughan, 
1954) has maintained that increasing 
independence of predictions should 
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make the stimulus-sampling theory 
more  applicable—and ^ overshooting 
should not occur. 

The first experiment was designed 
to explore this possibility. The Ss 
learned four independent, concurrent 
probability-learning problems, with 
identical m values on each problem. 
There is some evidence that this pro- 
cedure produces overshooting. Voss, 
Thompson, and Keegan (1959), and 
Goss and Sugerman (1961), using 
probabilistic paired-associate learning 
tasks, found that Ss chose the more 
frequent response on a greater propor- 
tion of trials than m. These studies 
involved response learning stages, so 
b(A,) and p(A,) did not necessarily 
sum to 1 for the various stimulus 
words. In this study a response was 
always made, so better estimates of 
these probabilities throughout the 
learning process were obtained. 

In the second experiment, different 
combinations of m values were used 
for the four tasks to provide informa- 
tion on context and generalization ef- 
fects in this type of setting. 


EXPERIMENT I 


Method 


Subjects.—A total of 32 Ss was recruited 
from the introductory psychology course at 
the Ohio State University, Experimental 
participation was part of their course re- 
quirement. , 

Apparatus.—The S sat at a table in a 
small cubicle. In front of him on the table 
were four signal lights, four pairs of response 
levers, four pairs of reinforcement lights and 
two "home" keys. The lights were mounted 
on a vertical panel behind the response keys 
and about 18 in. from the front edge of the 
table. The four red signal lights were 2} in. 
apart and 71 in. above the table top. Two 
white reinforcement lights were placed 23 in. 
below each signal light, 13 in. apart. The 
eight response levers were placed on the table 
below the stimulus panel, directly under the 
eight reinforcement lights. Thus, there were 
four complete, independent probability learn- 
ing setups each consisting of a signal light, 
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two response levers, and two reinforcement 
lights. 

The control equipment and response-moni- 
toring equipment were located in a second 
cubicle. Stimulus signals and reinforcement 
events were preprogrammed on paper tape. 
A Friden Flex-O-Writer with an external 
readout translated the paper-tape signal into 
an electronic signal which turned on the 
appropriate lights on S's panel. An elec- 
tronic timer controlled the intertrial interval. 
The S's responses and the reinforcement 
events were recorded via an Esterline-Angus 
event reader. Decision time was measured 
on Standard Electric 1/100-sec. clocks and 
was recorded manually. 

The Ss wore sound-deadening ear muffs 
to deaden as much as possible possibly dis- 
tracting noises from E's booth (chiefly from 
the Flex-O-Writer). 

Design.—Two values of m were used, m 
=.70 and r=.90, with one half of the Ss 
receiving each value. The high-frequency 
reinforcement event (Ei) was on the left 
for one half the Ss and on the right for the 
rest, and was on the same side for all four 
tasks. In addition, one half of the Ss re- 
ceived instructions emphasizing the accuracy 
of their prediction and the other half received 
instructions emphasizing response speed. 

Procedure.—The Ss reported two at a time 
to E, who read them the instructions and 
administered the trials. Each S was run 
individually for four blocks of 120 trials 
each. In each block there were 30 predic- 
tions involving each position, the order of 
predictions being randomly determined. 
Within the 30 predictions at any one position, 
the E; and Es events were randomly ordered 
according to the number of each needed 
for the appropriate m value. 

While one S performed a block of predic- 
tions, the other rested. All four blocks were 
completed in a 2-hr. experimental session, 

The Ss paced themselves on the task using 
the two “home” levers. One of these 
advanced the Flex-O-Writer tape one posi- 
tion and was used to obtain the signal and 
reinforcement information. The other turned 
off the signal and reinforcement lights after 
each prediction. An S’s sequence of re- 
sponses on any one trial was (a) press 
“home” key to obtain signal, (b) press one 
of the two response levers below the signal, 
(c) press the “home” key to obtain feedback, 
and (d) press “off” key to turn off signal 
and reinforcement lights. The Ss had no 
difficulty with this after a few practice trials. 
Response latency was defined as the interval 
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between signal-light onset and the response 
lever press. 

At the end of the session, Ss were asked to 
estimate the conditional probabilities of each 
reinforcement light [p(E:|S:) and p(E:|S;)). 


Results 


Choice data.—Figure 1 shows the 
proportion of A, choices within each 
block of 30 trials as a function of ~ 
value and of instruction. In this fig- 
ure, the data from the four positions 
have been combined, so each point in 
the figure represents 960 responses. 

The data from Blocks 3 and 4 were 
analyzed as a 2X 2X 4 X 2 factorial 
(two levels of instruction, two levels 
of m value, four positions, and two 
blocks) with repeated measures on the 
last two factors, the dependent varia- 
ble being the number of 4, responses 
in each block of trials. The analysis 
revealed that the main effect of proba- 
bility level was highly significant, F 
(1, 28) = 45.01, p «001, and that 
the interaction between instructions 


2 3 
Blocks of 30 Trials 


Fic. l. Proportion of A, responses as a 
function of v value and instructions when 
Ss had four probability-learning tasks with 
the same m value (Exp. I). 
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and probability level approached sig- 
nificance, F (1, 28) = 3.40, p < 07. 
No other effects or interactions were 
significant. It can be seen from Fig. 
1 that both m = .90 groups and the + 
= .70 groups which received accuracy 
instructions overshot their respective 
m values. The deviation from match- 
ing in Blocks 3 and 4 combined was 
highly significant for the two m = .90 
groups, t (15) = 4.786, p < .001, but 
not significant for the m = 70 group 
receiving accuracy instructions, £ (7) 
=.798, but this group had obviously 
not reached an asymptote by the end 
of the fourth block. The deviation 
from matching of the r= 70 group 
receiving speed instructions was not 
significant, £ (7) = 1.571. 

Latency data.—Latency data were 
collected during the fourth block of 
trials. Mean reciprocal latencies for 
the four conditions are shown in Table 
1, Analysis of variance of the latency 
data revealed that the z-level main 
effect was significant, F (1, 28) — 
5.00, $ < .05, along with triple inter- 
action among =, instructions, and posi- 
tions, F (3, 84) = 2.84, p < .05. The 
triple interaction is not easily interpret- 
able, but occurred because the apparent 
interaction between z value and in- 
structions in Table 1 occurred primar- 
ily at two of the four task positions. 
At Positions 1, 2, and 3 (counting 
from the left) there were large differ- 
ences between speed and accuracy in- 
structions when += .70, while when 
7 = 90 only at Position 2 were there 
instructional differences; thus, Posi- 
tions 1 and 3 produced almost all of 
the apparent two-way interaction. This 
does not seem to be explainable by 
known factors such as center vs. side 
effects or right vs. left effects. 

Estimated probabilities —The Ss’ es- 
timates of E, frequencies are given in 
Table 3. Inspection of this table re- 
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TABLE 1 


Mean RECIPROCAL LATENCIES (IN SECONDS) 
DURING TRIAL BLOCK 4 ror VARIOUS 
ConpitTions or Exp. I 


Instructions x =,.70 x =.90 Mean 
Accuracy .990 1.190 1.090 
Speed 1.116 1.187 1.152 
Mean 1.053 1.188 1.121 


veals Ss' postexperiment estimates of 
E, probabilities were somewhat closer 
to m values than were the terminal A, 
probabilities obtained during the ex- 
periment. There is no evidence that 
Ss overshoot m levels because they 
overestimate E, probabilities. 

The data from Exp. I suggest that 
performing a number of simultaneous 
probability-learning tasks is sufficient 
to cause overshooting of m values. The 
Ss tended to overshoot in the study 
with the exception of the m = .70 Ss 
who received instructions emphasizing 
speed, rather than accuracy. The most 
important effect of the speed instruc- 
tions in the study seemed to be to cause 
some Ss in the 70 group to pay less 
attention to the E, events? Speed in- 
structions did not produce significantly 
faster responding, although there was 
an interaction which showed that at 
some positions .70 Ss under speed in- 
structions reacted faster than those 
under accuracy instructions. The Ss 
at m = .90 showed only very slight in- 
structional effects on either A, proba- 
bilities or response latencies. 


ExPERIMENT II 


Experiment II was similar to I in 
that Ss performed four simultaneous 
probability-learning tasks, but in this 
study m values were not the same for 


? Two of the eight Ss in this condition had 
a mean A, proportion of .26 for Block 4 while 
the other six had a mean A; proportion of 
20. For these same two Ss the mean 
estimated p (E) = .50. 


all tasks. For Ss in II-A the pattern 
of x values was .70, .70, .90, .90 and for 
those in II-B it was .50, .70, .90, and 
1.00. The extent to which Ss can or 
cannot perform these tasks independent 
of one another and the effects of speed 
and accuracy instructions on perform- 
ance were of interest. 


Method 


Subjects—A total of 32 Ss was recruited 
from the introductory psychology course at 
the Ohio State University. Experimental 
participation was part of their course 
requirement. 

Apparatus —The apparatus was the same 
as that described previously, 

Design.—Probability values were randomly 
assigned to the four task positions. The 
position of the E; event (right or left) was 
counterbalanced across Ss, as was the in- 
structional set. 

Procedure.—The procedure was the same 
as described under Exp. I. 


Results 


Choice data.—The data from the 
II-A Ss are given in Fig. 2. Analyses- 
of-variance data from Blocks 3 and 4 


pia) 


Blocks of 30 Trials 


Fic. 2. Proportion of A: responses as a 
function of x value and instructions when 
Ss had two probability-learning problems at 
each of the r values (Exp. II-A). 
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revealed that the main effect of task 
position was highly significant, F (3, 
42) =6.25, p<.01; breaking down 
this main effect into orthogonal com- 
ponents shows that this significance is 
almost entirely due to the .70 vs. .90 
tasks, F (1, 42) = 17.87, p < .001. No 
other effects or interactions were sig- 
nificant. On Blocks 3 and 4, Ss sig- 
nificantly overshot for the .70 posi- 
tions, t (15) = 4.48, p < .001, but not 
for the .90 positions, t (15) = .14. 
Corresponding data from the II-B 
Ss are shown in Fig. 3. Analyses of 
variance of the data from Blocks 3 
and 4 reveal several significant effects 
and interactions. The analysis of vari- 
ance summary table is shown in Table 
2. The significant interactions can be 
seen in Fig. 3; the source of many of 
them seems to be that the lower z- 
value tasks showed strong instructional 
effects and did not stabilize by Blocks 
3 and 4, while the high z-value tasks 
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Blocks of 30 Triols 
Fic. 3. Proportion of A, responses as a 
function of m value and instructions when 


Ss had four probability-learning problems 
with different m values (Exp. II-B). 
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TABLE 2 


ANALYSIS OF VARIANCE FOR TRIAL BLOCKS 
3 AND 4 or Ex». Il ror Ss Recetvinc Four 
CONCURRENT PROBABILITY-LEARNING 
TASKS WHERE r = .50, .70,.90, 

AND 1.00 UNDER ONE or 
"Two INSTRUCTIONAL 


Sets 
Source df MS F 
Be Ss (15) 
Instructions (A) 1 9.570 
ps 14 17.267 
Within Ss (12) 
Positions (B) 3 1308.633 | 62,87*** 
AXB 3 123,341 5.93 
B XSs/Groups 42 20.814 
Blocks (C) 1 4945 
Axe iG 1 20.320 3.16 
XSs/Groups 14 6.436 
BXC 3 22.966 5.08** 
AXBXC 3 18.799 | 4.16* 
B XCXSs/Groups 42 4.520 
Total 127 


*p <05. 
3 Sr 
"+p <.001. 


showed very slight instructional effects 
and were relatively stable by the sec- 
ond block of trials, 

Analyses of the four m values via a 
Neuman-Keuls range test showed that 
Ay probabilities for the ..90 and 1.00 
positions were not significantly dif- 
ferent, but both differed from the .70 
and .50 positions which also differed 
from one another (p< .01). The 
Block 3 and 4 P(A4i1) means were 
tested for deviation from z values by 
partitioning the data. The .50 speed, 
t (7) = 2.67, p <.05, 70 accuracy, 
t (7) = 4.26, p< 001, and both .90 
conditions, #(7) = 3.42, p <.02 and 
t (7) = 234, p < .06 for accuracy and 
Speed, respectively, overshot their re- 
spective + values, The 1.00 speed, 
t (7) = 2.55, p < 05, mean p(A1) was 
significantly less than 1.00 (primarily 
because of the low variance in this con- 
dition). Other means did not differ 
significantly from the z values. j 

Latency data.—Response latencies 
Were collected during Block 4; only 
latencies for the Ax responses were 
analyzed since a few Ss never made 
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the Az response during Block 4 at 
some task positions. The reciprocal of 
each S's mean A, latency at each task 
position was the dependent variable. 

Analysis of the II-A data showed 
that only the main effect of v level was 
significant, F (1, 14) = 6.87, p < .025; 
Ss responded faster to the two m = .90 
positions. Instructing Ss to emphasize 
speed or accuracy led to no significant 
effects. 

Reciprocal latencies for the II-B Ss 
are shown in Fig. 4. The linear trends 
of the regression lines were evaluated 
using orthogonal polynomials (even 
though the z-values were not evenly 
spaced, following a suggestion by 
Grandage, 1958) and were highly sig- 
nificant, F (1, 42) = 28.00, p < .001. 
The instruction effect was also sig- 
nificant, F (1, 14) =649, p < .025. 
There was a tendency for the slopes of 
the two regression lines to differ, 
though this did not reach a conven- 
tional level of significance, F (1, 42) = 
3.93, p < .10. 

Probability estimates.—The Ss’ esti- 
mates of E; probabilities are given in 
Table 3 along with (41) from Blocks 
3 and 4. Again, Ss' estimates of 
p(E1) were usually slightly closer to 
m values than their terminal 4; proba- 
bilities, and there was no discernible 
trend for A; overshoots to be associ- 
ated with high E; estimates. 


T volue 


Fic. 4, Mean choice speed as a function of m 
value and instructional set (Exp. II-B). 
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TABLE 3 


Ss’ ESTIMATES OF E; PROBABILITIES 
AND AVERAGE A; PROBABILITIES 


Accuracy Speed 
Instructions Instructions 
x $(E) | (4) | (E) | (Ay) 
Estimate| ,Ob, |Estimate| „Qb: 
Exp. I 
-70 642 | .738 642 | .593 
.90 .882 | .946 .893 | .959 
Exp. II-A 
.10 .696 | .790 .]19 | .805 
.90. 786 | .884 .848 | .910 
Exp. II-B 
.50 444 | .423 -500 | .621 
10 .712 | .854 .606 | .740 
-90 815 | .952 .871 | .960 
1.00 1.000 | .998 | 1.000 | .979 


The results of Exp. II are somewhat 
different from those of Exp. I. There 
was obviously some interaction among 
(or confusion among) the four tasks 
when the z values differed from posi- 
tion to position. Where, in Exp. I, 
Ss overshot when m = .90 but not when 
7 = 70, in Exp. II-A just the opposite 
occurred. In Exp. II-B, there was 
considerable interposition confusion, 
with all of the lower v values produc- 
ing some overshooting. 

While the latency data from I and 
II-A showed only that higher z-values 
produced shorter latencies, II-B dem- 
onstrated that this trend was highly 
linear (for z-values above .50). In 
all experiments differential instructions 
had the strongest effect on response 
probability for lower v-value tasks, and 
almost no effect when m was .90 or 
1.00. Differential effects of speed vs. 
accuracy instructions on response la- 
tency were demonstrated only in II-B. 


Discussion 


The choice data from these experi- 
ments seem to present some problems for 
stimulus-sampling theory. The data from 
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Exp. II-A and II-B show that the stimu- 
lus sets from the four tasks obviously 
overlap a good deal, but that the over- 
lap is not symmetrical. In Exp. II-A, 
the m= 70 asymptotes were “pulled up” 
much more than the r=.90 asymptotes 
were “pulled down." Similar results 
were obtained in II-B. One possible 
explanation might be that the .90 tasks 
were more highly discernable or domi- 
nant, and S confused the .70 task with 
the .90, thus overshooting on .70 and 
matching .90, but this argument does not 
seem reasonable when the subjective es- 
timates of z are examined—these are 
much closer at the .70 position than the 
90 position. Indeed, what Ss seemed to 
do in Exp. II-A was to overshoot their 
m estimates by the same extent for both 
values of r. Something like this also 
seems to have occurred in Exp. II-B, but 
the pattern is not so clear, Accuracy 
Ss matched their z estimates for the 
extreme z conditions (.50 and 1.00) and 
overshot their z estimates for the middle 
conditions (.70 and .90) while speed Ss 
overshot their z estimates in all condi- 
tions (except m= 1.00) to roughly the 
same extent. Just why Ss should over- 
shoot their z estimates by approximately 
10 percentage points for intermediate 
values of m is not at all clear, but it would 
be rather hard to predict from a simple 
form of stimulus-sampling theory. 
Again, for three of the four groups in 
Exp. I, Ss overshot their T estimates as 
well as actual z values, The explanation 
-for this cannot be generalization among 
stimulus populations having different 4 
values, since z was the same at all posi- 
tions. The random shifts from position 
to position make it difficult for S to 
retain and use information from se- 
quences of trials in making choices; this 
Seems to be sufficient to cause overshoot- 
ing. Overshooting has been shown to 
Occur when instructions emphasize that 
S is dealing with a random sequence of 
events, when instructions emphasize the 
desirability of maximizing success over 
blocks of trials, when differential rewards 
or payoffs are involved, and now, when 
sequential information is interfered with. 
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The latency data are clear and present 
no unusual problem. Speed instructions 
seem to have relatively little effect in 
probability-learning situations, especially 
when z values are high. On the other 
hand, z value seems to be an important 
determiner of choice speed. This finding 
is similar to the ^g-effect" in discrimina- 
tion tasks (Burke & Estes, 1957; La- 
Berge & Tweedy, 1964) and seems to be 
fairly general, though the linear trend 
found in Exp. II-B seems stronger here 
than in other studies. 
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Experiments were carried out to determine the effect of certain 
variables on rats' preference for signal-shock vs. shock-signal. 136 Ss 
were run daily for 11-hr. periods during which they received unavoid- 
able shock at predetermined intervals. When S was in one end of the 
experimental box the shock was preceded by a signal (light), in the 
other, S received shock-signal. The most marked preference for 
signal-shock was obtained with a 500-v. shock delivered 12-60 times 
per hour using a shock of .5- or 5-sec. duration and an 18-sec. signal. 
Significantly less time was spent on the signal-shock side with a 
decrease in signal duration to 3 or .5 sec. or a decrease in shock 
frequency to 2 per hour. A reversal of side preference followed a 
reversal in the sequence of signal and shock. A significant preference 
for signal-shock was obtained even when shock was delivered through 
ear clips in such a way that S's responses could not affect the part of 


S's body in contact with the electrodes. 


Knapp, Kause, and Perkins (1959) ; 
Lockhard (1963, 1965); Matsuyama 
(1958) ; and Perkins, Levis, and Sey- 
mann (1963) all obtained evidence 
that rats will select signal-shock in 
preference to shock which is not pre- 
ceded by a signal. The results of these 
studies and analogous experiments in 
which observing responses (Ro) were 
acquired without any external reward 
(e.g. Prokasy, 1956; Wyckoff, 1952) 
indicate that “information” about forth- 
coming events may be reinforcing. A 
detailed knowledge of the conditions 
under which information has a rein- 
forcing effect can do much to increase 
understanding of the process of rein- 
forcement, 

The present report describes a se- 
ties of experiments using essentially 
the same apparatus and basic proce- 
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dure as Perkins, Levis, and Seymann 
(1963). The study as a whole is in- 
tended to provide certain controls, to 
demonstrate that the preference for sig- 
nal-shock obtained by Perkins, Levis, 
and Seymann was due to availability 
of a signal before shock rather than 
certain other factors, to determine how 
acquisition of the preference is affected 
by certain temporal relationships, to 
determine if the preference is reversi- 
ble, and to get an indication whether 
the preference is dependent upon an 
opportunity for S to control the part 
of his body that touches the electrical 
contacts. 


METHOD 


Subjects—Data reported are from 136 
naive albino rats. Strain (Sprague-Dawley 
or Emory) and sex were equated within but 
not between experiments. Since there was 
no evidence of systematic effect of strain or 
sex these details are not reported. The 
mean weight of Ss was approximately 175 
gm. for all groups in Exp. I through IV; 
mean weight was 159 gm. for Exp. V. 

Apparatus —The same two 24 X 6X6 in. 
Plexiglas shuttle boxes with a grid floor 
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described by Perkins, Levis, and Seymann 
(1963) were used throughout but they were 
modified for some experiments as indicated 
below. These boxes were mounted on a 
fulcrum which tilted slightly when S moved 
from one end to the other. The two halves 
of each box were painted differently. In the 
first three experiments one end was black, the 
other white. In the last two experiments, 
one was painted with horizontal black and 
white stripes 1 in. wide (H) and the other 
with similar vertical stripes (V). A parti- 
tion could be placed in the middle to keep S 
on one side for “forced trials.” ^ Signal 
lights were located directly over the center 
of each half of the apparatus. 

Each tilt box was enclosed in a black 
wooden box containing a 3.5-cp. lamp ap- 
proximately 2 in. below, and to one side of 
the center of the grid floor. A white noise 
masked extraneous sounds throughout the 
experiment. All programming and recording 
were automated. 

One of the tilt boxes was modified and 
used for all Ss in Exp. V. The signal lights 
were moved off center and a slit cut from 
end to end through the middle of the roof of 
the box. The shock was delivered to S 
through Michel wound clips, one clipped into 
each of S’s ears. The wound clips were 
soldered to fine wire which hung from a point 
about 4 ft. directly over the middle of the 
tilt box. The wire within the box was 
coated with a mixture of quinine, alum, and 
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glycerin to discourage biting. A loop of 
light rubber was used to take up slack so 
that although the wires just reached through 
the slot to S’s ears, only a slight upward 
pull was exerted. 

Procedure.—Most aspects of the procedure 
were common to all experiments. Each 
day when the partition was in place S was 
put in one end of the box for 4 hr. and 
then in the other for the second 4 hr. Fol- 
lowing this the partition was removed and S 
received 10 hr. of free-choice time during 
which he had free run of the box. Shock 
was presented through an alternate bar sys- 
tem at predetermined times regardless of S’s 
behavior. When S was in one end of the 
box, the signal light above that half of the 
box preceded shock; when S was on the 
other end, that light followed shock. Except 
in Exp. V shock was 500 v. ac with a 1-meg- 
ohm resistance in series with S. In each case 
offset of one stimulus and onset of the other 
were simultaneous. Half of the Ss in each 
group were run during the day and half at 
night, also half had the signal-shock condi- 
tion on each side except in Exp. V where 
10 of the 16 Ss had signal-shock on H. All 
Ss had free access to food and water except 
when in the apparatus. 

Those aspects of the procedure not pre- 
viously noted, as well as some described 
above, which were different in different ex- 
periments, are summarized in Table 1. For 
example, the first two rows of Table 1 indi- 


TABLE 1 
SUMMARY OF PROCEDURE DIFFERENCES 
Experi i A D 
pose EA Diretion Dustin Spacing of Shocks Training 
a) 12 B-W a) 0.05 sec. 3 sec. irregular, 6 per 30 min. 6 
1D 12 b) 5.0 sec. 
a) 12 B-W -5 sec. a) .5sec.| irregular, 6 per 30 min. 6 
B d b) 3 sec. 
c 
n : c)18 sec. 
a) 12 B-W -5 sec, 18 sec. | a) every 1 min. 6 
b) 12 b) every 5 min. 
c) 12 c) every 30 min. ^ 
un 12 d) irregular, 6 per 30 min. 
12 H-V .5 sec. 18 sec. irregular, 6 per 30 min. | 3 original 
v 6 reversed 
a) 16 H-V -5 sec.^ 18 sec. irregular, 6 per 30 min. 1-4 


* Shock delivered through ear clips. 
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cate that there were two groups in Exp. I 


In Exp. V the shock was 110 v. for the 

first 4 Ss from which data are reported and 
140 v. for the next 12, During this time 9 
other Ss were run for one session, but their 
data discarded, 8 because S either cut the 
wire or pulled out the ear clips, and 1 
because a light bulb failed, 
In Exp. I-IV if the apparatus broke down 
or ran the wrong condition by mistake on 
the last day of training, data from that S are 
reported up to the last day. lí an error 
occurred earlier, data from that S were 
discarded. own data that were 
discarded. were similar to those retained. 


Resucts 


Figure 1 shows the results for both 
groups in Exp. I and the 3-sec. sig- 
nal group of Exp. II which had identi- 
cal training except for shock duration. 
Data on the last day for the 5-sec, 
shock group of Exp. T and the 3-sec. 
signal group of Exp. II are each based 
on 10 Ss, All other data in Exp. I- 
IV are based on 12 Ss. As can be 
seen in Fig. 1, the .05-sec. shock Ss 
did not acquire a clear preference, 
Only 7 of the 12 Ss in this group spent 
more time on the signal-shock side 
than on the shock-signal side on Day 6. 


Het = ue eae 


e—e 
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Fic. 1. Results of Exp. I: Effect of shock 
duration on percentage of time on the signal- 
shock side. 


Only 10 Ss in the other groups were 
run on Day 6. Nine and 8 of the 10 
Ss run 6 days spent over half their 
time on the signal-shock side on Day 
6 for the 5- and .5-sec., groups, respec- 
tively. 

In all experiments reported here a 
distribution of percentage of 1 day's 
time spent on the signal-shock side by 
individual Ss tended to be bimodal ; 
most measures were close to either 
100% or 096. All significance tests, 
therefore, are based on either the ex- 
pansion of the binomial distribution for 
comparison of the results of a single 
group with chance (50%), or the 
Mann-Whitney U test for between- 
group comparisons. Only the 5-sec. 
group differs significantly from chance 
behavior (p < .03, two-tailed test), 
Although the preference is not signifi- 
cant for the .5-sec. group, other data 
(Perkins et al., 1963) indicate that a 
signal-shock preference can be acquired 
under these conditions, None of the 
groups differed significantly in time on 
the signal-shock side on Day 6 (p> 
Jl). The tendency for percentage of 
time on signal-shock side to increase 
with shock duration is highly sugges- 
tive, but no such relationship is es- 
tablished, 

The results of Exp. II are presented 
in Fig. 2. Within the limits tested 
there seems to be a positive relation- 
ship between signal duration and per- 
centage of time on the signal-shock 
side, The proportion of Ss spending 
over half their time on this side on 
Day 6 was 4/12, 8/10, and 11/12 in 
the .5-, 3-, and 18-sec. groups, respec- 
tively. Only the latter group differs 
significantly from chance (p< O01). 
The difference between the .5- and 
18-sec. groups, and 3- and 18-sec. 
Sroups in percentage of Day 6 time on 
the signal shock side is significant, (p's 
< .01 and .05, respectively). The dif- 


| 
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and S-min. irregular interval groups 
and all 12 Ss in the 5-min, regular 


Fic. 3. Results of Exp. III: Effect of the 
intershock interval on percentage of time on 
the signal-shock side. 
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group spent over half their time on 
the signal-shock side on Day 1, All 
36 of these Ss spent over half their 
time on the signal-shock side on Day 6. 

The 30-min. group differs signifi- 
cantly from 5-min, irregular, 1-min., 
and 5-min. regular groups (p's < .05, 
02, and .01, respectively) in time on 
the signal-shock side on Day 6. The 
three groups with the shorter intertrial 
intervals did not differ significantly 
each other on Day 6. As one 
results for the 


their time on the signal-shock and/ 
or the black side on Day 6. Every § 
spent over half its free-choice time on 


as it is for the other groups. For 
purposes of illustration let us assume 
that the preference for the black side 
of the box over the white depended 
on differential reinforcement (or util- 
ity) of one unit per minute and that 
the preference for signal over 
shock-signal is equal to 10 rable 
units per shock presentation, Then, 
if the effects of color preference and 
signal can be added algebraically, 
shock-signal in black should have 
higher reinforcing properties (greater 
utility) than signal-shock in white 
when shocks are presented only once 
every 30 min. [(1U x 30) - 10U = 
*20U]. However, if shocks are pre- 
sented five times per 3 hr., shock-signal 
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in. black would be less reinforcing 
than signal-shock in white [(1U x 30) 
— (10U x 5) = —20U]. 

The results of Exp. IV are shown 
in Fig. 4. Both the acquisition and 
reversal were marked and significant. 
The smallest percentage of time on the 
signal-shock side for an individual .$ 
was 66 and 93 for the third day of 
training and sixth day of reversal, 
respectively. 

Additional experiments were run 
with six 500-v. shocks every 30 min. 
to determine if the same results would 
be obtained with different signals. The 
major results of these experiments 
were: (a) when the signal was the 
onset of both signal lights rather than 
just the one at S’s end of the box, a 
significant preference for the signal- 
Shock condition was acquired with an 
18-sec. signal, (b) when both lights 
on the roof of the box were on during 
the intertrial interval and the signal 
was lights off for 18 sec., a significant 
preference for the signal-shock over 
shock-signal was acquired, Kc a 
marked but. nonsignificant tendency 
towards a signal-shock preference was 
acquired with a 3-sec. lights-off signal, 
(d) lights-on signal produced a more 
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Fic. 4. Percentage of time on the signal- 
Shock side during training and reversal in 
Exp. IV. 
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marked preference than lights-off when 
signal duration was 18 sec., but no 
evidence of a stronger preference with 
lights-on than with lights-off was ob- 
tained with a 3-sec. signal. It appears 
that a preference for signal-shock does 
not depend on the particular signal or 
a difference in the stimulus conditioris 
during shock (e.g. presence vs. ab- 
sence of light). 

Thirteen of the 16 Ss in Exp. V 
spent more than half of the time during 
the last hour of Day 1 on the signal- 
shock side, This preference which was 
equally marked with signal-shock on 
either side of the box is significant 
(P <.02). Of the 10 animals that 
completed the second day’s training 
with the ear clips still properly at- 
tached only 5 spent over half the last 
hour on the signal-shock side. How- 
ever, 3 Ss completed 3 days and 2 
Ss completed 4 days, and the shortest 
time spent on the signal-shock side by 
any S during the last hour on Day 3 
or 4 was 58 min, 

Except for responses to the ear clips 
(eg. scratching) the behavior of Ss 
was strikingly similar to that of Ss in 
the preceding experiments. There was 
relatively little activity most of the time 
and an increased tendency to freeze 
during the signal especially just before 
shock. No seizurelike behavior was 
observed at any time. 

The strength of the preference for 
the signal-shock condition in this ex- 
periment and in preceding experiments 
should not be compared. Informal evi- 
dence and logical considerations indi- 
cate that at least with very low shock 
intensity little or no preference for 
signal-shock is obtained. The shock 
used in Exp. V was less intense than 
the shock used in the other experi- 
ments. However, the relationship be- 
tween mode of delivery (eg., grid or 
ear clips) and strength of preference 
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has not been studied, and it probably 
has an effect on the strength of the 
tendency to select signal-shock, too. 

The results of Exp. V do show that 
the preference for signal-shock may be 
acquired even when it is unlikely that 
gross postural responses affect the area 
of contact between S and the elec- 
trodes. Apparently at least some of 
the shock-preparatory responses which 
presumably minimize the aversiveness 
of shock are internal. 

The greater effectiveness of an 18- 
sec. signal than a 3-sec. signal obtained 
in Exp. II and supported by interex- 
perimental comparisons increases our 
confidence in the primary role of inter- 
nal preparatory responses. Three sec- 
onds should be long enough for gross 
postural responses but does not seem to 
be long enough for optimal preparatory 
responses. 


Discussion 


Our results and those of Knapp, Kause, 


and Perkins (1959), Lockhard (1963, 


1965), Matsuyama (1958), and Perkins, 
Levis, and Seymann (1963) indicate that 
rats acquire a preference for signal- 
shock over shock not preceded by a sig- 
nal. There does not appear to be a single 
report of results indicating that unex- 
pected shock is less aversive than shock 
preceded by signal. The preference of 
human Ss for immediate over delayed 
shock (Cook & Barnes, 1964; D’Amato 
& Gumenik, 1960) also support this in- 
terpretation. 

The acquisition of observing responses 
(Ro) in tasks terminating in food reward 
by pigeons (Wyckoff, 1952) and rats 
(e.g, Prokasy, 1956) and in rats where 
terminal reward is water (Levis & Per- 
kins, 1965) all in the absence of dif- 
ferential external reward, like the shock 
studies, support the interpretation that 
classically conditioned responses prepare 
S for the UCS. In other words the 
stimulus situation apparently is relatively 


more reinforcing at the time of UCS 
presentation if it has been preceded by 
a conditioned preparatory response. 

So far as we are aware there are no 
reports of Ss learning to avoid either a 
signal preceding unavoidable shock or 
an observing response (i.e, a signal pre- 
ceding a positive reinforcer). At the 
same time there are many instances of 
instrumental responses being acquired 
when they are differentially reinforced 
only by a signal to which preparatory 
responses may be conditioned. 

Such findings are to be expected. from 
a single-factor differential reinforcement 
theory of behavior change (Perkins, 
1955). This point of view implies that 
Ss will acquire a response which pro- 
vides a signal (CS) before presentation 
of any stimulus which could serve as a 
UCS for classical conditioning. 

While a two-factor theory such as 
Skinner's (1938) is compatible with the 
data it does not lead one to expect a 
preference for the signal condition. Simi- 
larly, contiguity theory and those varia- 
tions of reinforcement theory in which 
reinforcement is defined a priori (e.g., as 
drive reduction) would not lead one to 
expect selection of the response followed 
by a signal before a UCS in preference 
to the UCS without a signal. 

A reinforcement theory in which re- 
inforcing properties are determined a 
posteriori by measures of "choice" be- 
havior does imply that Ss will acquire 
a preference for the signal condition. 
"Choice behavior" as the expression is 
used here includes choice of responses 
such as "freezing" and making the GSR 
to stimuli which precede shock. Such 
response selection implies that shock 
while frozen during the GSR is less 
aversive (relatively more reinforcing) 
than shock in the absence of these “re- 
sponse-produced” stimuli. Thus, the sig- 
nal-shock condition will come to be se- 
lected because after conditioning the 
stimulus situation which follows it is 
more reinforcing (less aversive) than 
that which follows its nonoccurrence. 

Thus, the finding that Ss prefer sig- 
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nals to which preparatory responses can 
be made before those stimuli which can 
serve as UCS provides support of a one- 
factor differential reinforcement theory 
of behavior change. 
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Tabulation of errors in syllogistic reasoning has led to alternative 
interpretations in terms of atmosphere effect and invalid conversion, 


but the data and the interpretations overlap. Syllogisms 


were 


arranged in separate scales to test each error separately, and different 


groups got brief training directed 


against each error separately. Both 


errors were measured with fair reliability, and in untrained groups the 
2 were uncorrelated. Antiatmosphere training clearly reduced atmos- 
phere errors, and anticonversion training apparently reduced conversion 


errors. 


Psychologists have noted for many 
years that intelligent adults make fre- 
quent errors in formal syllogistic rea- 
soning, and two serious attempts at 
interpretation have been published. 
Woodworth and Sells (1935) and 
Sells (1936) interpreted most of the 
errors in terms of an atmosphere ef- 
fect; Chapman and Chapman (1959) 
stressed invalid conversion. 

These interpretations have been 
based on comparisons between syllo- 
gisms of various types in respect to 
frequency of errors. Considerable 
overlap can be seen in the frequencies 
reported by Sells and by Chapman and 
Chapman for comparable syllogisms in 
spite of a difference in format. There 
is considerable overlap also in the er- 
rors predicted by the two hypotheses. 
Thus this method probably has little 
more to contribute to the evaluation of 
the alternative hypotheses. The situa- 
tion now calls for experimental manip- 
ulations from which different effects on 
errors can be predicted. 

Several experiments (see Johnson, 
1966) have shown that brief general 
training reduces logical errors, hence 
the present experiment was designed to 
study the results of differential train- 
ing. One kind of training was de- 


signed to reduce errors attributable to 
the atmosphere effect and another to 
reduce errors attributable to invalid 
conversion. 

This plan requires two blocks or 
scales of syllogisms in which the two 
error effects operate separately. Wood- 
worth and Sells (1935) described the 
atmosphere effect in general terms as 
reaching a conclusion to a syllogism 
through a global impression of the 
premises. They added two supplemen- 
tary principles to account for discrep- 
ant results, but these were contradicted 
by the data of Chapman and Chapman 
and may therefore be eliminated. Thus 
the atmosphere hypothesis can now be 
written in specific language: When 
both premises of a syllogism contain 
the same qualifier, “are” or “are not,” 
or the same quantifier, “all” or “some,” 
many Ss will accept a conclusion which 
also contains the common term. Re- 
examination of the Chapman and 
Chapman study shows that this restate- 
ment is not inconsistent with their re- 
sults. With the further elimination of 
syllogisms that confound atmosphere 
and conversion effects, the atmosphere 
hypothesis alone predicts for EE prem- 
ises an E conclusion, for OO premises 
an O conclusion, for II premises an I 
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conclusion, for IO premises an I or an 
O conclusion, and for EO premises an 
E or an O conclusion. (See Cohen & 
Nagel, 1934.) Thus there are five syl- 
logisms for a separate test of the at- 
mosphere hypothesis. 

Although many syllogisms give op- 
portunities for invalid conversion, 
there is only one, AO, that can disclose 
this error uncontaminated by atmos- 
phere error. It can be written in sev- 
eral variations, however. 

To test this two-error interpretation 
one group of Ss was given a descrip- 
tion of the atmosphere error, a warning 
against it, and some practice in avoid- 
ing it. Another group was given par- 
allel training against the error of in- 
valid conversion. Then these groups 
and appropriate control groups were 
given both error scales. 


METHOD 


Subjects —The experiment was conducted 
in recitation sections of 20-30 students each 
that met once a week as part of a large 
introductory psychology course. Distribu- 
tion of students to sections was part of the 
mechanics of registration; There was no 
reason to expect a systematic difference in 
reasoning ability. Two sections were as- 
signed to each of four conditions, and one to 
a general warning condition. There were 32 
Ss who stated that they had had some pre- 
vious experience with syllogistic reasoning 
and were therefore excluded from the 
analysis. Thus there were 190 Ss in five 
conditions. 

Materials.—The atmosphere scale consisted 
of indeterminate syllogisms with five alterna- 
tive conclusions, one of which was an atmos- 
phere error. The five syllogism types men- 
tioned above were varied in order of terms 
and propositions so as to make a scale of 12 
syllogisms, The conversion scale consisted 
of 12 AO syllogisms, varied in order of 
terms and propositions, with five conclusions, 
one of which was an invalid conversion. 
Eleven determinate syllogisms were included 
as filler problems. The three scales were 
printed in irregular order. 

Procedure.—All groups received the same 
general instructions, similar to those used in 


previous studies, giving the nature of a 
syllogism, an illustration on the board, and a 
clarification of “some.” Group 1, the anti- 
atmosphere group, received a description of 
the atmosphere error and six practice 
syllogisms. Practice was confined to AE 
and AI syllogisms to avoid those that would 
appear later on test scales. Group 2, the 
anticonversion group, received a description 
of invalid conversion and practice with six 
premises and conclusions to be judged true 
or false. Group 3 received a general warn- 
ing to control for the possibility that any 
warning coupled with familiarity with any 
type of syllogism might result in an improve- 
ment. Group 4 received the same warning 
as Group 3, then the same practice syllogisms 
as the antiatmosphere group. This provided 
a check on the combined effects of a non- 
specific warning and brief practice. Group 5 
received only the general instructions. All 
groups then took the test of 35 syllogisms 
with five alternative conclusions. 


RESULTS AND Discussion 


Some general features of the results, 
bearing on the validity of the scales, 
should be noted first. As expected, the 
determinate syllogisms were easier 
than the indeterminate syllogisms, for 
which the only logical conclusion was 
“none of these.” Mean number right 
was 7.16 (out of 11) or higher, as com- 
pared with 5.28 (out of 12) or lower. 
The construction of the two indeter- 
minate scales was fairly successful in 
that about 70% of the errors on the 
atmosphere scale were the predicted at- 
mosphere errors, and about 60% of the 
errors on the conversion scale were the 
predicted conversion errors. 

The correlation data from the three 
groups without special training add an- 
other kind of check on the construction 
of the scales. Each S was given a 
score for atmosphere errors on the at- 
mosphere scale, for conversion errors 
on the conversion scale, and for total 
errors on the determinate scale. Cor- 
relations computed separately for 
Groups 3, 4, and 5 were very similar, 
so these were combined to make à 
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TABLE 1 
MEAN Scores or Five TRAINING GROUPS ON THREE SYLLOGISM SCALES 


Deter- 
minate 


Atmosphere Conversion 


Training Group N 


Number | Number | Atmosphere 
Right i 


igl Right 


Other 
Errors 


Number | Conversion 
Right Errors 


Other 


Errors Errors 


Antiatmosphere 40 | 7.78 5.28 
Anticonversion 48 | 7.50 2.13 
General Warning 23 | 7.83 3.00 


Warning and 32 7.16 2.72 
Practice 
Control 47 | 8.60 | 2.60 


4.75 1.97 2.23 6.92 2.75 
6.41 2.86 2.21 6.06 3.73 
6.95 2.05 1.39 5.96 4.65 
5.88 3.40 94 5.50 5.56 


6.79 2.61 5j! 7.417 3.52 


larger group of 102. Reliability coeffi- 
cients for this large group, computed 
by the odd-even method and the Spear- 
man-Brown formula, were .64 for at- 
mosphere errors, .80 for conversion 
errors, and .68 for determinate syllo- 
gisms. Justification for treating at- 
mosphere and conversion errors sep- 
arately comes from the correlation 
between the two, which was only —.12. 
Apparently susceptibility to one type 
of error is practically independent of 
susceptibility to the other. Errors on 
determinate syllogisms correlated .08 
with atmosphere errors and —.51 with 
conversion errors. (Correlations above 
.26 are significant at the .01 level in 
this group.) Good performance on a 
conventional test of logical reasoning 
ability, like the present determinate 
scale, does not predict resistance to 
these two special types of logical error. 

The means for the five groups are 
presented in Table 1. The main hy- 
potheses were evaluated by analysis of 
variance (Table 2) followed by the 
Duncan range test as extended by 
Kramer to the case of unequal n’s. 
Each scale was analyzed separately. 
The significant F on the determinate 
scale was due to Group 5, which was 
significantly higher than Group 4 (p 
< .005) and Group 2 (p < .05). The 
superiority of Group 5 does not, as 


noted above, predict superior perform- 
ance on the other scales. 

Analysis of the number right on the 
atmosphere scale yielded a significant 
F. As expected, Group 1, with anti- 
atmosphere training, was significantly 
different from each of the other four 
groups (p < .005). Another analysis 
by Scheffé’s test verified this difference 
at the .001 level. 

Atmosphere errors yielded a signifi- 
cant F between groups. According to 
the Duncan range test Group 1 was 
significantly different from Group 3 
and from Group 5 (p < .01) and also 
from Group 2 (p < 05), which re- 
ceived anticonversion training. Fur- 
ther analysis by Scheffé’s test sepa- 
rated Group 1 from the other four 
(p < 001). 

The results for the conversion scale 


TABLE 2 


ANALYSIS OF VARIANCE OF CERTAIN SCORES 
FROM THREE SYLLOGISM SCALES 


Deter. | Atmos. | Atmos. | Conv. 
Right Right | Errors | Errors 
Source | df 
F T F F 
Gi 4| 323* 5.18** | 398** 3.08* 
Within 185 | (3.68) |(10.78) | (6.81) (10.55) 
lean. 
juare) 
*p < 05. 
** < 01. 
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are less clearcut. The number of er- 
Tors was very large, hence analysis of 
variance of the number right was not 
feasible. Analysis of conversion errors 
yielded a significant F, due chiefly to 
Group 5, which made significantly 
more errors than Group 2, 3, or 4 
(p < .05). 

In general differential training gave 
differential results. Neither the at- 
mosphere error alone nor the conver- 
sion error alone can account for these 
findings. The effect of the antiatmos- 
phere training was quite clearcut since 
this group after training was signifi- 
cantly low on atmosphere errors and 
not on conversion errors. The conver- 
sion errors were measured as reliably 
as the atmosphere errors but the anti- 
conversion training was not so clear- 
cut. The nonspecific warning of 
Groups 3 and 4 also reduced conver- 
sion errors. 

It is possible that the anticonversion 
training was not as well planned and 
executed as the antiatmosphere train- 
ing, and that further research will im- 
prove the anticonversion training. The 
intent of the present study was to make 
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each kind of training as effective as 
possible in 15 min., but it was neces- 
sary to keep each kind of warning 
specific to one error, eg. to avoid 
warning against the atmosphere error 
during the anticonversion training. It 
is also possible that the conversion 
error has not yet been properly iden- 
tified and can best be studied after the 
atmosphere error has first been re- 
moved. Differential training appears 
to be a useful method for analysis of 
such abstract performances. 
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TEMPORAL FACTORS IN VERBAL SATIATION * 
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The rate of development and decay of verbal satiation were studied 
with a search-time technique, where increased search time for the 
target word defined satiation. The search field consisted of the target 
word, which was either the repeated stimulus word or its primary 
associate, embedded in a random array of words low in meaningfulness. 
6 acquisition and decay durations varying from 0 to 120 sec. were used. 
Search time increased with repetition duration and decreased rapidly to 
control level after the cessation of the repetition. The associated 
words took longer to find than the stimulus words. The hypothesis 
was offered that these changes could be due to transformations in the 
structural character of the stimulus word induced by the repetition 


procedure. 


Meaningfulness of a word has been 
defined by Osgood as the polarity rat- 
ing on a scale of various connotative 
dimensions (Osgood, Suci, & Tannen- 
baum, 1957), and by Noble (1952) as 
the number of association responses 
emitted in a l-min. interval. These 
measures have been found to intercor- 
relate well, however much the theoret- 
ical positions from which they stem are 
at variance (Staats & Staats, 1960), 
and have each been used to index a loss 
of meaningfulness of a word subse- 
quent to its continued repetition. The 
literature on this effect (termed verbal 
satiation) using these measurement 
techniques is not yet a solid one, but 
has reported generally positive enough 
results to establish the existence of the 
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phenomenon (see, e.g. the recent re- 
view by Amster, 1964). 

In the present experiments, it was 
assumed that if the phenomenon is 
real it should yield an acquisition and 
decay function along a temporally de- 
fined parameter. A search-time tech- 
nique was chosen for this purpose since 
it offered a way of testing S immedi- 
ately and without interruption after 
the repetition task without calling at- 
tention to the satiation phenomenon 
itself. 

The assumption was made that the 
more meaningful a word is, the easier 
it will be to find in an array of other 
words low in meaningfulness, and ver- 
bal-satiation effects would be detected 
in an increased latency measure. The 
basis for this assumption is provided 
by Portnoy, Portnoy, and Salzinger 
(1964), who have shown that high- 
association-value nonsense syllables 
embedded in a random array of low- 
association-value nonsense syllables are 
maximally discriminable, as reflected 
in a significantly shorter search time 
than the other three possible field- 
target combinations of high- and low- 
meaningful nonsense syllables. Since 
S must indicate the location by point- 
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ing to it, the influence of response bias 
(Goldiamond & Hawkins, 1958) is 
minimized, 

The present experiments on the ef- 
fect of varying acquisition and decay 
times on search time was carried out 
after a series of preliminary experi- 
ments by the present author demon- 
strating that a 1-min. repetition period 
did in fact increase search time for 
both the repeated word and its asso- 
ciates, and did not yield this effect for 
an intervening unrelated monotonous 
task (Hubbard, 1964). 

Definitions of terms—Meaningful- 
ness will be defined as the time taken 
to identify the target word in a random 
array of other words low in meaning; 
a loss in meaningfulness with the repe- 
tition procedure will be reflected in an 
increase in time taken to identify the 
word, Meaning will be independently 
defined as the time required to identify 
in the search stimulus a primary asso- 
ciate of the repeated word; a loss in 
meaning will be reflected in increased 
search time for a word associated to 
the repeated word. 


METHOD 


Stimulus material. —Six two-syllable nouns 
(Stimulus words) were selected from the 
Minnesota normative data (Russell & 
Jenkins, 1954) along with their primary 
associate (High-Associate words), and were 
embedded as target words in a field which 
consisted of 49 dissyllable words low in 
meaningfulness (m) (Noble, 1952). The m 
values for the words used ranged from .99 to 
2.69. ‘The reason for the selection of these 
words low in meaningfulness for the field 
was the hypothesis that, under normal or 
control conditions the target word, which is 
considerably higher in meaningfulness than 
its surround, would be readily discriminated 
from the low-meaningful surround; and that 
if, in fact, a word loses its meaningfulness 
during the repetition task it would be more 
difficult to find in this field, since discrimin- 
ability is one of the parameters of search 
time. The particular list also suits the 
problem because of the variability in word 


DOROTHY HUBBARD GAMPEL 


length, which allowed some variability in 
length of the target words selected. 

Each Stimulus and its High Associate con- 
stitute a Word Set, with the Stimulus and 
High-Associate components belonging to dif- 
ferent Word Categories. Thus there were 
six Word Sets, each having two Word Cate- 
gories, yielding a total of 12 search tasks. 
The target words were CARPET, BUTTER, 
HAMMER, MOUNTAIN, NEEDLE, and LION ; their 
respective high-associate words were RUG, 
BREAD, NAILS, HILL, THREAD, and TIGER. The 
mean log usage frequencies (Thorndike & 
Lorge, 1944) were 2.2423 for the Stimulus 
Words and 2.3375 for the Association Words. 

The slides presenting the stimulus material 
for the search were prepared from 5X8 
index cards divided into a 5 X 20 array of 
100 possible cells, 50 of which were randomly 
selected 12 times to be filled by the one 
target word and the 49 field words. An 
example of one stimulus array, where the 
target word is the word HAMMER, is pre- 
sented in Fig. 1. Different arrays of these 
50 cells were selected for the 12 different 
targets, as was the order of the dissyllable 
words filling the cells, so that 5s would not 
learn during their 12 search tasks the loca- 


TAROP FERRULE  SAGROLE ICON BALAP 
TUMBRIL NEGLAN 
GRAPNEL XYLEM ULNA 
WIDGEON VOLVAP ENDIVE 
WELKIN BRUGEN 
MATRIX ROMPIN 
GOKEM CAROM MEARDON 
LATUK 
RENNET POLEF 
FEMUR ATTAR 
QUIPSON DAVIT FLOTSAM GOJEY | NOSTAW 
OVUM JETSAM — KUPOD 
ZUMAP TARTAN 
LEMUR NIMBUS — CAPSTAN BODKIN 
KAYSEN 
NARES HAMMER 
LICHENS LOZENGE GAMIN 
MAELSTROM DELPIN 
PERCEPT BYSSUS STOMA 


Fic. 1. Sample stimulus array for search 
where the target is the stimulus word 
HAMMER, 
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tions of the field words. The 50 words were 
randomly distributed in these arrays with 
the restriction that each of the 6 sections into 
which the stimulus field could be divided in- 
cluded one Stimulus word and one High- 
Associate word, so that any difference found 
in search time for these word categories 
could not be attributed to a location variable. 
The words were typed in Executive Type 
from an IBM electric typewriter. 
Design.—A 6 X 2 factorial design was set 
up, with the 6-level factor being duration 
(repetition duration, in the acquisition part 
of the experiment, and delay duration, in the 
decay part of the experiment), and the 2- 
level factor being Word Category. Each 
S received all 12 search targets (6 Stimulus 
words, 6 High-Associate words) after differ- 
ent durations, the order of which was 
randomized 6 times (through six Ss) with 
the restriction that the two words of a given 
word set were separated by at least one task 
involving a different word set. 

First the word sets were randomly as- 
signed to durations for the six S sets, and 
word set, word category, and duration were 
then independently randomized for ordinal 
position of the task. 

Subjects.—The Ss were 72 undergraduate 
students (39 males and 33 females) cur- 
rently enrolled at the City College of New 
York who volunteered for the experiment. 
Median age was 20 yr., with a range from 
17 to 34. All had uncorrected or corrected 
20/20 vision. Only students for whom 
English is the native language were used. 

Apparatus.—The S projected the 8 X 12 in. 
stimulus array when instructed to do so by 
means of a telegraph key which controlled 
the lamp of a Land Polaroid slide projector, 
and released through a relay the clutch of a 
Standard Electric clock calibrated in .01 sec. 
When S released the key to point to the 
target, the clock stopped, thus yielding the 
search-time measure. The E controlled the 
slide projector with a hand switch as S re- 
leased his key so that the image remained 
while he indicated by pointing to the target 
word.  Repetition and delay durations were 
timed with a stopwatch. The pacing of the 
repetition was done with a Lafayette electric 
metronome (Model MP-100-PA) set at 120 
beats/min. 

Procedure—After being seated next to the 
telegraph key and approximately 2 ft. from 
the projection wall S was instructed that his 
task was to press down on the key which 
would project the image of the target, and 
to hold the key down until ready to indicate 


the target word by pointing with the same 
hand. He was given several practice trials 
on the key-pressing task alone, with no slide 
in the projector. A slide containing the 
practice target word TABLE was then inserted 
and $ was instructed to find "either the word 
TABLE or a word related to it" The kind of 
relatedness to expect was illustrated by the 
example: GRASS-GREEN. The other two prac- 
tice trials contained the words carr and 
CLOTH. 

The intervening repetition task and the 
desired 2-sec. rate was then explained. For 
the Acquisition experiment, Ss were told to 
press down on the key as soon as the repeti- 
tion completion was signaled by E. For the 
Decay experiment, Ss were instructed to 
stop the repetition when signaled to do so, 
and to wait for a second signal before press- 
ing the key. Both sets of Ss were instructed 
to look at the 5X8 card containing the 
word while repeating it. Durations for both 
experiments were 0, 5, 15, 30, 60, and 
120 sec. 

Special instructions, required for the zero- 
duration conditions, were given prior to the 
presentation of the task word. The Acquisi- 
tion Ss were told that they were to press 
the key down as soon as the target word set 
was identified rather than repeating it first, 
and the Decay Ss were told to press the 
key down immediately on completion of the 
repetition. 

The assignment of Ss to the acquisition and 
decay parts of the experiment was alternated 
throughout the running of Ss, Running time 
for the acquisition session was about 25 
min, for the decay session about 35 min. per 
S. The slides were arranged in the order 
to be used for a given S in advance of the 
session and were changed manually by E 
between trials. 


RESULTS 


Duration effects —Figure 2 presents 
the means of the log transformed data 
for the combined word categories 
(Stimulus and High Associate). 
Search times significantly increased 
with repetition duration, F (5, 25) = 
4.75, p « 0l, and subsequently de- 
creased with delay duration, F (5, 25) 
= 3.39, p < 05.2 The essential homo- 
geneity of the standard deviations in- 


3 Computational formulas were obtained 
from Scheffé (1959). 
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dicated that the log transformations 
made the data adequately suitable for 
the analysis of variance procedure. 
There were no significant differences 
between the means involved in the di- 
rectional shifts either in Fig. 2 or Fig. 
3, tested by Scheffé’s method of mul- 
tiple comparisons of means (Scheffé, 
1959), implying that these acquisition 
and decay functions can be described 
as monotonic. 

Word-category effects—Mean val- 
ues for the two word categories are 
presented in Fig. 3, where it can be 
seen that search time for the High- 
Associate words is consistently higher 
than for the Stimulus words, with the 
exception of one point in the Decay 
function. This elevation of the asso- 
ciated words is significant in the Ac- 
quisition series, F (1, 5) = 16.41, p< 
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Fig. 2. Mean log search time for all target 
words combined as a function of Repetition 
Duration, with delay time prior to the search 
task constant at zero seconds (A), and Delay 
Duration prior to the search task, with Repe- 
tition Duration constant at 60 sec. (B). 
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Fic. 3. Mean log search time for stimulus 
and high-associate words as a function of 
Repetition Duration, with delay time prior 
to the search task constant at 0 sec. (A), 
and Delay Duration prior to the search task, 
with Repetition Duration constant at 60 
sec. (B). 


-0l, but did not reach significance in 
the Decay, F (1, 5) — 3.49, p > .05. 
The interaction between durations and 
Word categories is not significant in 
either series, indicating parallel tem- 
poral effects on the search-time meas- 
ures for these two word categories. 
Word-set effects—Word-set effects 
Were not significant in either series, 
although significance is approached in 
the Decay series, F (5, 360) = 2.03, $ 
> 05 


Discussion 


Changes in meaningfulness.—Meaning- 
fulness as it has been defined here de- 
creased as a function of length of repe- 
tition, reaching a maximum at 60 sec., 
and was regained sometime between 5 
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and 15 sec. thereafter. The fact that 
there is no clear change in the acquisition 
function until the 60-sec. repetition point 
is difficult to reconcile with other data 
reporting satiation after durations rang- 
ing from 5 to 30 sec. but is probably due 
to task differences, to be discussed below. 
In fact, at the earlier duration, there is 
some suggestion of semantic generation, 
an enhancement of meaningfulness of a 
word with its repetition (Amster, 1964), 
but this is not statistically substantiated. 

The rapid decay time is in accord with 
Paul’s (1962) finding, using m (number 
of associates given) as the measure, and 
is again different from the slow dissipa- 
tion time (more than 5 min.) reported 
by Lambert and Jakobovits (1960) using 
semantic differential ratings. This fast 
recovery time is probably due to the high 
strength of the original visual and asso- 
ciative response hierarchy established and 
maintained throughout a long verbal his- 
tory, and may well be a similar process 
to the recovery of “unlearned associates” 
seen in paired-associates learning experi- 
ments (see, e.g., Turnage, 1963). There 
is no reason to assume that the recovery 
time for the aspect of meaningfulness 
tapped by the semantic differential in the 
Lambert and Jakobovits experiment must 
be the same as yielded by other measure- 
ment operations. 

Changes in meaning—The functions 
shown in Fig. 3 and the absence of a 
significant interaction term would war- 
rant the conclusion that meaning of the 
words was altered along with meaning- 
fulness by the repetition operation. The 
consistently longer search times for the 
High-Associate words than for the Stim- 
ulus words (a difference significant at 
the .01 level in the acquisition series but 
one which did not reach significance in 
the decay series) indicate that, within 
the framework of this particular task the 
word itself is easier to find. 

Effect of task—The present experi- 
ment is the only one in the verbal satia- 
tion literature in which a simple identifi- 
cation task is the dependent variable; the 
word-association and rating-scale tasks 
depend on response emission or judgment 


processes. The fact that the word had 
only to be identified, not emitted, may 
have reduced the likelihood of detection 
of satiation effects at the earlier dura- 
tions, and contributed to the rapid dissi- 
pation time. It would seem that the 
stimulus and task variables involved in 
operations measuring such a complex 
process as verbal meaningíulness would 
necessarily contribute to the variance. 
For example, the rate and amount of 
detectable verbal satiation has been shown 
to be reduced by the number of compo- 
nents contributing to meaningfulness in- 
volved in the satiation operation, by 
Miller (1963), where the parameter was 
the performance of a concurrent response, 
and by Jakobovits and Lambert (1964), 
where the continuum was the number of 
relevant cues in the repeatedly exposed 
stimulus. The present data, regrouped 
on the basis of D, an entropy measure 
(Laffal, 1955), indicated a similar trend 
toward less repetition effects on the 
words having a greater D value. Thus, 
since the measuring operations used here 
and in the literature to date involve such 
different tasks, it would seem unrealistic 
to expect that they would yield exactly 
the same temporal values, although phe- 
nomena identified should be comparable 
(Staats & Staats, 1960). 

Structural transformation with repeti- 
tion: A hypothesis for sanitation effects. 
—The explanation for satiation data 
through other measures in the literature 
has typically been in terms of reactive 
inhibition, either of some central media- 
tional process (Lambert & Jakobovits, 
1960) or of the individual associate 
words assumed to be subvocally emitted 
during the repetition procedure (Smith 
& Raygor, 1956), depending on the 
theory of meaning setting the criteria 
for the design and interpretation of the 
experiment. The perceptual task used 
here enables the alternative hypothesis 
that the transformations in the stimulus 
word as it is repeated, similar to that 
reported by Warren (1961) and by Axel- 
rod and Thompson (1962), set the occa- 
sion for words other than itself or its 
associates. The Ss reported both audi- 
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tory and visual fluctuations between the 
whole word and its component parts 
(eg. BUTTER changed to Bur and HER; 
CARPET to CAR and PET). These changes 
would thus account for the increased 
search times, since the effective stimu- 
lus for the identification tasks was at 
least momentarily absent. If the effect 
achieved here and termed satiation is in 
fact due to this stimulus disorganization, 
search times for the word components 
and their associates should decrease with 
repetition as the search times for the 
entire word and its associates increase. 
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ADAPTATION TO PRISMATIC DISPLACEMENTS: 


HAND POSITION AND TARGET LOCATION * 


ROBERT WM. SEKULER? Ax» JOSEPH A. BAUER, Jr. 
Massachusetts Institute of Technology 


Interpretation of studies of prismatic changes in the apparent rela- 
tionship between hand and eye may be complicated by possible sources 
of artifact, including failure to control the position of S's hand during 
prism viewing. In the study reported S used a stylus to mark the 
location of virtual image targets before and after viewing the marking 
hand through prisms. Differences between pre- and postexposure 
markings measured the effect of the intervening exposure. Inde- 
pendent variables included the direction in which prism displaced seen 
objects, position of the marking hand during exposure, and location of 
the targets to be marked. Although all 3 factors proved highly 
significant (p <.005), the effect of prism orientation was about 4 times 
as great as the other effects. The implications of these findings for 


research on prism adaptation are discussed. 


Studies in several laboratories have 
used a technique like that of Held and 
Gottlieb (1958) to measure adaptation 
to optical transforms of the relation- 
ship between eye and hand. Such ex- 
periments include those of Held and 
Hein (1958), Hamilton (1964), Ham- 
ilton and Bossom (1964), Pick and 
Hay (1964), Weinstein, Sersen, and 
Weinstein (1964), and  Mikaelian 
(1963). In the Held-Gottlieb proce- 
dure S first uses a stylus to mark the 
apparent position of virtual image 
targets. A wedge prism is then put 
before S’s eye and an exposure period 
follows in which S views his hand 
through the prism. After the prism is 
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removed S again marks the position of 
the targets. Systematic differences 
between the pre- and postexposure 
markings provide a measure of the 
consequences of the intervening prism 
condition, 

Despite the widespread use of the 
Held-Gottlieb procedure and the grow- 
ing interest in hand-eye coordina- 
tion there remain several unexamined 
sources of variance which may produce 
artifactual changes confounded with 
adaptation. The present experiment is 
an attempt to identify some of these 
sources of variance, to estimate their 
contribution to previous experiments, 
and to suggest ways of eliminating or 
minimizing them. 

In many earlier studies S moved his 
hand freely back and forth within his 
field of view during the exposure pe- 
riod. The prismatic displacement of 
objects in the visual field could have 
resulted in a shift in the mean posi- 
tion of the hand during exposure. For 
example, when the hand placed in the 
center of the field is viewed through 
a wedge prism, oriented base left, it 
appears displaced rightward and must 
be moved leftward if it is to appear 
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centered within the field. Conse- 
quently, during either base-left or base- 
right exposure conditions S tends to 
Shift the mean position of his hand 
toward the prism base to keep it cen- 
tered. Research on kinesthetic after- 
effects (KA) has shown that if the 
hand viewed during exposure is kept 
displaced for some time, a resulting 
KA could contribute to the difference 
between pre- and postexposure mark- 
ings. 

Another problem as yet unexplored 
is the extent to which prepost differ- 
ences vary with target location. If 
the effect being measured represents 
only a shift in the apparent relation- 
ship between hand and eye, the differ- 
ence between pre- and postexposure 
markings ought to be constant for all 
target locations. To the extent that 
prepost differences vary Systematically 
with target location one might expect 
that other factors are making a con- 
tribution to the observed differences. 
A likely source of target differences 
is the finite length of S's arm with 
which he marks. The experiment to 
be reported explores the effect of tar- 
get location in a situation designed to 
maximize differences among targets. 

One additional dimension of the 
standard eye-hand procedure needs 
closer study. Since using a stylus to 
mark the position of a target is itself 
a motor skill one might suppose that 
an unpracticed S' exhibits a certain 
amount of skill learning over repeated 
trials and sessions, Such learning 
would be reflected in decreased disper- 
sion of markings about each of the 
targets. How much practice do naive 
Ss need to bring them to asymptote ? 
An answer to this question is impor- 
tant not only for increasing the econ- 
omy of experiments but also for avoid- 
ing the Possibility that experimental 
effects are themselves confounded with 
the effects of motor skill learning. Be- 


cause S in the Held-Gottlieb apparatus 
cannot see the hand with which he 
marks, the source of feedback respon- 
sible for the skill learning is nonvisual. 
That a visually guided motor skill can 
be aided by presumably nonvisual feed- 
back is of some interest in its own 
right. 


METHOD 


Subjects—The Ss were 18 college stu- 
dents, all right handed and naive with respect 
to the purposes of the experiment. 

Apparatus and procedure —At the start of 
each session S was seated in front of the 
apparatus, head held fast by a dental impres- 
sion on a metal plate (see Fig. 1). His left 
eye was always occluded. Looking down- 
ward, S could see through a 1.5-in. diameter 
aperture in front of the right eye. Below 
this aperture was mounted a rotatable turret 
which permitted a rapid change in viewing 
conditions, Depending upon the position to 
which the turret was rotated, S looked 
through an 11° prism oriented base left, an 
11° prism oriented base right, or either of 
two pieces of i-in. optically flat Plexiglas. 
The prisms caused Objects in the field of 
view to appear displaced laterally, in a direc- 
tion from base to apex. During pre- and 
Postexposure periods the flat Plexiglas was 
always in viewing position. 

In the preexposure period S used a felt- 
tipped pen to mark the apparent location of 
three 4-in, wide crosses, The virtual images 
of the crosses were located on a plane 19 in. 
from S's eye and perpendicular to the line 
of sight. The horizontally colinear targets 
were imaged in a plane through the center 
of the right eye, and 17°20’ visual angle left 
and right of that plane. The targets had 
been assigned the names 1, 2, and 3 reading 
from left to. right. The S marked the 
targets in a random order, responding to E's 
command, At each command S moved his 
right hand from his side, reached into the 
marking area, zeroed in on the target loca- 
tion, touched the felt pen tip to the paper 
provided, and then withdrew the hand, re- 
turning it to the right side. This marking 
cycle continued until S had made 10 marks 
about each of the target locations. 

After the preexposure marking S closed 
his eyes and the carriages which held both 
fargets and marking paper moved on tracks 
quickly out of view. The turret in front of 
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la. Apparatus in pre- and postexposure periods, 
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(The S views targets through flat 


Plexiglas on turret by way of mirror. Virtual images of targets lie within marking plane 
and S uses pen to indicate the apparent position of targets by striking paper inserted in 


marking plane.) 


lb. Apparatus during exposure period, with carriage moved from field of view. 


(De- 


pending upon the position of the turret, S views his hand either through flat Plexiglas, 


wedge prism base left, or wedge prism base right. 
keeps in the corral depression and retricts movements of his hand. 
arrows, the corral can assume any of three positions, left, center, or right. 


He holds a piece of dowel which he 
As suggested by the 
Supporting 


elements of the apparatus are eliminated for clarity.) 


S’s right eye was rotated, bringing either a 
different piece of Plexiglas or one of the 
prisms into position for viewing. A 10-in. 
square blackboard was inserted in the place 
formerly occupied by the marking paper. 
An 8-in. diameter hole had been cut from 
the board's center. The board (or corral) 
could be locked in any of three positions: 
centered about a sagittal plane through the 
middle of S’s right eye, or about positions 2 
in. right or left of that plane. The corral 
itself extended in a plane perpendicular to S’s 
line of sight (see Fig. 1b). A short piece 
of l-in. diameter dowel was placed in S’s 
right hand and the hand placed in the 
circular hole in the corral. The S opened 
his eyes and viewed his right hand which he 
moved freely within the limits of the corral, 

After 5 min, S again shut his eyes. The 
turret was rotated to bring a piece of Plexi- 
glas into viewing position, the corral re- 
moved, and the carriage with targets and 
marking paper brought back to its pre- 
exposure position. The E took the dowel 
from S and placed in his hand a felt marking 
pen with ink of color different from that used 
previously. The S opened his eyes and in 
response to E's commands marked the targets 


as before. After 10 marks were made about 
each of three targets, S' shut his eyes and 
the marking sheet was removed to prevent S 
from glimpsing it as he left. 

The interval between the end of exposure 
and the first postexposure mark was approxi- 
mately 25 sec. Although marking was self- 
paced, Ss were consistent as to the rate at 
which they chose to work. This rate of 
marking varied between 2 and 3.5 sec/mark. 

Design.—The first two sessions were de- 
voted to the collection of data with corral in 
the center position and no prism during 
exposure. In each of these sessions S went 
through two complete cycles of preexposure, 
exposure, and postexposure. In all subse- 
quent sessions only one complete cycle was 
run. The exposure conditions were defined 
by two factors, prism orientation and corral 
position, with three levels for each factor. 
The three prism orientations were base left, 
base right, and no prism (flat Plexiglas), 
while the corral positions were left, center, 
and right. Each S received the nine prism- 
corral exposure combinations in a different 
order randomized according to two balanced 
Latin squares. At least 23 hr. intervened 
between successive sessions for any S. ` 
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The horizontal displacement of each 
mark from the left side of the mark- 
ing sheet was measured to the nearest 
zy in. Since each sheet had been fas- 
tened in the same location the edge 
of the paper provided a convenient and 
reliable reference point. In order to 
minimize the effects of possible E bias, 
the measurements were made by a 
naive assistant, 

For each session the mean and 
standard deviations of pre- and post- 
exposure markings were computed. 
The differences between mean pre- and 
postexposure scores for the nine ex- 
perimental sessions were entered in a 
four-factorial analysis of variance. The 
factors were S, Prism Orientation, Cor- 
ral Position, and Target Location. 
The F ratios for the main effects and 
higher order terms were computed 
using the appropriate interaction with 
Ss as error term. Each of the three 
main effects proved highly significant: 
Prism, F (2, 34) = 72.64, p < .001; 
Corral, F (2, 34) =7.94, p< .005; 
and Target, F (2, 34) = 6.99, p< 
.005. None of the higher order terms 
approached significance (all p’s > .20). 

The effect of these three variables 
can be seen in Fig. 2. For each of 
the three prism orientations the differ- 
ence between pre- and postexposure 
markings is in the direction expected 
(cf. Held & Hein, 1958). With the 
prism oriented base left postexposure 
markings shift leftward, prism base 
tight produces rightward shifts, while 
with no prism differences between, pre 
and post are approximately zero, The 
5-min. exposure period generated a 
mean shift of 0.60 in., representing 
nearly 2° of visual angle. Corral posi- 
tion seemed to add to the prism effect. 
With corral left postexposure markings 
tend to shift leftward ; corral right pro- 
duces shifts to the right. The effect 
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Fic. 2. Mean difference between pre- and 
postexposure means for each of three levels 
of prism orientation, corral position, and 
target locations. (When postexposure mean 
is to the right of preexposure mean, the dif- 
ference is shown downward; leftward post- 
exposure shift is shown upward.) 


of corral is only about one quarter as 
large as that of prism. The third 
source of variance shown in Fig. 2 is 
target location. In general, leftmost 
target shows small shifts rightward, 
while the rightmost target gives a left- 
ward shift. 

In order to determine whether the 
dispersion of markings changed with 
practice, an analysis of variance was 
performed on the standard deviations 
of the four sets of practice markings 
made in Sessions 1 and 2. The SD 
values for pre- and postexposures, S, 
Target, and Order were entered in a 
four-factorial analysis. Order and Tar- 
get were the only significant main ef- 
fects: Order, F (3, 51) 2 3.532, p< 
05; Target, F (2, 34) = 849, p < 
-001. The only interaction which ap- 
proached significance was that between 
Order and Prepost (p < .05). These 
effects are shown in Fig. 3. In general 
there is a small reduction in SD over 
the four practice marking periods. The 
SD seems to have reached asymptote 
by Period 3, Moreover, the targets 
may be ordered by size of SD. Tar- 
gets in the center and at the left (No. 
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2 and 1) show the smallest SDs while 
marking tends to be most variable at 
the right (No. 3). 


Discussion 


Significant differences in postexposure 
shifts were generated by each of the three 
factors studied, prism orientation, corral 
position, and target location. In addi- 
tion, the analysis of the practice sessions 
suggests that Ss show a decrease in dis- 
persion of markings with practice. What 
are the implications of these findings for 
research on adaptation to displacing 
prisms? It was argued above that in 
previous studies the effects attributed to 
prism condition may actually have been 
a joint product of prism and of changes 
in mean hand position made in response 
to prism displacement of objects in the 
field. This suggestion is reinforced by 
the finding that similar directions of 
prism base orientation and corral posi- 
tion produce postexposure shifts of the 
same direction. For example, the present 
data suggest that either prism base left 
or a left corral position could produce 
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the expected leftward postexposure shift. 
Because some earlier experiments failed 
to control for mean hand position it may 
be argued that the effects of prism and 
hand position were confounded and that 
such experiments gave somewhat inflated 
estimates of the effect of prism viewing 
perse. Perhaps the cleanest way to ex- 
tricate the two effects is to use the 
technique of the present experiments, ma- 
nipulating both prism and corral inde- 
pendently in order to get separate esti- 
mates of the two effects. An alternative 
is to use a corral at a single position. 
Constraint upon hand excursion permits 
specification of the hand movements and 
limits the changes in mean hand posi- 
tion made in response to the prismatic 
viewing conditions. 

Another caveat may be derived from 
the present experiment, Because of the 
practice effect reported above, the use of 
unpracticed Ss in a single experimental 
session seems unwise (e.g, Weinstein et 
al, 1964). Such a procedure is contra- 
indicated for two reasons. (a) Since 
the variability of marking is greater in 
the first session than it would be after 
just a few practice periods, the proce- 
dure of using Ss in one session without 
practice gives an unnecessarily low sig- 
nal to noise ratio, (b) The likelihood 
that pre- and postexposure markings have 
different dispersions makes one uneasy 
about comparisons made between these 
two sets of markings (see Fig. 3). 

The highly significant effect of target 
location upon postexposure shift implies 
that previous investigators would have 
been wise to check the possibility of in- 
teractions between their main effects and 
target location. Because data have not 
been analyzed by target location it is 
possible that an important source of 
variance has been obscured and other 
effects generalized beyond the point ac- 
tually warranted. 

It must be noted that the interpretation 
of the significant corral effect in the 
present experiment is complicated by the 
fact that either of two sources may ac- 
count for the effect. Since S views his 
hand during exposure, any shift in hand 
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position must be accompanied by a cor- 
responding change in eye position. The 
effect of corral position could have been 
produced by prolonged maintenance of 
an atypical hand position, the concomi- 
tant atypical eye position, or by both. 
Experiments are now in progress to de- 
cide among these alternatives, 
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VERBAL REPETITION, SET, AND DECISION LATENCY * 
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Latencies of decisions about the synonymity of pairs of words, 
following 2-sec. prior presentation or 20-sec. verbal repetition of 1 of 
the decision words or of an unrelated control word, were investigated. 
2-sec. prior presentations of a decision word as well as 20-sec. verbal 
repetitions of that word shortened decision latencies and these 
conditions did not differ in the magnitude of their decision speeding 
effects. It was concluded that the decision speeding effects of verbal 
repetition found by Fillenbaum (1964) are completely attributable to 
the setting effects of prior presentation of the decision items. 


One measure of semantic satiation 
(i.e., loss of meaning of a word follow- 
ing its continued repetition) has been 
the time necessary to complete a task 
which requires knowledge of the mean- 
ing of the repeated item. This meas- 
ure is based on the assumption that 
a loss in meaning of the repeated word 
should render the task more difficult 
and thereby result in slower solutions. 

Jacobovits and Lambert (1962) 
found that slower solutions to arithme- 
tic problems occurred when one of the 
two addends had previously undergone 
repetition. Fillenbaum (1964), how- 
ever, found faster decisions about the 
similarity of meaning of pairs of words, 
following continuous oral repetition of 
one of the paired decision words than 
following continuous repetition of an 
unrelated control word.  Fillenbaum 
attributed his results to a “sensitizing” 
or "alerting" mechanism to "specific 
item features" which occurs during 
continued repetition of that word. 

In Fillenbaum's experiment, verbal 
repetition occurred to 12 words each 
followed by a comparison task. In a 
situation in which S knows that he 
may be called upon to make a decision 
about the word he is repeating (a fact 


1A report of this research was made at 
the 1965 Midwestern Psychological Associa- 
tion meeting in Chicago. 


which a normally intelligent S should 
become aware of early in the experi- 
ment) it appears reasonable that he 
will become set to respond accordingly. 
This preparation should shorten deci- 
sion latencies when the repeated word 
is the same as one of the paired de- 
cision words, but not when it differs 
from both. Therefore the effects of 
verbal repetition are completely con- 
founded with the setting effects of 
prior presentation of the word to be 
judged. 

In the present experiment, the ef- 
fects of these two variables are sepa- 
rated. Specifically the speed of de- 
ciding whether pairs of words are 
synonymous is measured under condi- 
tions where the decision pair follows 
a 2-sec. exposure of a decision or un- 
related control word as well as under 
conditions where it follows continued 
verbal repetition. 


METHOD 


The Ss were 48 undergraduates randomly 
assigned to three groups of 16 Ss each. The 
procedure for Group 1 (Verbal Repetition) 
generally resembled Fillenbaum’s (1964) pro- 
cedure, with some modifications. — All Ss 
judged eight pairs of words on similarity of 
meaning, each of the pairs being preceded by 
a 20-sec. continuous oral repetition of a word. 
Half the pairs were synonomous, half un- 
related. For each S the repeated words were 
one of the paired decision words for half the 
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decision pairs and unrelated control words for 
the remainder. A balancing procedure was 
used among Ss so that each decision pair 
was preceded by a decision word for half 
the Ss and by a control word for the re- 
mainder. The repetition words and the 
decision pairs were projected in the same 
area by a Sawyer slide projector, and the 
exposure of the repeated word and the onset 
of the decision pair were controlled by 
Hunter timers, Model 111C. The S's judg- 
ments of similar or dissimilar in meaning 
were indicated by the pressing of either of 
two telegraph keys, which also terminated 
the presentation of the decision pair. Deci- 
sion latencies were measured, by a Standard 
Electric timer, from onset of projection of 
the decision pair to the pressing of the tele- 
graph key indicating S's judgment, 

The procedure for Group 2 (Set A) was 
the same as for Group 1 except that a 2-sec, 
presentation of a decision or control word 
not repeated aloud followed by an 18-sec. 
blank interval before presentation of the 
decision pair replaced the 20-sec, oral 
repetition, 

The procedure for Group 3 (Set B) dif- 
fered from Group 2 only in that the 18-sec. 
blank interval preceded the 2-sec, word 
presentation, 


Resutts AND Discussion 


Mean log decision latencies follow- 
ing presentation or repetition of deci- 
sion words and control words are pre- 
sented in Table 1. 

For every condition, whether or not 
verbal repetition occurred, prior pres- 
entation of a decision word resulted in 
shorter decision latencies than prior 
presentation of a control word. A 
mixed design analysis of variance re- 
vealed this word effect to be signifi- 
cant, F (1, 45) = 145, p < 01. The 
pair effect was also significant, F (1, 
45) 2140, p< 01. Neither the 
group effect nor any of the interactions 
including the Word x Group interac- 
tion was significant at the .05 level. 

The results indicate that Z-sec. prior 
presentation as well as continued ver- 
bal repetition of a decision word short- 
ens decision latency and that these con- 
ditions do not differ in the magnitude 
of their decision speeding effect. These 
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TABLE 1 

MEAN Loc Decision LATENCIES 
Repetition or 
Presentation 

Word 

Groups Pairs —— 

Deci- | Con- 
sion trol 
Word | Word 
Group 1—Ver- |synonymous| .141 | .179 
bal Repetition | unrelated 154 | .183 
Group 2—Set A | synonymous | .118 | .161 
7 unrelated .160 | .204 
Group 3—Set B | synonymous | .088 | .138 
n unrelated A11 | .181 
LoL-L C O NEM INDIES 


results receive corroborative support 
from an independent experiment re- 
ported at the Midwestern Psychologi- 
cal Association meeting by Rohrman 
and Gough (1965), in which similar 
results were found using a 1-sec. prior 
presentation and 15-sec. verbal repeti- 
tion condition, 

The results strongly suggest that the 
decision speeding effects of verbal 
repetition found by Fillenbaum (1964) 
are completely attributable to the set- 
ting effects of prior presentation of a 
decision word. "They also contraindi- 
cate the possibility that the solution 
slowing effects of verbal repetition 
found by Jacobovits and Lambert 
(1962) occurred, but were obscured by 
the setting effects of prior presentation. 

As previous work (Gumenik & 
Spencer, 1965; Yelen & Schulz, 
1963) has shown, verbal-satiation ef- 
fects are elusive phenomena. 

lt remains possible that the differ- 
ences between the present results and 
those of Jacobovits and Lambert 
(1962) may be attributable to differ- 
ences in the experimental tasks. The 
task used in the present experiment 
involved a comparison of the meaning 
of the repeated item rather than utili- 
zation of the repeated item in the solu- 
tion of a problem, the task used by 
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Jacobovits and Lambert. While it ap- 
pears reasonable that any loss of mean- 
ing of an item necessary to the solution 
of the problem should render the prob- 
lem more difficult and therefore in- 
crease solution times, it is not neces- 
sarily true that any loss of meaning of 
an item should render a judgment of 
the meaning of that item more difficult. 
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EYELID TRACE CONDITI 


ONING, CS INTENSITY, 


CS-UCS INTERVAL, AND A CORRECTION 
FOR “SPONTANEOUS” BLINKING? 
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96 Ss were run in eyelid trace conditioning with 3 levels of CS 


intensity (an 800-cps tone of 70, 80, or 90 
intervals of .5, 1, and 2 sec. A correction for 


based on pretrial blink rates was appl: 


db. SPL) and 3 trace 
“spontaneous” blinking 


ied to the data. Both uncorrected 


and corrected percentages of CRs were directly related to the intensity 
of the CS, but this V effect was reduced at longer ISIs and on later 


trial blocks, 


CR strength was a U-shaped function of trace interval 


with raw percentage scores but inversely related to ISI following 


the correction. 


The purpose of this study was to 
determine the effects upon classical 
eyelid conditioning under a trace-con- 
ditioning paradigm of joint variation 
of CS intensity and CS-UCS (trace) 
interval. In analyzing the effects of 
the length of the trace interval, a cor- 
tection for “spontaneous” uncondi- 
tioned blinking was employed which 
consisted of subtracting pretrial from 
on-trial blink rates, 


METHOD 


psychology 
course at the University of Massachusetts. 


The apparatus was the two-chair eyelid- 
by Moore 


Design and Procedure.—Thirty-two Ss, 16 
Were assigned ran- 
trace intervals: .5, 


UCS was a 50-msec. puff of compressed 
nitrogen delivered to 


UCS intensity was 80 mm. Hg static pres- 


1 This investigation Was supported in part 
by Public Health Service Grant NIH-HD- 
00955-02 and was carried out by the first 
author in partial fulfillment of the require- 
ments for the MS degree. 
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sure in a ł-in. manometer, On a given trial 
the CS intensity was either 70, 80, or 90 db. 
SPL for all Ss, depending on random sched- 
ules restricted so that each intensity oc- 
curred equally often within a block of 24 
trials and in extinction. In analysis of 
data from this design the sources of varia- 
tion due to sex, recording channel, and 
block of trials were included in a "mixed" 
factorial analysis of variance together with 
the major independent variables, trace inter- 
val, and CS intensity, 

The Ss were read "neutral" instructions 
over an intercom immediately before Trial 1. 
All Ss received 72 acquisition trials, 75% 
of which were reinforced according to a 
random schedule restricted such that there 
were 18 reinforced trials in each block of 
24 trials, and each CS intensity was rein- 
forced on six of eight presentations within 
the block. Partial reinforcement was used 
in order to avoid Performance levels so high 
as to mask any differences among condi- 
tions. The UCS was completely withheld 
on 18 extinction trials (6 with each CS 
intensity), and intertrial intervals were 15, 
20, and 25 Sec, randomly varied. 

Response definition and correction. —A. 

R was defined as an upward deflection of 
the Tesponse pen of at least 1 mm. occur- 
ring no sooner than 150 msec. after CS 
onset and no later than UCS onset. It was 
assumed that the probability of a CR con- 
tains à component due to actual condition- 
ing plus some independent Probability of an 
unconditioned “spontaneous” or normal 
periodic blink falling within the CR criterion 
interval. The number of blinks during the 
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10 sec. immediately prior to each trial was 
counted and divided by the number of 
intervals equal in length to the on-trial trace 
interval contained in 10 sec. For example, 
if the 10 sec. preceding a trial for an S in 
the 500-msec. group contained 10 blinks, this 
would be divided by 20, the number of 500- 
msec. intervals, yielding .5 blinks per 500- 
msec. interval, ie, 50% responding. This 
figure was taken as an estimate of base-line 
blinking and was subtracted from the raw 
percentage score yielding a corrected per- 
centage score. The equation of a corrected 
percentage score is presented below. 


1 if at least 1 blink falls in the CR 
interval. 


Let X = 
0 if no blinks fall in the interval. 


Corrected score 


no. of pretrial blinks 
L i! on Trial j 
=42X;-%42 
iE i ix y d 


the second term not to exceed 1.00 in ab- 
solute value.? 

The summation was over eight trials be- 
cause this was the number of CS presenta- 
tions of each intensity within a block of 
acquisition trials for each S. The number 
of equal-sized intervals in 10 sec., y, was 
equal to 20, 10, and 5 for trace intervals of 
.5, 1, and 2 sec., respectively. Thus y de- 
creased in the ratio 4:2:1 so as to correct 
longer trace intervals carrying inflation in 
the ratio 1:2:4. The second term of the 
equation could exceed 100% whenever the 
average number of blinks per interval ex- 
ceeded 1. Therefore, in making the correc- 
tions the second term of the equation was 
treated as being equal to 100% whenever 
it actually exceeded that amount. In this 
manner the correction term received the 


? An alternative correction procedure based 
on pretrial response rates would be to use 
the second term in the expression as a co- 
variate in an analysis of covariance adjust- 
ment of group means, thereby taking into 
account the linear regression of the first 
term on the second. This procedure would 
increase the precision of comparisons among 
adjusted means of uncorrected scores. Anal- 
ysis of covariance was ruled out in the pres- 
ent experiment because y, the denominator 
of the second term, was a variable instead 
of a constant, and this would result in gross 
heterogeneity of variance on the covariate 
among the trace interval groups. 
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same restriction as did the raw score, namely, 
that there cannot be a probability greater 
than 1.00 of any given interval containing 
at least one blink. Even with this restric- 
tion, a corrected score could be negative, 
indicating that the estimated probability of 
a blink falling in a given on-trial interval 
was less than that for a pretrial interval 
of equal length. It was thereby possible 
for a corrected score to have any value be- 
tween —100% and +100%. 


RESULTS AND DISCUSSION 


Trace interval.—Figure 1 shows 
that, in acquisition, uncorrected (raw) 
percentage of CRs was a shallow 
U-shaped function of trace interval 
with no significant differences among 
groups. The effect of the correction 
was to lower all curves with the great- 
est reductions occurring at the longer 
trace intervals. The overall difference 
among groups in this case was highly 
significant, F (2, 84) =19.25, p< 
.001, with the linear trend component 
accounting for almost all of this vari- 
ance, F (1, 84) = 3848, p < .001. 
Thus, only with corrected percentage 
scores was an ISI gradient observed 
which was in accord with earlier trace 
conditioning experiments of Bernstein 
(1934) and Reynolds (1945) and with 
the numerous delayed conditioning ex- 
periments in eyelid conditioning re- 
viewed in Kimble (1961). 

As shown in Fig. 1, the uncorrected 
percentage of CRs elicited during ex- 
tinction was an increasing function of 
acquisition trace interval, F (1, 84) = 
4.58, p < .05. The correction reversed 
the slope of this linear trend, F (1, 84) 
= 24.85, p < .001. Finally, Fig. 1 in- 
dicates a greater loss in level of re- 
sponding during extinction at the 500- 
msec. trace interval than at either of 
the other two intervals, both with and 
without the correction. 

CS intensity—CS intensity pro- 
duced significant main effects in acqui- 
sition with raw scores, F (2, 168) = 
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Fic. 1. Raw and corrected mean percentage of CRs as a function of 24-trial blocks in ac- 
quisition and the block of 18 extinction trials with trace interyal and CS intensity as the 


parameters, 


10.01, p < .001, and with corrected 
Scores as well, F (2, 168) = 8.33, p 
<.001. The tendency in both cases 
and in extinction was for the two high- 
est intensities to elicit more responding 
than the lowest intensity of 70 db. 
SPL. As is apparent in the figure, 
the correction with few exceptions left 
the relationships among points within 
a trace interval essentially intact. 

Figure 1 shows greater differences 
in acquisition between CS intensities 
at the shorter trace intervals, with the 
curves for 70 and 90 db. showing an 
inversion at 2,000 msec. This Trace 
Interval X CS Intensity interaction was 
significant with both raw scores, F (4, 
168) =3.91, p< .005, and corrected 
scores, F (4, 168) = 2.94, p < 025. A 
simple Hullian interpretation would be 
that the V effect diminishes with delay 
of occurrence of the UCS. 

Close inspection of the acquisition 


curves of Fig. 1 suggests the CS in- 
tensity effects tended to diminish over 
trials. This is seen most clearly in the 
1,000-msec. condition. This CS in- 
tensity x Trials interaction only ap- 
proached significance with raw scores, 
F (4, 336) = 1.55, p< .20, but did 
attain significance with corrected 
scores, F (4, 336) = 2.30, p< .025. 
This diminution of the V effect over 
trials may not be inconsistent with 
Grice and Hunter's (1964) treatment 
of stimulus intensity dynamism in 
terms of adaptation-level theory. If S 
enters the conditioning situation with 
an adaptation level for stimulus inten- 
sity close to 70 db., and if experience 
with 80- and 90-db. tones gradually 
increases his adaptational level to, say, 
80 db., then the Grice and Hunter 
hypothesis could account for the 
shrinking V effect by arguing that in 
the first block of trials the most in- 
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tense CS is 20 db. from S’s adaptation 
level while on the last block of trials 
this same CS is only 10 db. from S’s 
adaptation level, hence the reduced V 
effect. 

Trials.—Raw percentage of CRs in- 
creased over blocks of trials, F (2, 
168) = 6.58, p < .005, but this evi- 
dence of acquisition was much greater 
with corrected scores, F (2, 168) = 
20.20, p < .001. Figure 1 indicates 
that the source of this added strength 
of the Trials effect was the 2,000- 
msec, group in which acquisition 
curves were horizontal before the cor- 
rections, but attained positive slope 
after its application. The implication 
is that even in the 2,000-msec. group 
there was an orderly rise in on-trial 
responding (relative to pretrial re- 
sponse rates) similar to that for more 
optimal CS-UCS intervals. Some in- 
vestigators have concluded that eyelid 
conditioning does not occur at ISIs as 
long as 2,500 msec. (e.g., McAllister, 
1953). On the basis of this conclu- 
sion, several investigators have treated 
a trial with a long ISI as an extinc- 
tion trial (e.g., Reynolds, 1958). The 
present results indicate that the pos- 
sibility of acquisition with long ISIs 
should not be discounted by those in- 
vestigators (e.g., Price, Abbott, & Van- 
dament, 1965) interested in factors 
which determine the rate of extinction 
with the long ISI technique. 

The correction.—The specific ISI 
function observed in a given study 
seems to depend upon the relative 
strengths of two opposing factors: the 
actual loss of conditioning at longer 
ISIs and the greater inflation of re- 
sponse frequency by normal periodic 
blinking as ISI lengthens. Up until 
now the problem of how to correct 
for this inflation has been handled in 
various ways. Kimble’s (1947) ap- 
proach to the problem was to use 
"catch" trials consisting of a 1,500- 
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msec. CS for groups with ISIs ranging 
from 100 to 400 msec. While it does 
assure equal measuring intervals, this 
method necessarily entails the intro- 
duction of a partial-reinforcement pro- 
cedure where one may not be desirable. 
McAllister (1953) corrected his data 
by subtracting the percentage of CRs 
occurring on the first block of trials 
from the percentage on each succeed- 
ing block of trials. While simple to 
apply, this correction has the disadvan- 
tage of sacrificing the data for the first 
critical trials. Ost (1960) corrected 
for varying ISI by subtracting the 
blink rate on preacquisition CS-alone 
trials from the percentage scores 
for subsequent blocks of conditioning 
trials. The validity of this approach 
depends on the assumption of a con- 
stant blink rate over the conditioning 
session. Finally, Prokasy, Ebel, and 
Thompson (1963) and Boneau (1958) 
proposed a more restrictive definition 
of a CR, one which circumvented the 
problem by scoring an on-trial re- 
sponse as a CR only if it overlapped 
with or occurred immediately before 
the UCR. This type of procedure is 
tied to certain theoretical viewpoints. 
Furthermore it cannot be likely that 
all other responses, excluding alphas 
and betas, which fall in the ISI are 
“spontaneous” blinks which can be dis- 
counted. 

The type of correction employed in 
the present study is devoid of any theo- 
retical position concerning the mecha- 
nism for classical eyelid conditioning. 
It merely yields information about 
relative blink rates without imposing 
any differential taxonomy on on-trial 
eyelid closures other than alpha re- 
sponses which were discarded. Fur- 
thermore, this type of correction does 
not involve procedural complications 
such as preacquisition test trials or 
“catch” trials; and it provides a flexi- 
ble base line which is sensitive to mo- 
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mentary variations in unconditioned 
blink rate. 
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In a series of experiments the similarity between items presented on 
a given trial (II) and on successive trials (PI) is systematically 
manipulated in conjunction with the difficulty of the information 
processing interpolated between presentation and recall. The results 
of the studies indicate that under conditions where forgetting proceeds 
independently of the effects of interpolated task similarity it depends 
upon similarity among stored items and upon the difficulty of the 
interpolated processing. The effectiveness of interference does not 
appear to vary directly with the difficulty of interpolated processing 
but is more closely related to the time material is in store. These 
results are compatible with the view that interfering items work spon- 
taneously during the retention interval to disrupt the original trace 


(Acid Bath) rather than merely competing at the time of recall. 


Studies of short-term retention of 
very small amounts of material have 
provided new opportunities to investi- 
gate the factors which lead to forget- 
ting. Of special importance is the abil- 
ity which such studies afford E to 
exercise control over the period be- 
tween presentation and recall. 

Until recently studies had shown 
that increasing the similarity of inter- 
polated material to that in store causes 
little decrement in the level of reten- 
tion, In those cases where the inter- 
polated material repeated a number of 
the same elements as the original list, 
recall was usually facilitated (Kennelly, 
1941). However, in a recent study 
Wickelgren (1965) has shown that 
when similarity is manipulated by vary- 


1 Experiment I was conducted while the 
senior author was Visiting Research Psy- 
chologist at the Human Performance Cen- 
ter, Department of Psychology, University 
of Michigan. The remainder of the research 
was supported by NSF Grant GB2701 with 
the University of Wisconsin. Special thanks 
are due to R. W. Pew for assistance in the 
formulation of Exp. III and to Blythe Nemi- 
roff and David Aderman for assistance in 
running S's. 

? Now at the University of Oregon. 
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ing the acoustic characteristics among 
consonants, large decrements in reten- 
tion occur. It is clear, however, that 
similarity of interpolated material can- 
not account completely for the effective- 
ness of interpolated items in reducing 
recall performance (RI) since varying 
the difficulty of interpolated processing 
with time in store and similarity held 
constant also has large effects upon re- 
tention (Posner & Rossman, 1965). 
Moreover the effect of interpolated 
material depends upon the number of 
previous trials with similar items which 
S has had (Keppel & Underwood, 
1962; Wickens, Born, & Allen, 1963). 
Thus, while it is true that S must be 
required to perform an interpolated 
task in order to show forgetting of 
small amounts of verbal material, the 
level of retention is often not a function 
of the number and similarity of inter- 
polated items. 

It appears, therefore, that some of 
the forgetting in short-term retention 
must be attributed to interference from 
previous trials (PI), interference 
among items presented on a given trial 
(II) or to factors other than inter- 
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ference (decay). In fact, when for- 
getting is not a function of the simi- 
larity of interpolated items the distinc- 
tion between interference and decay 
notions becomes rather difficult to 
make, since both theories tend to rely 
upon competition between traces at 
the time of recall to account for forget- 
ting. Decay theories hold that such 
competition increases because the 
stored trace is weakened in the ab- 
sence of rehearsal. Interference 
theories maintain that competing items 
strengthen over the interval due to 
spontaneous recovery. In addition one 
variety of interference theory also sug- 
gests that the trace is weakened during 
the interval due to unlearning. Since 
the degree of unlearning is necessarily 
a function of similarity and since un- 
learning is of doubtful applicability to 
studies where the interpolated mate- 
rial is not to be learned, spontaneous 
tecovery of competing items together 
with response competition seem to be 
dominant in the interference theory ac- 
count of short-term retention, 

A more meaningful distinction might 
be between two general views of the 
locus of the forgetting which occurs, 
In the first view, which is called “Trace 
Comparison,” the degree of forgetting 
is a function of the relative strength of 
the stored item as compared with other 
similar items at the moment of recall, 
Forgetting increases in time either be- 
cause previous items which were in- 
hibited at the moment of presentation 
of the new item recover increasingly 
over a retention interval or because for 
some reason other than interference the 
strength of the stored item decreases 
over the interval. The former mech- 
anism is associated with interference 
notions while the latter would be more 
compatible with a decay theory. In 
both views, however, the role of inter- 
ference is to vary the competition be- 
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tween traces at the moment of recall. 
This general position contrasts with 
the “Acid Bath” view in which inter- 
fering items interact with the stored 
trace spontaneously during the reten- 
tion interval to weaken its strength, 
Tn this view the forgetting occurs dur- 
ing the retention interval. This view 
suggests that the trace itself is de- 
stroyed not only as a function of time, 
but also as a function of the number 
and similarity of stored items. 

This series of studies is designed 
to show the role which interfering 
items play in forgetting. The first 
study examines the relative importance 
of similarity and difficulty of inter- 
polated material in determining for- 
getting. The second experiment de- 
termines the effectiveness of alternations 
in material to be recalled between suc- 
cessive trials, An analysis of these two 
studies suggests that acoustic confu- 
sions are the most potent of the types 
of interference employed in varying 
retention. The final two studies ex- 
plore the interactions between diffi- 
culty of interpolated processing and 
degree of interference. Since the two 
positions discussed above lead to op- 
posite predictions as to this interaction, 
these experiments allow a conclusion 
as to the locus of interference effects 
in short-term retention. 


EXPERIMENT I 


Previous work (Posner & Rossman, 
1965) has shown that the probability 
of recall of digit triples is a function of 
the difficulty of the processing required 
on subsequent digit pairs. This study 
compares the recall scores of digit and 
consonant triples during the processing 
of subsequent digit pairs, Presumably 
if similarity is the important variable in 
governing the level of retention in this 
situation the digit tasks should have 
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less relative effect on the consonants 
than upon the digits. 


Method 


Subjects—The Ss were 20 volunteer stu- 
dents at the University of Michigan summer 
session. They were paid $1.50 for participa- 
tion in this study. 


Material.—All materials were recorded on 
magnetic tape. Each trial consisted of a 
warning tone followed by five items .5 sec. 
apart. The first three items were either 
different consonants selected randomly from 
the population of letters B, D, G, H, J, K, 
L, N, Q, S or from the digits 0-9. The 
last two items were always digits selected 
at random from between 40 and 99. Follow- 
ing the five items there was a 10-sec. pause 
followed by the word “recall” and another 
20-sec. pause before the next trial The 
entire tape consisted of 5 practice and 50 
regular trials. The regular trials were di- 
vided into five series of 10 each. The first 
series. consisted of six consonant and four 
digit triples and the remaining series con- 
tained five of each type in a random order. 
Because of this imbalance only the first four 
consonant triples in the first series were 
scored, thus giving 24 trials of each type 
for each of the 20 Ss. 

Tasks—Five interpolated tasks were as 
follows: rest, recording digit pairs, addition 
of digit pairs, counting backwards by 3's, 
and classification of digit pairs into high or 
low and odd or even. These tasks are de- 
scribed in more detail elsewhere (Posner 
& Rossman, 1965). The tasks were selected 
to vary in difficulty in terms of the amount 
of information reduction required. These 
values are 0, 2.7, 3.8, and 4.6 bits, respec- 
tively, for the record, addition, backward, 
and classification tasks. The interpolated 
task always began with the pair of digits 
on the tape and continued at S’s own pace 
with digit pairs on the score sheet in front 
of him. The Ss were instructed to attempt 
to handle as many pairs as they could prior 
to recall, recording each answer on their 
score sheet. The classifications were re- 
corded as HO, HE, etc. 

Procedure.—The Ss were run individually 
and heard the items over a headset. Each 
S was fully informed of the population of 
items to be used. Each S performed 5 prac- 
tice trials, 1 with each interploated task and 
50 regular series, 10 with each interpolated 
task. The order was governed by four 
random Latin squares. 
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Results 


The basic result is shown in Fig. 1 
in terms of percentage of series and of 
items recalled in error for the digits 
and consonants with each of the inter- 
polated tasks. The X axis shows the 
amount of information reduction re- 
quired by the tasks which has been 
found in previous work to be closely 
related to task difficulty ( Posner, 1964 ; 
Posner & Rossman, 1965). The total 
number of possible item errors in re- 
tention was 288 for each point. The 
rest condition was errorless. 

The mean number of interpolated 
transforms performed in order of in- 
creasing information reduction was 
10.4, 94, 5.5, and 4.5, which is in line 
with previous results. The effect of 
task difficulty upon recall of the stored 
material also confirms previous find- 
ings. There were no significant overall 
differences between types of material 
(p> .05). The relative level of re- 
tention over various tasks for digits 
and for consonants was similar except 
the consonants show no increase in 
errors between the counting backward 
and classification tasks. Since the 
products of the classification task are 
the letters, H, O, L, and E, repre- 
senting the initials of the various 
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classes, the fact that consonants show 
less than expected forgetting in this 
condition is clearly not in line with an 
interference explanation. When Ss 
proceed from the record task to the 
classify task the number of interpolated 
digit pairs handled is halved; however, 
the percentage of items in error triples. 
On the other hand, the effect of going 
from the rest condition, in which no- 
thing is interpolated except the two 
digits on the tape, to the record task 
in which S sees and records 10 digit 
pairs, is only a small increase in for- 
getting. 

It is not easy to separate the proc- 
esses of learning, storage, and retrieval 
in the situations discussed in this ex- 
periment. It could be, for example, 
that digits are more easily learned than 
Consonants or the reverse. However, 
stress should be placed on the relative 
effectiveness of the various interpolated 
tasks upon performance at the time of 
recall. In general the tasks have 
identical effects upon the two kinds of 
materials, thus whatever processes are 
causing the interpolated tasks to reduce 
performance would hardly seem to be 
related to the similarity between stored 
and interpolated items. 

This result should not be surprising. 
Kennelly (1941) has shown that varia- 
tions in the similarity of interpolated 
material to that in store, even when the 
interpolated task requires learning, 
make little difference in recall scores, 
in the range of item similarity used in 
the present experiment. Apparently 
the effects of interpolated items simi- 
larity are important in short-term re- 
tention only when the interpolated 
items have a number of elements iden- 
tical with the stored material (Ken- 
nelly, 1941) in which case recall is 
facilitated or when the interpolated 
items are different but highly similar 
such as those selected for high acoustic 
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similarity (Wickelgren, 1965) in which 
case recall scores are reduced. Be- 
tween these two extremes, manipula- 
tions of similarity are apparently of 
minor importance in recall scores. 

If these data are taken to indicate 
that, within the range of this study, 
similarity of interpolated items is not 
the cause of differences in retention, 
what light does it cast on the factors 
responsible? Two results seemed to 
be important in this regard. If the 
results of this study for digit triples 
only are compared with the results of 
previous work the level of forgetting 
obtained is much reduced. This could 
be due to the different S populations 
or to the fact Ss were paid; however, 
the similarity of the results of the two 
studies in number and accuracy of 
transforms made this seem doubtful. 
Wickens, Born, and Allen (1963) 
found that switching once or twice dur- 
ing a session from digit to consonant 
triples led to significantly less forget- 
ting on trials after the switch. It is 
possible that using a mixed-list design 
Such as the present experiment acted 
to reduce the overall level of PI during 
the session and that this accounts for 
the lower levels of forgetting. This 
could mean that forgetting is mainly a 
function of interference between suc- 
cessive lists and that the difference 
between difficulty conditions simply re- 
flects the degree of rehearsal which can 
Occur to counteract the interference. 
However, a second result casts some 
doubt on this since an analysis of errors 
where similar material occurred twice 
in successive trials against those in 
Which a switch occurred, gave only 
very marginal evidence for this view. 
For these reasons Exp. II was designed 
to investigate the role of similarity 
between materials presented on suc- 
cessive trials in determining the level 
of retention, 
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EXPERIMENT IT 


In this study a list which alternates 
digit with consonant triples on succes- 
sive trials is compared with homo- 
geneous lists of all consonants and of 
all digits. 


Method 


Subjects—The Ss were 45 students in 
introductory psychology classes at the Uni- 
versity of Wisconsin. They served for 
course points, An additional 2 Ss were run 
but failed to follow instructions and are not 
included. 

Material.—The materials consisted of three 
separate tapes corresponding to the three 
major experimental conditions. These were 
digit triples only, consonant triples only, and 
mixed digit and consonant triples. Each 
trial was identical to that described in the 
previous study and each tape consisted of 
5 practice and 65 regular trials divided into 
five series. The digits were selected from 
the population 0-9, the consonants were se- 
lected from an acoustic confusion matrix 
(Conrad, 1964) to be minimally confusing; 
the population consisted of consonants H, J, 
K, L, N, P, Q, R, X, and Y. On the 
mixed tape the 13 trials of one series con- 
sisted of a digit triple followed by six digit 
and six consonant triples in an ABBA order. 
This meant that among the final 12 series 
there were an equal number of digits follow- 
ing digits, digits following consonants, and 
consonants following digits, and consonants 
following consonants. In the scoring only 
the final 12 trials of each series were con- 
sidered in all conditions. 

Tasks—The tasks were identical to those 
in Exp. I. 

Procedure.—The 45 Ss were divided into 
three groups using the three different tapes. 
Each S performed all five interpolated tasks, 
one for each series, in an order determined 
by three random Latin squares within each 
condition. 


Results 


The basic results of this study are 
shown in Fig. 2. In this figure the 
percentage of series in error is shown 
for the four interpolated tasks for the 
digit alone, consonant alone, digit 
mixed, and consonant mixed lists: 
The basic results are identical when 
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percentage of items in error is ex- 
amined. 

It is clear that the interpolated tasks 
have the expected effect upon reten- 
tion except that the digit-only lists 
show a small drop from the backward 
to the classification tasks. Since this 
task is not replicated in the mixed list 
and since it was different from the re- 
sults obtained in three previous studies 
it may be due to sampling error. Of 
more interest are the comparisons be- 
tween consonants and digits from the 
separate lists with consonants and 
digits from the mixed list. While the 
figure indicates that mixed digits show 
less forgetting than the digit-alone con- 
dition, this effect is not significant 
(p > .05). Moreover, there is no ten- 
dency for mixed consonants to show 
better retention than consonants alone. 
Another approach to the same question 
is to look at the frequency of series in 
error for triples following like and un- 
like material. The total errors in 
series following like items is 69, while 
for those following unlike items it is 
72. Thus, here again there is no clear 
evidence for decreases in overall for- 
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Fic. 2, Percentage of recall errors as a 
function of the difficulty of interpolated proc- 
essing for different lists of Exp. Il. 
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getting when material is switched back 
and forth during a single session. 

However, the effects of another kind 
of interference look more promising. 
In the first experiment in which a pop- 
ulation of consonants of intermediate 
confusability was used there was no 
significant difference in level of reten- 
tion between digit and consonant 
triples. However, in this experiment 
the retention of consonants is clearly 
and significantly (p < .01) superior to 
digits. In comparing the two experi- 
ments the performance on digits de- 
clined while that on consonants re- 
mained about the same. Since the digit 
material was the same in the two 
studies, reduction in performance may 
have been due to the different popula- 
tion of Ss. If Ss of Exp. II were 
poorer, the lack of effect upon conso- 
nant recall would be due to the choice 
of a minimally confusing set of items, 
While this conclusion is open to con- 
siderable question it is in line with 
other recent results (Wickelgren, 
1965) and it was thought worth ex. 
ploring in further studies, Thus 
whether digits are equal to consonants 
in recall scores seems to depend on the 
population of consonants chosen, If 
this is the case the confusability be- 
tween items in store can be used to 
vary the level of interference. 


ExPrRIMENTS III Anp IV 


The data so far indicate that differ- 
ences in the level of retention are a 
function of the similarity among stored 
items and the difficulty of interpolated 
Processing. The evidence from com- 
parison of the retention of consonants 
vs. digits in Exp. I with that in Exp. II 
Seems to indicate that the population of 
consonants chosen can make important 
differences in the relative levels of 
retention. This is in line with findings 
by Conrad (1964) and Wickelgren 
(1965). that acoustic similarity is an 
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important source of interference among 
consonants. 

If the degree of interference can be 
manipulated by varying the acoustic 
similarity of the stored items, and if 
the difficulty of the interpolated proc- 
essing is also varied, it should be 
possible to determine whether the in- 
terference is having its effect by com- 
petition at the time of recall or whether 
it works on the trace during the re- 
tention interval. If the former is the 
case, then changes in the strength of 
the stored trace during the interval 
which are independent of the time the 
items are in store, should vary the ef- 
fectiveness of the interference. The 
strength of the stored trace at any 
time can be controlled by the degree 
to which rehearsal is prevented due to 
the difficulty of the interpolated proc- 
essing. A decay theory would account 
for this effect of the interpolated proc- 
essing by claiming that the strength of 
the stored trace is weakened in the 
absence of rehearsal, while an inter- 
ference viewpoint would argue that the 
degree to which the stored trace is 
strengthened by subsequent practice is 
reduced by the control of rehearsal. In 
either case the strength of the trace at 
time ¢ must be an inverse function of 
task difficulty. Thus, the effectiveness 
of having similar items to compete with 
the trace must increase as the difficulty 
of the interpolated task increases. 
Therefore, arguing from either decay 
or interference theory a competition or 
“Trace Comparison” view predicts that 
the effectiveness of interference is a 
direct function of the difficulty of the 
pe pate processing. However, if 
the 
the simplest hypothesis would be that 
the effectiveness of a given amount of 
interference depends only upon the 
time it has to work on the old trace and 
is, therefore, independent of the diffi- 
culty of the interpolated task. This 


“Acid Bath” view is correct, then ' 
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means that for a given concentration 
and period of time the acid has a fixed 
effect. Of course, if the strength of 
' the stored trace is renewed through 
rehearsal during this period the overall 
recall scores will be increased, but the 
effectiveness of the interference itself 
would be constant. In other words, 
the effectiveness of interference at time 
t would be a function primarily of £ it- 
self since interference operates regard- 
less of the level of rehearsal. Experi- 
ment III and Exp. IV are designed to 
determine the kind of interaction be- 
tween difficulty and interference in 
determining the level of recall. 


Method 


Subjects.—There were 36 Ss in Exp. III 
and 48 different Ss in Exp. IV, obtained 
from the same source as in the previous 
experiment. An additional 13 Ss were run 
in the two experiments but did not com- 
plete the tasks. The majority switched 
from one to another interpolated task in the 
middle of a series or combined both tasks 
on one or more trials. 

Materials—The materials of Exp. III 
consisted of three tapes each having 4 prac- 
tice and 72 regular trials. The trials were 
similar to those discussed before except that 
the recall intervals were 0, 5, 10, and 20 
sec, each interval occurring three times in 
each of the six series of 12 trials. At the 
zero recall condition no digits were present 
on the tape. The three tapes used materials 
of high (BCDGPTVZ), medium (BDGJK- 
LNS), and low (HJLNQRXY) confusion, 
as measured by Conrad (1964). In Exp. 
IV eight tapes were used, four with a list 
of high and four with a list of low audi- 
tory confusing materials. The populations of 
letters were the same for the high lists as 
in Exp. III, but the low list substituted 
the letter C for H and K for L. The four 
tapes within a condition were used so that 
each trigram appeared once with each of the 
four delay intervals over the four tapes. 

Tasks—The interpolated tasks were the 
addition and classification tasks described 
previously. © 

Procedure.—In Exp. III the 36 Ss were 
randomly assigned, 12 to each of the con- 
fusability conditions. Each S performed all 
72 series alternating between the two in- 
terpolated tasks for successive blocks of 12 
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trials. Half of the Ss began with the 
addition task and half with the classification 
task. In Exp. IV 24 Ss were randomly 
assigned to each of the two confusion tapes. 
Within each group of 24 one quarter used 
each of the four subtapes which varied in 
the assignment of trigrams to intervals. 
Otherwise the procedure was the same as 
for Exp. III. 


Results 


The major differences between Exp. 
III and Exp. IV are that the low 
population of letters was altered in the 
latter experiment to avoid using two 
of the letters which are the products of 
the classification tasks H and L. Al- 
though previous work gave no evidence 
that these led to systematic increases in 
forgetting, recently published material 
(Wickelgren, 1965) plus the rather 
peculiar shape of the low classification 
condition in Exp. IIT, led us to replace 
these letters. Experiment IV did not 
contain the medium condition which 
also had one of the letters in question, 
Finally, Exp. IV was balanced so that 
each trigram occurred once with each 
of the four intervals. 

The major results of the two experi- 
ments are shown in Fig. 3 and 4. 
Both of the figures show the percentage 
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Retention curves for items of 
varying interference and for interpolated 
tasks at two levels of difficulty, Exp. IIT. 
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of items in error as a function of time 
in store for varying degrees of con- 
fusability and with tasks of varying 
difficulty. The two figures are vir- 
tually identical for the low list except 
that in Exp. IV the curve for classifica- 
tion does not rise as sharply from 10 
to 20 sec.; this is the one rather aber- 
tant result in Exp. III and seems to 
be due to the use of H and L in the low 
list. In the high conditions the two 
experiments are identical except that 
in Exp. IV the high-list classification 
task shows somewhat less forgetting 
particularly at 10 sec. The reliability 
of the results between the two studies 
can also be seen in Table 1, which 
shows the overall error rates as a func- 
tion of conditions collapsed across all 
intervals, 

As a rule the forgetting curves for 
all conditions tend to be negatively ac- 
celerated. The bulk of the forgetting 
occurs during the first 5 sec., particu- 
larly with the high interference values. 
The main effects of task difficulty, in- 
terference level, and time are all clear 
and in the expected directions. Analy- 
sis of variance of both experiments 
Shows these effects to be significant 
beyond the 01 level. Effects due to 
acoustic interference seem to be some- 
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Fic. 4. Retention curves for two levels of 
interference and for tasks at two levels of 
difficulty, Exp. TV. 
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TABLE 1 


MEAN PERCENTAGE OF ITEMS IN ERROR 
Over ALL DELAY INTERVALS 


Add | Classify 


| 
Exp. | Exp. | Exp, | Exp. 
at | av E; 
High List 17.9 | 169 | 224 | 198 
Medium List} 10.1 | — | 13.7 | — 
Low List 47 | 48 | 115 | 93 


what larger than those due to difficulty, 
but all differences maintain themselves 
over the 20-sec. interval. Some inter- 
actions are significant in either Exp. 
III or IV, but none reach statistical 
significance in both experiments. 

Of major interest is the interaction 
between level of interference and inter- 
polated task difficulty. The overall ef- 
fect can be seen in Table 1 by looking 
at the difference in percentage of item 
errors between the high and low lists 
for both levels of task difficulty. These 
values are 13.2 and 12.1 for the addi- 
tion task and 10.9 and 10.5 for the 
classification task. The effectiveness of 
this increase in interference is nicely 
reproducible in the two experiments 
and in the opposite direction from that 
predicted by the “Trace Comparison” 
notion. The effectiveness of interfer- 
ence in going from the low to the me- 
dium list is 5.4 for addition and 2.2 for 
classification. Only in going from me- 
dium to high does the classification task 
show a slightly larger effect of inter- 
ference, 8.7 against 7.8 for addition. 
In neither experiment, however, does 
the interaction between task difficulty 
and interference reach statistical sig- 
nificance, 

Table 2 shows the effectiveness of 
interference as a function of interval. 
Table 2 shows clearly that interference 
has some effect even at the zero time 
delay. This is probably due partly to 
listening errors, since it is obviously 
harder to discriminate the high list, and 
partly to memory errors, since the first 
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TABLE 2 


EFFECTIVENESS OF INTERFERENCE, AS MEAS- 
URED BY PERCENTAGE OF ITEM ERRORS 
(Hic List Minus Low List) 

FOR ALL INTERVALS 


Interval 
Task 
0 5 10 20 
Add 
Exp. III} 2.7 13.6 | 13.9 | 23.1 
Exp. IV} 45 16 11.2 | 17 
M 3.6 14.7 12.6 | 20 
Classify 
Exp. III} 4.8 17 16.6 | 17.1 
Exp. IV} 5 12 7 13 
M 4.9 14.5 11.8 15 


item has been in store over a second 
when S begins his recall in the zero 
condition. Task difficulty also seems 
to make some difference at zero delays 
even though no digits are actually pre- 
sented or processed, This may reflect 
some difference in set between trials 
performed with the two interpolated 
tasks. It does not seem to make much 
sense to attempt a rigid distinction 
here between the effects of interference 
in the learning or acquisition process 
from those upon memory, since the re- 
hearsal which continues during the 
interval constitutes more opportunity 
for learning. In any case, the small 
differences at time zero make relatively 
little contribution to the overall effect 
shown in Table 2. The effectiveness 
of interference increases markedly be- 
tween O and 5 sec. for both tasks. 
This effect appears as an interaction 
between level of interference and time. 
The high interference conditions show 
a more rapid increase in recall errors 
over time, especially within the first 
5 sec. This effect reaches significance 
in Exp. IV (p € 01) and is in the 
same direction, but not significant in 
Exp. III. After the first 5 sec. the 
effectiveness of interference remains 
relatively stable. 

Of particular interest is the near 
quantitative equality of the interfer- 
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ence effects for the two different tasks 
at 5 and 10 sec. when the results are 
averaged across studies. At 20 sec. 
the addition task does show a larger 
effect of interference than the classi- 
fication task. This is partly due to 
the one point in the low-classification 
condition in Exp. III which seemed 
to reflect interpolated similarity or at 
least was reduced when the new popu- 
lation of letters was used. 

There may be some question about 
the legitimacy of using the absolute 
difference in percentage of errors be- 
tween the interference conditions as a 
measure of the effectiveness of inter- 
ference. Since the percentage of error 
is never greater than 50% there is no 
apparent problem of a ceiling effect. 
If the relative increase in errors were 
used the  high-difficulty conditions 
would show even less effectiveness of 
interference with respect to the low- 
difficulty conditions. It is, therefore, 
clear that effectiveness of interference 
does not increase with task difficulty 
as demanded by the “Trace Compari- 
son” view. The data in Table 2 indi- 
cate that at least for this method of 
assessing interference effects they are 
a close function of the time in store as 
suggested by the “Acid Bath” formu- 
lation. 

Apparently the major effect of the 
acoustic interference is over at the end 
of 5 sec. After that time the high- 
interference conditions remain at a 
relatively constant distance from the 
low conditions, with a tendency for 
this distance to be reduced at 10 sec. 
and to increase again at 20 sec. These 
temporal effects are also apparent in 
Fig. 3 and 4 where the high-interfer- 
ence conditions show rather little in- 
crease in forgetting after the first 5 sec. 
except for a rise from 10 to 20, and 
the low lists remain stable after 10 sec. 

In both studies there is a tendency 
for the effects of task difficulty to in- 
crease over time especially with low 
lists. Thus while the effects of inter- 
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ference seem to end rather quickly, 
the role of task difficulty may become 
more important late in the 20-sec. re- 
tention interval. 

In summary, the effectiveness of in- 
terference seems to depend on the time 
in store. If anything, interference has 
a greater effect with the less difficult 
interpolated task although this inter- 
action is not significant. In any case, 
this finding is in the opposite direction 
from the “Trace Comparison” idea and 
Table 2 corresponds closely with the 
prediction from the “Acid Bath” view 
that interference is a function of the 
time during which it operates. 


CONCLUSIONS 


These data seem to provide a number 
of rather important basic facts about the 
nature of forgetting, In the first place, 
they confirm findings that variation in 
levels of retention cannot always be at- 
tributed to the number and similarity of 
interpolated items. In these studies the 
number of interpolated digit pairs de- 
creases with task difficulty. Forgetting, 
however, increases with task difficulty. 

Our efforts to manipulate interference 
between stored items (PI or II) also 
seem to show that what are intuitively 
important similarity manipulations such 
as alteration between digits and conso- 
nants do not lead to significant changes 
in retention. The only successful method 
found to manipulate PI over the whole 
session was in varying the acoustic simi- 
larity between stored items. Of course 
it cannot be told whether this manipula- 
tion was effective only between the let- 
ters on a given list (II) or was also 
effective between successive lists since 
these were confounded. 

Having shown that task difficulty can 
vary the level of retention within a fixed 
time interval without changes in simi- 
larity and that interference can be pro- 
duced by manipulation of acoustic simi- 
larity, it becomes possible to pinpoint the 
locus of the interference effects. The 
results of Exp. III and IV show that 
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acoustic similarity effects are not present 
in full strength immediately following 
storage of an item. Rather they increase 
over the first 5 sec. in store. The extent 
to which they increase appears to be a 
function of time in store and not a direct 
function of the level of interpolated task 
difficulty. The effect of interference is, 
if anything, greater at lower levels of dif- 
ficulty than at higher levels. In addition, 
the effectiveness of task difficulty in pro- 
ducing forgetting is greater with the low 
than the high interference lists. 

These results indicated that the locus 
of interference is during the interval it- 
self (Acid Bath) rather than occurring 
at the time of recall as a function of the 
relative strengths of present and previous 
traces (Trace Comparison). The “Acid 
Bath” idea implies the following concern- 
ing the way in which forgetting occurs. 
Stored items will tend to lose precision 
of information over time. Such effects 
can be eliminated when full capacity is 
available for rehearsal. However, when 
opportunity for rehearsal is reduced the 
rate at which such precision is lost is a 
function of the number and similarity of 
items which have been stored within an 
as yet unspecified temporal period of the 
trace which is to be recalled. Presuma- 
bly the rate of information loss is a func- 
tion both of the amount of acid (number 
of stored items) and its concentration 
(similarity). That the number of items 
Which are stored within a limited tempo- 
ral period are important in determining 
the rate of forgetting is evidenced by the 
Severe limitation of the memory span. 
An analysis of such studies (Posner, 
1963) reveals that as the number of items 
which are stored increases, the effective- 
ness of a period of rehearsal in prevent- 
ing loss during a subsequent task is 
reduced. The present study shows re- 
markably similar effects for item simi- 
larity at least as manipulated by acoustic 
pattern. When items are similar the rate 
of loss of information for a fixed number 
is increased and the effectiveness of re- 
hearsal tends to be reduced. 

The results of this study were some- 
What unexpected since both decay and 
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interference theories tend to rely heavily 
upon response competition to account for 
whatever forgetting occurs as a function 
of interference. However, previous re- 
sults in the analysis of RI in criterion 
learning (Melton & Van Lackum, 1941) 
have shown that response competition 
required supplementation with processes 
which involved changes in association 
due to new learning. More recently 
Koppenaal (1963) has shown that PI 
effects in criterion learning need not be 
a function of response competition. In 
one sense the “Acid Bath" notion is anal- 
ogous to his "unlearning" hypothesis in 
that PI effects occur during the retention 
interval. However, in other ways the 
notions are rather different since the 
“Acid Bath" does not require new learn- 
ing but only the operation of old learning 
upon the trace. Also, unlike the un- 
learning hypothesis it does not depend 
upon the similarity of interpolated items 
and is probably not subject to spontane- 
ous recovery. 

This view is especially attractive be- 
cause it tends to eliminate conflicts be- 
tween decay and interference theories. 
One of the strongest arguments for an 
interference notion is that forgetting will 
not occur on the early trials of a memory 
experiment when there are few compet- 
ing items (Keppel & Underwood, 1962). 
On the other hand, those results which 
show that forgetting increases with time 
in store without increases in interfering 
items (Anderson, 1960; Conrad & Hille, 
1958) have argued for a decay notion. 
Both of these results are explicable with 
the view that the rate of loss of informa- 
tion from a trace is a function of the 
number and similarity of interfering 
items. 

The *Acid Bath" analogy suggested by 
these data argues very strongly that the 
coding of short-term memory for conso- 
nants is partly, at least, by acoustic simi- 
larity. This point has also been made 
by Sperling (1963) and Wickelgren 
(1965). The neural view, which Wickel- 
gren suggests, would be compatible with 
the results of this study, although the 
notion of interference between similar 
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stored traces over time would have to be 
added. Thus, not only new incoming 
stimuli disrupt the stored information, 
but previously stored material affect it 
as wel. However, other neural models 
might also be compatible with these no- 
tions so that the “Acid Bath" analogy 
seems appropriate to the level of knowl- 
edge which can be formulated from our 
data. 
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BACKWARD MASKING AND MODELS OF 
PERCEPTUAL PROCESSING * 


NAOMI WEISSTEIN 
Committee on Mathematical Biology, University of Chicago 


The decision between parallel and serial operations in perceptual 
processing has always been made on the basis of whether or not, as a 
visual array size increases, there is a total increase in time from 
presentation to report. The results from this type of design are 
ambiguous: no operation within the perceptual processing sequence 
itself is measured, thus it is equally likely that additional operations are 
being added or that the operations are repeating themselves. A design 
using U-shaped backward-masking functions provided a measure of an 
operation occurring within the Processing sequence; the duration over 
which this operation occurred for arrays of different sizes reflected the 


type of processing occurring. There was an increase in masking 


range as array size increased; thus, 


occurs, 


no strict parallel processing 


Since these increases were not whole multiples of the increase 


in array size, the processing is not serial, item-by-item. These results 
have some general implications for visual backward masking. 


A visual stimulus produces a large 
number of discrete points of stimula- 
tion on the retina. After this stimu- 
lation, operations occur which trans- 
form the stimulus energies into the 
information which an O has available 
for report. This sequence, from input 
to report, may be called perceptual 
processing. 

Certain studies have supported the 
idea of simultaneous operations, or 
parallel processing, for at least some 
parts of this process (Neisser, 1963). 
Other studies suggest that the opera- 
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tions in perceptual processing are in 
serial order, i.e., fully carried out on 
one part of the visual array before 
proceeding to another part. The cur- 
rent formulation of short-term memory 
(Averbach & Coriell, 1961 ; Sperling, 
1963) assumes that operations are car- 
ried out on each item of a visual array 
in turn, that is, serially. 

For both of these formulations, the 
decision between parallel and serial 
Processing has been made on the 
basis of whether or not there is a 
total increase in response time with an 
increase in the size of the array. In 
other words, if the time between pres- 
entation and report increases for 
larger arrays, then serial processing has 
been assumed. On the other hand, if 
the time between presentation and re- 
port remains the same as array size 
increases, then parallel processing has 
been assumed. However, this type of 
design, which measures a total increase 
in the time between presentation and 
Teport (Sperling, 1963) does not un- 
equivocally suggest one formulation or 
the other. An increase in processing 
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time may mean either that more opera- 
tions are being performed simultane- 
ously for each part of an array (paral- 
lel), or, equally likely, that the set of 
operations is repeating itself in turn for 
additional parts of the array (serial). 

A technique which can help decide 
between these two formulations is one 
which can identify an operation occur- 
ring within the sequence from presen- 
tation to report. If such an opera- 
tion can be identified, then its dura- 
tion can be measured for arrays of 
different sizes. If duration is constant 
as array size increases, parallel proc- 
essing can be assumed; if duration in- 
creases in certain regular ways as 
array size increases, serial processing 
can be assumed. 

Certain types of backward-masking 
designs generate data which provide 
direct measures of operations occurring 
within the perceptual processing se- 
quence. Backward masking refers to 
the power of certain stimuli, the mask- 
ing stimuli, to disrupt or mask the 
processing of other stimuli, the target 
stimuli, which have been presented 
earlier. (See Raab, 1963, for a gen- 
eral summary.) When the amplitude 
of this disruption is a U-shaped func- 
tion of the delay between offset of the 
target stimulus and onset of the mask- 
ing stimulus, then this indicates an 
operation or set of operations which is 
distinct from other operations within 
the processing sequence. (See Weis- 
stein, 1964; Weisstein & Haber, 1965, 
for a fuller discussion. ) Moreover, the 
minimum duration of the processing 
sequence itself may be inferred to be 
at least the time between the onset of 
the earlier, target stimulus and the 
greatest delay at which the onset of the 
masking stimulus still has an effect. 
Tt is convenient to bound the oper- 
ation by considering only the delays 
between target and mask at which 
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maximum masking, or complete oc- 
clusion from perception occurs. These 
maximum masking ranges can be ob- 
served for arrays of different sizes and 
their durations compared ; such a com- 
parison will provide an indication of 
the type of processing occurring. 


GENERAL METHOD 


Target stimuli were presented in either 
1-, 2-, 4, or 8-item arrays. The array was 
presented tachistoscopically for a fixed, brief 
duration. Following its offset, a lighted 
blank field was presented, equal in intensity 
to the preexposure field, for a variable length 
of time, Then, a ring was presented, in 
a location which would have been encircling 
one of the letters in the array, had the array 
still been present. The time between offset 
of the array and onset of the ring is desig- 
nated as the interstimulus interval (ISI). 


EXPERIMENT I 


Method.—One-letter and four-letter arrays 
were presented. Each array was constructed 
from the letters O, D, or blanks, These 
were presented in a three-channel mirror 
tachistoscope (Scientific Prototype Manufac- 
turing Corporation, Model G). The visual 
field was 5 in. horizontally X 7 in. vertically, 
47 in. from S. Accuracy for all ranges of 
duration was + 296. Duration and intensity 
of the target field was monitored by an 
oscilloscope. In the target and masking 
channel, a tape of 140 rows of one- or four- 
letter displays and a tape of one ring per 
row, respectively, were displayed. After 
each presentation both tapes were advanced 
2 in. and the next array and ring combina- 
tion was aligned. 

Each of five Ss, four males and one fe- 
male, were tested individually; each had 
at least 10 hr. of previous practice. Sixteen 
sessions were run for each S. In each ses- 
sion, S first dark adapted for 1 min. and 
then light adapted to a blank preexposure 
field, illuminated, as were the target and 
mask fields, at 9.4 ftl. A black fixation point 
of less than 2’ was present in the center of 
the field. All fields were viewed binocu- 
larly. The array was presented for 20 msec. 
in one field, and then, after a variable delay 
during which the preexposure field was 
viewed, the ring was presented for 50 msec. 
in another field. The Ss responded by desig- 
nating what was inside the ring: O, D, or 
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blank. The location of the target letter and 
the subsequent ring was varied in one of 
four positions in the fields. For each of the 
seven ISIs per tape, there were 64 instances 
when a target letter was presented, and 
16 instances when a blank was presented. 
A ring was presented where the target let- 
ter had been or where the target letter would 
have been, in the case of a blank. There 
was one ISI for which the ring was drawn 
directly on the target tape. ISIs were pre- 
sented in two sets of seven randomized 
orders, The first set (containing ISIs of 
—20, 30, 40, 60, 80, 90, and 100 msec.) was 
run on all Ss before the second set (con- 
taining ISIs of —20, 5, 10, 20, 35, 50, and 
70 msec.), Time between individual presen- 
tations was no less than 5 sec. All variables 
for a particular size-array location of target 
letter (one of four positions), ISIs (seven 
values), kind of target (O, D, or blank), 
were randomized within each session. One 
four-letter array and one-letter array were 
presented in each session, in ABBA order 
over sessions. Two controls were run. The 
first used arrows as indicators (instead of 
rings) but was in all other respects identical 
to the experimental condition, for the four- 
letter arrays. For the one-letter array, 
only concurrent presentation of arrow and 
letter was used. The second presented the 
one-letter array alone without indicators; 90 
instances of these were run, 

Each letter subtended .30° horizontal x 
.62° vertical with a thickness of .08°. The 
angular distance between the fixation point 
and the center of a letter was .61? for the 
letters appearing in the two middle positions, 
and 1.73° for those in the two outside posi- 
tions. The masking ring had an inner di- 
ameter of .61° and a thickness of .051°. 
The angular separation between the outer 
edge of the letter and the inner edge of the 
ring was about .083°. All figures were done 
in black India ink with a Leroy scriber and 
template, and had a contrast of about .95. 


Resulis.—Figure 1 presents the er- 
rors made as a function of ISI for 
each array size and indicator sepa- 
rately, The two sets of ISIs are com- 
bined. The dotted line shows errors 
made with the arrow indicator for the 
four-letter array; no arrow indicators 
were run in the one-letter condition. 
All curves appear to be U shaped; and 
the maximum masking range for four 
letters appears much larger than the 
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range for one letter. The Ss are com- 
bined. 

It was first of all established that 
both the one-letter error function for 
rings and the four-letter error func- 
tions for both rings and arrows were 
U shaped, that is, were increasing in 
the early part of the delay range (0- 
30 msec.) and decreasing in the later 
part (50-100 msec.) by testing each 
part of each array-condition combina- 
tion separately across Ss by a Friedman 
two-way analysis of variance ( Mosteller 
& Bush, 1954). In all cases, p<.05. 

When one letter was presented alone, 
without a subsequent indicator, there 
was perfect accuracy. Thus the one- 
letter error function is clearly a mask- 
ing function. In order to get some 
estimate of masking for the four-letter 
array, independent of a base rate of 
error, the magnitude of errors made 
with rings was compared with the 
magnitude for arrows, and was greater 
by p< 001. Arrow indicators yield 
U shapes which rise and fall at the 
same delays as do the ring functions 
(x? Arrow vs. Ring for each ISI, for 
each S, p> .05); thus a subtraction 
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procedure will not give an accurate pic- 
ture of the four-letter masking func- 
tion. But they establish the presence 
of masking insofar as the masking 
amplitudes obtained are much less than 
those obtained for the rings. 

Finally, there was more masking 
with four letters, than with one letter 
(p < .001). This was tested by tak- 
ing the difference in error between 
rings and arrows for the four-letter 
condition, and comparing this differ- 
ence to the one-letter ring functions. 

The measure used to test the width 
of maximum masking ranges should 
be independent of the amplitude of 
masking at the maximum, Within 
certain limits, this may be done by 
taking the proportion of error to total 
error which occurs within a 20-msec. 
range of highest error. This propor- 
tion will be designated, depending on 
the array, as Pl or P4. This treat- 
ment of the data allows two or more 
maxima, if they exist, to be accounted 
for, whether they occur contiguously 
or at a separation of a number of in- 
tervals. Notice that the relationship 
between P and the width of maximum 
masking is inverse: the smaller the 
proportion the wider the masking 
range. 

The major question asked was 
whether the width of maximum mask- 
ing was greater for the four-letter ar- 
rays than for the one-letter arrays. 
Ranges restricted to 20 msec. of high- 
est error could be considered since it 
was established that the functions for 
the two arrays, for each S, differed 
generally from each other beyond p = 
01 (x2: ISI vs. array). A compari- 
son of differences in maximum ranges 
by differences in proportion of error 
under a 20-msec. range (Pl vs. P4) 
for each S, gives p < 0l. An expres- 
sion for the increase in masking range 
for the four-letter array is the ratio 
P1/P4, From this, the maximum 
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masking range for the four-letter array 
was, on the average, about 1.54 times 
the range for the one-letter condition.* 

Since the maximum masking range 
for the four-letter array is bigger than 
the maximum masking range for the 
one-letter array, no strict parallel proc- 
essing can be assumed, 

Similarly, since the maximum mask- 
ing range for the four-letter array is 
not four times the maximum masking 
range for the one-letter array, there is 
a strong indication that the processing 
is not proceeding serially, item by 
item. However, this indication is not 
conclusive, since there remains the 
possibility that there are more than 
one maxima which are separated from 
each other by a certain range of de- 
lays. With the addition of other array 
sizes, and longer delay intervals this 
possibility can be checked. Thus, Exp. 
II repeated the basic design of Exp. I, 
adding four additional array sizes and 
longer delay intervals. 


EXPERIMENT II 


Method.—The method here was the same 
as Exp. I with these exceptions. Each of 
four new Ss was tested individually for 30 
sessions; each had 8 hr. of previous prac- 
tice. One-letter, two-letter, four-letter, and 
eight-letter arrays were run. Two types 
of one-letter arrays were run: eight positions 
for the target and ring pair, and four posi- 
tions for the target and ring pair. This pro- 
vided a means of comparing directly the 
eight-letter results while holding possible 
spatial effects constant. The arrays will be 
designated 1-4, 1-8, 2-4, 4-4, and 8-8, where 
the first number designates the number of 
letters in the entire array and the second, 
the number of positions at which the target 
letter occurred. ISIs were combined into 
one set, yielding 13 intervals from —20 to 


2]t is obvious that the ceiling for the 
ratio P1/P4 depends both upon the time at 
which the later maxima would begin (if 
there are, in fact, later maxima), and on 
the number of ISIs tested. If maxima fol- 
lowed each other immediately, discounting 
—20 msec, the maximum range would be 
represented by the minimum proportion ob- 
tainable, 3/10. All Ss are below this range. 
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120 msec. in 10-msec. steps from 10 to 120 
msec. and in one 30-msec. step from —20 
to 10 msec. ISI presentation and order 
was automatically programmed and control- 
led by a stimulus programmer (Haber, Her- 
shenson, & Schroeder, 1962) so that the delay 
intervals were presented in random order 
in every session for every S. There were 
64 trials for each S at each delay interval. 
No blanks were presented, and only a re- 
sponse of O or D was allowed. Controls 
were arrows as indicators instead of rings; 
only four tapes for each array size (32 in- 
stances per ISI) were run as compared with 
eight tapes for each array size on the rings 
(64 instances per ISI) ; otherwise, the con- 
trol conditions were in all respects identical 
to the experimental conditions. 


Results.—Figure 2 presents the er- 
rors made with ring masks as a 
function of ISI for each array size 
separately, averaged over Ss. A com- 
parison of the ring functions with those 
of Fig. 1 shows that magnitudes dif- 
fered by a greater amount in the first 
experiment; this is a result of the re- 
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sponse indicator change. Recall that 
in Exp. I errors were scored on the 
basis of S errors when there was a 
target present and this includes calling 
"blanks" when there is a letter present ; 
thus the amount of errors could go 
much higher than the chance line. 
Other than this, there was no change 
in the data as a result of using a forced- 
choice response indicator. Most im- 
portantly, all ring-indicator functions 
obtained were U shaped—Friedman 
two-way analysis of variance gives p 
X .05 for increases for each array in 
the first part (0-30 msec.) of the delay 
range and for decreases for each array 
in the last part (60-120 msec.) of the 
delay range. 

Figure 3 presents the errors made 
with arrow indicators for each array 
size separately, averaged over Ss. The 
magnitude of error again, as in Exp. 
L, was significantly greater for rings 
than for arrows, P < .001, z test on 
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proportions. Arrow indicators yielded 
U shapes which rise and fall at the 
same delays as do the ring functions 
in 13 out of 20 cases (x? ISI vs. indi- 
cator, p 2.05). Thus again, as in 
Exp. I, a subtraction procedure will 
not give an accurate picture of the 
masking functions, but arrows do es- 
tablish, by their reduced amplitude, the 
presence of masking. If the arrows 
had shown any consistent rise at the 
later ISIs, then a correction for mem- 
ory decay would have been considered. 
There was no consistent error pattern 
in the seven arrow functions which 
were not U shaped; thus the measure 
of masking used was, as in Exp. I, 
the number of errors S made at each 
delay interval. 

Finally, an analysis of variance 
shows that difference in error scores 
between arrows and rings differed sig- 
nificantly from each other in strict 
ascending order by p<.001. This 
means that ring masking increases as 
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array size increases, relative to arrow 
masking. 

A general 13 X 5 x? test of the dif- 
ferences in frequency distributions at 
each ISI for each S gives all Ss differ- 
ences between arrays at p < .05. Thus 
a closer analysis of the differences be- 
tween functions is justified. 

The major question asked was 
whether the width of maximum mask- 
ing was an exact multiple of the num- 
ber of items in the array. Thus, the 
first phases of the statistical treatment 
are the same as in Exp. L. First, each 
array was compared with every other 
array, by 2 X 13 x's, arrays vs. ISIs. 
For three out of four Ss, 14 differed 
significantly from both 2-4 and 4-4; 
for all Ss 1-8 differed significantly 
from 8-8 (f in all cases < .05) ; and 
in no case did 1-4 differ significantly 
from 1-8. Next the width of maxi- 
mum masking was investigated: The 
measure of maximum masking is again 
the proportion of error to total error 
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which occurs this time within a 30- 
msec, range of highest error. This will 
be designated as Pn-m, n = the num- 
ber of letters in the array, m — the 
number of positions the target letter 
was presented at. Pi-j is a typical 
proportion. Within a 30-msec. range, 
all Pi-j differed from each other by 
P< .001 (1X 5 xs, S vs, Pi-j for all 
i). When each Pi-j is tested against 
Pk-l, all Ss differ on Pl-4 ys, P4-4 
and P1-8 vs. P8-8; three Ss differ on 
P1-4 vs. P2-4; two Ss differ on P2-4 
Vs. P4-4 and on P2-4 vs, P8-8 (p in 
all cases < .05) ; and no Ss differ on 
P4-4 vs. P8-8, or on P1-4 vs, P1-8, 
Whether or not individual pairs of 
proportions differed significantly, all 
arrays produce an increase in masking 
range, measured as the proportion of 
error to total error under a 30-msec. 
range, for each increase in the number 
of items in the array. (Friedman anal- 
ysis of variance, Pi-j x Ss, p < 01.) 
Thus Exp. II corroborates Exp. I: all 
ranges “increase with increase in the 
size of the array. A significant differ- 
ence was found between arrays for all 
Ss in general and for all Ss in terms 
of rank order. Thus it is reasonable 
to look at the precise differences in 
width of masking ranges. It is first 
of all apparent that the linearity of the 
increase in range cannot be precisely 
determined, since the measurement 
used to describe width of masking 
range is sensitive to errors outside this 
range, and there is no way of Sepa- 
rating out errors due to short-term 
memory and errors due to the persist- 
ence of masking. However, some 
comparisons can be validly made. Re- 
call that serial processing which is 
proceeding one item after another im- 
plies that masking ranges for N-sized 
arrays would be N times as large as 
masking ranges for one-letter arrays. 
With the measurement used in this 
study, errors outside the range of 
maximum masking tend to increase the 


estimate of width of range; thus it 
would be likely that masking ranges 
for N-sized arrays would appear to 
be N times as large, even when they 
were not. If the finding is in fact that 
they are not N times as large, then, 
there would be strong reason to con- 
clude that the processing is not pro- 
ceeding serially one item after another. 
If maximum masking ranges were N 
multiples of the range for one letter 
alone, then for these data, by two 
letters, the proportions would have 
reached their ceiling in all but one 
case; and the proportions for larger 
arrays would have been steady around 
this ceiling. Likewise, if there were 
one peak per letter, all functions above 
two letters would resemble the two- 
letter functions. It was established by 
comparing masking widths that all 
proportions differed from each other; 
moreover, a £ test shows that all P1- 
4/Pn-m differ significantly from the 
ceiling ratio by p < .01. Thus it can 
be concluded that the increase in mask- 
ing range is not N times the masking 
range for one item alone. 

There exist two possibilities; one 
that the masking widths are under- 
estimated; the other, that the masking 
widths are overestimated. The first 
has been shown unlikely, The second 
possibility, more specifically, is that 
due to memory decay, masking widths 
are overestimated to such an extent 
that if there had been some way to 
correct entirely for memory, the widths 
would not have differed at all from 
each other. But this is unlikely for 
the following reasons. If short-term 
memory decay were to explain the in- 
crease in masking range, then there 
should be no difference between the 
one-letter and the two-letter ranges, 
lé, between arrays well within the 
span of immediate memory. However, 
the combined x? differences (Mosteller 
& Bush, 1954) between the ranges of 
these arrays are less than p= .001. 
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Second, an analysis performed on the 
data in Exp. I restricted the total num- 
ber of delay intervals considered from 
0 to 60 msec.; maximum masking 
ranges still differed between arrays by 
p< .01. Finally, there was no con- 
sistent effect due to memory decay; 
that is, the errors did not significantly 
and consistently rise at the later ISIs. 
Thus, although there may be some 
overestimation due to memory effects 
and higher error rates, these do not 
account entirely for the increase in 
masking ranges with increase in array 
sizes, 


Discussion 


One-letter and N-letter arrays were 
presented, and one letter out of these 
arrays was subsequently masked by the 
presentation of a ring which encircled 
the position where the letter had been. 
Two types of response indicators were 
used: yes-no and forced-choice. As was 
to be expected, the magnitude of error 
decreased when forced-choice indicators 
were used; other than that, the change 
to forced-choice procedure had no effects 
on the data. For both, the amplitude of 
masking was U shaped with respect to 
the delay between offset of the target 
and onset of the ring. It was argued 
that since this kind of backward-masking 
function identifies an operation or a dis- 
tinct set of operations within perceptual 
processing, then the duration over which 
maximum masking occurred would re- 
flect the type of processing occurring. 
If the maximum masking ranges between 
one-letter and N-letter arrays did not 
differ, then parallel processing could be 
assumed; if they differed as an exact 
multiple of the number of items in the 
array, then the short-term visual memory 
formulation of item-by-item processing 
could be assumed. 

Backward-masking data, in these ex- 
periments, yielded partial answers to 
these processing alternatives. The proc- 
essing is in no strict sense parallel, since 
on both experiments, masking ranges in- 
creased with increase in the size of the 
array. However, this increase for an 
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N-letter array was not N times the range 
for the one-letter array; thus processing 
is not proceeding serially, item by item, 
as has been assumed in some recent mod- 
els. (See Sperling, 1963.) 

If the maximum masking ranges indi- 
cate the occurrence of some perceptual 
operation or set of operations, as has 
been proposed in this study, then there 
are some important questions left to be 
answered. For instance, it is possible 
that the duration of occurrence of this 
set increases linearly with respect to 
items, that is, there is the possibility that 
serial processing is proceeding with re- 
spect to some feature of each item, al- 
though not item by item. In order to 
answer this question, analytic expressions 
for the masking functions are needed so 
that their width at maximum masking 
may be given in a precise manner, Such 
a procedure, in addition, would be able 
to test a serial vs. a parallel formulation 
with respect to other units besides items, 
such as the various types of contours in 
the target and masking stimuli. This 
analysis is underway. 

In addition to the question of serial 
vs. parallel processing, this study bears 
directly on some questions within the 
general area of visual backward masking. 

First, it will noticed that U shapes are 
obtainable both with yes-no and with 
forced-choice response indicators, Thus, 
this shape cannot be explained solely on 
the basis of a change in criteria for de- 
tection as delay interval changes. In 
general, the obtaining of U shapes is at 
some variance with an explanation which 
sees masking as due merely to reduction 
in discriminability of the target due to 
the contrast reduction from luminance 
summation (Eriksen & Collins, 1964). If 
discriminability of the target gets better 
and better the longer the time between 
target and mask, or if discriminability of 
the target is impaired during a certain 
constant time interval after initial recep- 
tion, then only monotones or shapes of 
zero slope should be obtained for masking 
functions. But monotones are only some 
of the functions that can be obtained. The 
conditions under which U shapes may be 
obtained have been fairly well investigated, 
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and thus were expected in this study. (See 
Raab, 1963.) However, U shapes ap- 
peared also where they were not expected 
at all, namely, where arrows were used 
as indicators. The only exception pre- 
viously noted to the contour rule for U 
shapes, that is, the rule that states that 
the inner contour of the mask should be 
quite similar to the outer contour of the 
target, is the work of Averbach and 
Coriell (1961). However, they obtained 
no masking when there was one letter 
alone in the target stimulus field ; nor, 
paradoxically, did their bar indicators 
give masking shapes. Yet in the present 
study, not only were masking shapes ob- 
tained for one letter alone, but also, 
masking shapes occurred with arrow in- 
dicators for five out of eight conditions 
in which there was only one letter in 
the stimulus field. 

The failure to obtain masking with one 
letter in the target field prompted Aver- 
bach and Coriell ( 1961) to propose that 
for masking, the target material had to 
be too complex in number to be dealt 
with all at once: items had to be put 
in temporary visual Storage. The storage 
was unstable and hence items appearing 
later could replace those already in the 
Storage. It is interesting that they did 
not question the absence of masking 
shapes for their bar indicators with such 
à formulation, since their distinction be- 
tween maskable and nonmaskable did not 
depend on the characteristics of the stim- 
uli but on the control System operating. 
The data in this study indicate that what- 
ever the nature of the initial 100 msec. 
or so of perceptual processing, one item 
can be affected as well as more than one 
item, and the amplitude of masking varies 
with the characteristics of the stimuli. 
Notice that the masking amplitudes in 
this study for the forced-choice data are 
not extremely high; none go so high as 
chance. Also, even though arrow indi- 
cators frequently give masking shapes, 
their amplitude is well below that of the 
ring indicators. 

However, error amplitudes of the 
functions in this study increase when the 
stimulus size increases; it is probable 
that this accurately reflects an increase 


in masking amplitudes also, although the 
determination of this was not possible in 
this study since arrow-ring differences 
reflect a differential increase in masking 
width as well as in masking amplitude. 
In light of these considerations, it is 
suggested that the masking effect with 
l letter out of 26 was too weak to be 
noticed in S performance; as Averbach 
and Coriell increased their display, error 
amplitudes increased to such an extent 
that the U-shaped effect could be no- 
ticed. But it is the duration of the 
effect, rather than its presence, that var- 
ies with the number of items presented. 
It is suggested that, given the general 
conditions under which U shapes will 
obtain, this statement probably applies 
generally to a wide range of masks and 
targets; failure to obtain masking may 
be due simply to the weakness of the 
effect, rather than its absence in proc- 
essing operations. 
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BACKWARD RECALL WITH COMPOUND STIMULI 


ROBERT K. YOUNG, JONELLE M. FARROW, SUE SEITZ, ann MARY HAYS 


University of Texas 


Backward (R-S) recall was measured following the learning of a 
paired-associate task in which compound stimuli, each consisting of a 
nonsense syllable and a color, were employed. 2 levels of stimulus 
meaningfulness were used, and within each level Ss recalled either 


colors or nonsense syllables. 
nonsense-syllable stimuli. 


2 control groups learned PA lists with 
An interaction between stimulus meaning- 


fulness and stimulus-component recalled was obtained between the 


experimental groups during recall. 


In paired-associate (PA) learning 
the effective stimulus (the stimulus 
used by S) may be the particular 
verbal unit designated by E or it may 
be some component of that unit. To 
investigate this, Underwood, Ham, 
and Ekstrand (1962) required Ss to 
learn a PA list employing compound 
stimuli, each consisting of a verbal unit 
and a unique color. The effective 
stimulus was determined by measuring 
transfer to a second list which used 
either the verbal units or the colors 
as stimuli. When the meaningfulness 
of the verbal units was low, transfer to 
a list using the colors as stimuli was 
high while transfer to a list using the 
verbal units as stimuli was low. When 
the meaningfulness of the verbal units 
was high the opposite results were ob- 
tained: Transfer to the color list was 
relatively low while transfer to the 
verbal list was high. 

The strength of backward (or R-S) 
associations has been shown to paral- 
lel the strength of favored (or S-R) 
associations, e.g., Jantz and Under- 
wood (1958). The present experi- 
ment was designed to determine if re- 
sults similar to those of Underwood 
et al. also obtain when R-S recall is 
measured in a PA list employing com- 
pound stimuli. 


METHOD 


Materials—The materials used in the 
present experiment were presented manually 
on 34 X 6ł in. white plastic cards. On each 
card in the experimental groups, a sheet 
of colored construction paper was mounted, 
completely covering the card. For each 
stimulus, the color of the card was unique, 
and the colors used were blue, red, green, 
brown, yellow, orange, purple, and gray. 
The control group cards were not covered 
and were therefore a uniform white. Cen- 
tered on each card, for all groups, was a 
2X § in. strip of white paper upon which 
was typed a nonsense-syllable stimulus, On 
the white back of each card the adjective 
response was typed on a similar sized strip 
of white paper. 

The nonsense syllables, which were as 
dissimilar as careful inspection would allow, 
were taken from lists prepared by Glaze 
(Hilgard, 1951) and the adjectives (also of 
low intralist similarity) were taken from 
lists prepared by Melton (Hilgard, 1951). 

Design—Each of 72 Ss learned an eight- 
pair PA list and then was given three R-S 
recall trials. Two levels of stimulus mean- 
ingfulness (93% and 20% association-value 
nonsense syllables) and three treatment lev- 
els (two experimental and one control) 
were combined factorially. The two experi- 
mental groups (Groups Col and N) learned 
a compound stimulus list—a list with stimuli 
consisting of combinations of colors and 
nonsense syllables—while the control group 
learned the same list without the colors. 
During R-S recall Group Col was instructed 
to recall the appropriate stimulus color and 
Groups N and Control were instructed to 
recall the appropriate nonsense syllable. 
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Procedure.—Each stimulus was presented 
for 2 sec. (all timing was done by the 
silent counting of E); the card was then 
flipped over and the response presented 
for 2 sec. A 2-sec. interpair interval and 
a 6-sec. intertrial interval were used. The 
cards were shuffled after each trial to pre- 
vent serial learning and the criterion of 
learning was one errorless trial. During 
the three R-S recall trials a similar proce- 
dure was used except that the stimuli were 
never presented and Ss were never told 
whether they were correct. Approximately 
1 min. elapsed between learning and recall 
for additional instructions. The experimen- 
tally naive Ss were taken from introductory 
psychology classes and were assigned to 
groups in a predetermined random order. 
Two samples of nonsense syllables and two 
samples of adjectives were used within each 
level of stimulus meaningfulness, The three 
Es ran an equal number of Ss in each con- 
dition. 


RESULTS AND Discussion 


Trials to learn the lists were ana- 
lyzed. The only significant difference 
found in the analysis was between the 
two conditions of stimulus meaningful- 
ness, F (1, 36) = 4.98, p < .05. The 
mean trials to learn the 93% and 20% 
lists were 5.36 and 7.22, respectively, 

The recall data are presented in 
Table 1. The variances were hetero- 
geneous (Fmax = 9.86, p<.01) but 
the data were not transformed. In 
this analysis stimulus meaningfulness 
did not approach significance, F (1, 


TABLE 1 


MEAN CORRECT STIMULI PER TRIAL 
DURING R-S RECALL 


Stimulus Meaningfulness 


Group 90% 20% 
SD M SD 
Control] 6.80 94 5.00 1.65 
N 6.20 2.16 2.60 2.94 


[o 2.10 | 129 | 520 | 2:52 


36) — 1.62. The three treatment 
groups differed in the number of stim- 
uli recalled, F (2, 36) = 5.79, p < .01. 
Partitioning of the treatment sum of 
squares by use of orthogonal weighting 
coefficients indicated that the control 
groups differed from the experimental 
groups, F (1, 36) = 9.93, p < .01, but 
that the experimental groups did not 
differ from one another. The Treat- 
ment X Stimulus Meaningfulness in- 
teraction was also significant, F (2, 
36) = 12.85, p < .01. Use of orthog- 
onal weighting coefficients indicated 
that the interaction between stimulus 
meaningfulness and the experimental 
and control groups was not significant, 
F (1, 36) — 1.57, but that the inter- 
action between stimulus meaningful- 
ness and the experimental groups was 
highly significant, F (1, 36) = 24.12, p 
X 01. No other Fs approached sig- 
nificance. 

The Stimulus Meaningfulness X Ex- 
perimental group interaction indicates 
that more colors were recalled when 
stimulus meaningfulness was low and 
more nonsense syllables were recalled 
when stimulus, meaningfulness was 
high. These results are in close agree- 
ment with those obtained by Under- 
wood et al. (1962). 

The results of the present experi- 
ment indicate that stimulus selection 
during S-R learning plays an impor- 
tant role in R-S recall. Just as S’s 
choice of an effective stimulus can in- 
fluence the amount of transfer when 
the effects of forward associations are 
measured, as demonstrated by Under- 
wood et al, so too can his choice of 
an effective stimulus similarly influ- 
ence the amount of transfer in the 
backward direction. In addition, it 
should be noted that whether the in- 
fluence of stimulus selection is meas- 
ured through the use of forward or 
backward associations, the same rela- 
tive results are obtained, 
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SHAPE PERCEPTION FOR ROUND AND ELLIPTICALLY 
SHAPED TEST OBJECTS + 


H. W. LEIBOWITZ AN» KATHLEEN A. MENEGHINI 


Pennsylvania State University 


In order to determine whether the phenomenon of shape constancy as 
observed with circular test objects is influenced by a tendency to 
respond in terms of a familiar circular shape, matches were obtained 
for a round and 2 elliptically shaped test objects. With this procedure, 
shape constancy for a round test object is manifested by matches 
which are rounder than the retinal image, but for elliptical objects, the 
tendency toward constancy requires the matches to be less round than 
the retinal image. The data, obtained in 2 experiments involving 
variation of angle of rotation and exposure duration, indicate that the 
constancy phenomenon is toward the true shape of the test object and 
that circularity is not a factor. These results emphasize the im- 
portance of cues, present during observation, as mediators of shape 


constancy. 


The phenomenon of shape “con- 
stancy” may be described as the tend- 
ency to recognize the shape of objects 
independent of variation in their retinal 
image patterns. The biological func- 
tion of shape constancy is to emphasize 
the permanent characteristics of ob- 
jects under varying conditions of ob- 
servation and to stabilize object per- 
ception, Experimentally, constancy of 
shape may be investigated by instruct- 
ing an S to match a disk, presented at 
various angles to the line of vision, 
with one of a graded series of two- 
dimensional ellipses. Typically, the 
matched shape will be rounder than 
would be expected from a prediction 
based on the geometry of the retinal 
image. This demonstrates the tend- 
ency toward shape “constancy” or 
“phenomenal regression to the real ob- 
ject” (Thouless, 1931; Woodworth & 
Schlosberg, 1954). 

A number of mechanisms have been 
proposed to account for this phenome- 
non among which are visual cues such 

1 Supported in part by Grant MH-08061 
from the National Institutes of Health and 


by Undergraduate Participation Grant GE- 
1507 from the National Science Foundation. 
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as those for binocular depth, texture, 
slant, etc. (Beck & Gibson, 1955; 
Hake, 1957). However, in object per- 
ception, information may also be con- 
veyed by nonvisual factors. To know 
that an object is a member of a class 
designated as “human” or “book” im- 
plies certain constraints regarding its 
physical characteristics. It has been 
demonstrated that the sizes of familiar 
objects may be accurately indicated 
with the eyes closed (Bolles & Bailey, 
1956). Obviously, the perceived char- 
acteristics of objects are not deter- 
mined exclusively by the stimuli or 
cues available during observation. Past 
experiences may play some role in de- 
termining the perceived qualities of 
familiar objects (Borresen & Lichte, 
1962). 

In the experimental investigation of 
perceived shape, the majority of stud- 
les have employed circular test objects 
Which are rotated so as to produce an 
elliptical retinal image. If the tend- 
ency toward shape constancy is ob- 
served, S’s matches will be rounder 
than the retinal image in the direction 
of the true shape of the test object 
(Thouless, 1931). Since round objects 
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are quite familiar in the past experi- 
ence of normal adult S’s, the question 
may be raised as to whether the mani- 
festation of shape constancy observed 
with round objects results from a tend- 
ency, mediated by language or concep- 
ual factors, to respond in terms of 
the familiar round shape, or whether 
the constancy effect occurs in relation 
o the true shape of the test object 
(Graham, 1951). The purpose of the 
present study is to evaluate the role 
of the specific shape of the test object 
by obtaining shape matches for both 
elliptically shaped and round test ob- 
jects. 

Two experiments are reported. In 
the first, S matched the shape of the 
est objects, presented at various angles 
of rotation, with a series of line draw- 
ings of ellipses. If shape constancy is 
mediated primarily by visual cues, 
these functions, when plotted as per- 
centage of constancy or Brunswik 
ratio, should be similar despite the fact 
that manifestation of the tendency to- 
wards constancy results in matches to- 
ward the more familiar circular shape 
for the round test object, but, for the 
elliptically shaped test object, may be 
either toward or away from circularity 
depending upon the angle of rotation. 
In Exp. II, the test objects were posi- 
tioned at a fixed angle of rotation such 
that the retinal image for the circular 
test object was always elliptical while 
those for the elliptical test objects were 
always circular. Matches were ob- 
tained for various viewing durations. 
As in Exp. I, the tendency to match 
in terms of the more familiar circular 
shape should be apparent upon com- 
parison of the functions for matches 
tending toward or away from circu- 
larity. 


EXPERIMENT I 
Method 


Subjects—The Ss were undergraduates at 
the university enrolled in introductory psy- 


chology classes who volunteered for the ex- 
periment in order to fulfill their course 
requirements. 

Apparatus.—The basic apparatus consisted 
of a two-channel transparent mirror tachis- 
toscope. In one field the test object was 
mounted on a turntable which could be ro- 
tated at various angles of inclination. In 
the other field, one of a graded series of 
ellipses, drawn on white vellum paper with 
black ink, was visible. This series con- 
sisted of 42 ellipses with axis ratios (ratio 
of minor to major axis) ranging from 0.194 
to 2,282 in approximately equal steps. The 
test objects, as well as the ellipses, were 
illuminated by fluorescent lamps which pro- 
duced an illuminance of 10 ftc, (For details 
of the apparatus, see Leibowitz & Bourne, 
1956.) The test object and the comparison 
field appeared alternately in the tachistoscope 
at viewing distances of 31 and 24 in, respec- 
tively. The test objects were cut from 
3-mm. thick white plastic with a reflectance 
of 62%. The minor axis of all three test 
objects was the same at 1.2 in, and was 
always oriented horizontally and perpendicu- 
lar to the line of sight. The major axis 
of the circular test object was 1.2 in, while 
those of the elliptical test objects were 1.8 
and 24 in. Thus, the two elliptical test 
objects presented axis ratios of 1.5 and 2.0. 

Procedure.—Forty-two Ss matched all 
three test objects at seven angles of inclina- 
tion (rotation around horizontal axis) 10°, 
14°, 23°, 30°, 40°, 60°, and 90° (vertical). 
The order of presentation of the angles was 
determined by a Latin-square design. The 
order of presentation of the test objects was 
counterbalanced, and 14 Ss served in each 
of three groups. The Ss viewed alternately 
either the test object for 1 sec. or the com- 
parison ellipse for 3 sec., and were instructed 
to choose the drawing, by manipulation of 
appropriate knobs, which “looked the most 
like the shape of the test-object." The Ss 
were permitted as much time, ie, as many 
exposures, as was necessary to make a satis- 
factory match. A given testing session was 
concerned with only one of the test objects 
and was completed within 15 min. Sessions 
were spaced 2 days apart. 


Results 


Table 1 presents the mean matched 
axis ratio as a function of the angle 
of inclination for the three test objects. 
These data are plotted in Fig. 1 in 
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Fic. 1. Mean Brunswik ratios as a func- 


tion of angle of inclination for a circular and 
two elliptically shaped test objects. 


terms of the Brunswik ratio or per- 
centage of constancy as a function of 
the angle of inclination. The Bruns- 
wik ratio is used to indicate the extent 
to which the tendency towards con- 
stancy is manifested independently of 
the particular shape of the test object 
in question (Leibowitz, 1956). The 
matches for all three functions, as is 
typical of these studies, represent a 
compromise between predictions based 
on the law of shape constancy and the 
law of the retinal image (Thouless, 
1931). Of particular interest in the 
present context is the fact that the 
three curves produce a similar tend- 
ency toward shape constancy at all 
angles of inclination. 


TABLE 1 


MEAN Marcuep Axis RATIOS AS A FUNCTION OF ANGLE OF TILT FOR A CIRCULAR 
AND Two ELLIPTICALLY SHAPED TEST On JECTS 
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Discussion 


The functions for all three test ob- 
jects are highly similar implying that 
the particular shape of the test object is 
not a variable in determining the relative — 
amount of constancy. Inclinations in the 
vicinity of 42? and 30? are of particular 
interest since at these angles the 1.5 
and 2.0 ellipses project circular retinal 
images. Therefore, if there were a tend- 
ency toward circularity, the responses — 
should be biased toward the law of the 
retinal image in these regions. The im- 3 
plication of the present data is that Sst 
matches are based on the true shape of 
the test object, whether it is circular or 
elliptical, and irrespective of whether 
constancy is manifested by increased or 
decreased circularity of Ss’ matches. 
These results emphasize the importance 
of stimulus cues as opposed to factors 
relating to circularity. f 


EXPERIMENT II 
Method 


Subjects and apparatus.—The same appa- 
ratus and design were used in Exp. Il as 
in Exp. I with the variable of angle of rota- 
tion replaced by exposure duration. Forty-. 
two additional undergraduates served as Ss 

Procedure.—The same three test obj 
were set at predetermined angles of inclina 
tion, ie, 30° for the circle, 30° for 
2.0 ellipse, and 42° for the 1.5 ellipse, SO 
that the projected retinal images of 
two ellipses were always circular while thi 
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ee s 
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taape as $ 
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TABLE 2 


Mean Matcnep Axis RATIOS AS A FUNCTION or EXPOSURE DURATION FOR A 
CIRCULAR AND Two ELLIPTICALLY SHAPED TEST OBJECTS 


Duration of Exposure (seconds) 
Test Object 
01 05 40 E .50 E 1.00 
Circle 
M .567 .626 .638 .100 -746 .168 .182 
SD .148 .188 143 .149 .176 .155 .161 
1.5 Ellipse 
M 1.147 1.180 1.234 1.298 1.358 1.384 1.362 
SD .236 .197 .203 242 .248 .267 .240 
2.0 Ellipse 
M 1.261 1.319 1.354 1.388 1.450 1.473 1.485 
SD 215 .281 .304 .319 .323 .345 .359 


of the circular test object was elliptical with 
an axis ratio of .50. Seven exposure dura- 
tions were employed: .01, .05, .10, .25, .50, 
«75, and 1.00 sec. The comparison field was 
illuminated for a constant duration of 3.0 
sec, when the test object was not visible. 


Results 


The mean matched axis ratios for the 
three test objects as a function of expo- 
sure duration are presented in Table 2, 
and plotted as the mean Brunswik ratio 
as a function of exposure duration in 
Fig.2. It will be noted that all func- 
tions rise rapidly at first, and then 
more slowly, before reaching a limiting 
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Fic. 2. Mean Brunswik ratios as a func- 
tion of exposure duration for a circular and 
two elliptically shaped test objects. (The 
angle of tilt for the circular test object was 
30°, for the 1.5 ellipse, 42°, and for the 2.0 
ellipse, 30°.) 


value at about .50 sec. (Leibowitz & 
Bourne; Leibowitz, Toffey, & Searle, 
1966). At the shorter durations, the 
axis ratios of the matched ellipses lie 
close to the law of the retinal image. 
As duration is increased, the matched 
axis ratios become larger, tending to 
approach the theoretical prediction rep- 
resenting the law of shape constancy. 
The final levels reached for the circle 
and 2.0 ellipse are similar, but that for 
the 1.5 ellipse is higher. 


Discussion 


The results of Exp. II provide further 
evidence that circularity does not play a 
significant role in determining shape 
matching responses. If circularity were 
a factor in this situation, the circular 
retinal image would be expected to bias 
the functions of the elliptical test ob- 
jects toward the law of the retinal image 
since an increase in the tendency towards 
constancy would necessitate “less round” 
matches. Thus, the increase in the tend- 
ency toward constancy as a function of 
duration should be greater for the circu- 
lar than for the elliptical test objects. 
However, constancy is shown with re- 
spect to the true shape of the test object, 
regardless of its projected retinal image. 
It would appear that there is no bias as 
to whether constancy is away from or 
toward circularity. Rather, the tendency 
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towards constancy is in the direction of 
the true shape of the test object. Reduc- 
tion of exposure duration is equivalent, 
functionally, to reducing the cues that 
have been shown to be necessary for the 
perception of the true shape of a test 
object. If circularity were a factor in 
shape constancy, it would be expected to 
manifest itself as exposure duration is 
reduced. As the data indicate, there was 
no generalization of circularity when 
stimulus cues were effectively reduced. 

A greater tendency toward constancy 
was exhibited for the 1.5 ellipse than for 
the 2.0 ellipse. It may well be that spe- 
cific shapes tend to produce more con- 
stancy than do others and that past ex- 
perience with specific shapes, although 
not necessarily circular shapes, is a vari- 
able in shape matching. It is of interest 
to note that Fantz (1965) has re- 
cently reported that human infants ex- 
hibit a preference for observing ellipti- 
cally shaped test objects of approximately 
the same shape as the 1.5 test object used 
in the present study, It is not clear why 
this tendency appears as a function of 
exposure duration but not as a result of 
rotation. 

Tt will be observed that the Brunswik 
ratios in Exp. I are generally higher than 
those in Exp. II obtained under similar 
conditions, i.e., exposure duration of 1.0 
sec, rotation of 30°, 40°, and 42°. 
There is no obvious explanation for this 
discrepancy. However, it may be noted 
that the matched values for the circular 
test object in Exp. I are highly similar 
to those obtained in a previous study 
under conditions identical to those in 
Exp. II of the present study (Leibowitz 
& Bourne, 1956). In view of the marked 
individual differences observed in shape 
matching at the longer durations of ex- 
posure (Leibowitz, Mitchell, & Angrist, 
1954; Leibowitz, Waskow, Loeffler, & 
Glaser, 1959), it might be well to empha- 
size relative rather than absolute com- 
parisons for a given group of Ss. 

The results of the present experiments 
do not provide evidence that familiarity 
with circular objects plays a significant 
role in the determination of shape-match- 
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ing responses. In spite of the fact that 
human beings have extensive past ex- 
perience with circular test objects, it 
would appear that this is not a factor in 
determining perceived shape. The ab- 
sence of such evidence suggests that 
stimulus cues are critically important as 
determiners of perceived shape. These 
conclusions are in agreement with those 
obtained by Thouless (1931) as a result 
of an experiment utilizing circular and 
elliptically shaped test objects in which 
information concerning the true shape of 
the test object was systematically varied, 
as well as with the results of Buytendijk 
and Ratingen (1949). The term coined 
by Thouless “phenomenal regression to 
the real object," or the suggested modi- 
fication by Woodworth and Schlosberg 
(1954), "phenomenal regression toward 
the real object," would seem to describe 
accurately the results of experimental in- 
vestigations of shape constancy. Which- 
ever term is used, the data support the 
generalization that when adequate cues 
are present, Ss tend to preserve the 
invariant features of the test object re- 
gardless of the shape of the test object 
or its angle of inclination. 
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CONSOLIDATION AND RETROACTIVE INTERFERENCE IN 
SHORT-TERM RECOGNITION MEMORY FOR PITCH * 


WAYNE A. WICKELGREN 
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Ss listened to a standard tone for 2, 4, or 8 sec., followed by an inter- 
ference tone lasting 2, 4, or 8 sec., followed by a comparison tone 
lasting 2 sec., followed by a 4-sec. period in which they decided whether 
the standard and comparison tones were the same or different and 
stated their confidence on a scale from 1 to 5. Operating charac- 
teristics were approximately straight lines on normal-normal paper, 
and d' values were computed for each condition for each of 10 Ss. 
The d' value for a condition is a measure of the difference in strength 
of the correct and incorrect comparison tones at the time of the test, 
greater d' meaning more accurate performance. By this measure, 
trace strength increased with longer duration of the standard tone, 
decreased with longer duration of the interference tone, and generalized 


to adjacent tones. 


A simple generalization of the psy- 
chophysical method of successive com- 
parison has appeared to many psychol- 
ogists to be an elegant way of studying 
nonverbal memory. The generalized 
procedure in question is generally re- 
ferred to as “delayed comparison” be- 
cause it takes into account the interval 
between the standard and the compari- 
son stimuli, suggesting experiments 
which vary the nature and duration of 
that interval. After a brief summary 
of what is known about delayed com- 
parison of pitch, the present paper will 
discuss some difficulties involved in 
this type of experiment and propose 
the use of a powerful analytic tool from 
signal-detection theory, the operating 
characteristic. Then, the operating 
characteristic will be used to analyze 
the results of two experiments on de- 
layed comparison of pitch, and various 
theoretical interpretations of the data 
will be discussed. 


1 Supported primarily by Grant MH 088-90 
-01, from the National Institute of Mental 
Health, Public Health Service. Further aid 
was received from a National Aeronautics 
and Space Administration grant, NsG 496, 
to Hans-Lukas Teuber. 


Most studies on delayed comparison 
of pitch have varied the duration of a 
"blank" interval between the standard 
and comparison stimuli. On the whole, 
the results are that accuracy decreases 
with increasing time interval (Angell 
& Harwood, 1899; Bachem, 1954; 
Harris, 1952; Koester, 1945; König, 
1957; Postman, 1946; Wolfe, 1886). 
Anderson (1914) found no significant 
differences in pitch discrimination for 
interstimulus intervals (ISIs) of fe 
4, 4, $, 1, 2, 3, and 4 sec., and Irwin 
(1937) even got a reversal in the re- 
lationship between an ISI of 1 sec. 


and an ISI of 10 sec. As suggested. 


by Harris (1952), experiments in 
which ISI has had very little effect 
are all experiments in which only one 
standard stimulus was used. This is 
extremely conducive to the develop- 
ment of long-term memory (LTM) 
for the one standard stimulus. The 
decline in accuracy with increasing ISI 
appears to be entirely reflected in an 
increasing difference limen with no 
change in the point of subjective equal- 
ity, ie, no time error (Koester, 1945; 
Postman, 1946). , 

Only one study on delayed compari- 
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son of pitch (Postman, 1946) intro- 
duced a retroactive-interference (RI) 
tone into the interval between standard 
and comparison tones. Once again, no 
time error was found, but Postman does 
not report whether the RI tone in- 
creased the difference limen. Surpris- 
ingly enough, no one has ever studied 
the relationship between duration of 
an RI tone and pitch discrimination, 
and this is the first purpose of the pres- 
ent study. 

In standard pitch-discrimination ex- 
periments with brief ISI (less than 1 
sec.), increasing the duration of the 
standard and comparison tones is gen- 
erally found to increase accuracy (An- 
derson, 1914; Békésy, 1929; Turnbull, 
1944), but there appears to be an opti- 
mum duration of about 1 sec. (Kónig, 
1957). The second purpose of the 
present study is to determine if much 
longer durations of the standard tone 
(4 or 8 sec.) are optimal when pitch 
judgments are limited by memory, 
rather than by sensory discriminative 
capacity. 

There appear to be three difficulties 
with delayed comparison as a method 
for studying memory for pitch. First, 
there is the problem of rehearsal dur- 
ing the ISI. Rehearsal of tones (silent 
or vocal) is probably less effective than 
rehearsal of verbal material because of 
the limitations of the speech-motor 
system. Nevertheless, attempts to hum 
the standard tone during the ISI must 
be presumed to have some effect, and 
elimination of this rehearsal would be 
highly desirable. The present experi- 
ment attempts to do this by the obvi- 
ous procedure of filling the ISI with 
an interference tone of variable dura- 
tion. Naturally, this does not permit 
us to distinguish between decay and 
interference as the cause of forgetting, 
but blank intervals do not either. The 
only proof that filled intervals are bet- 
ter than blank intervals is the reliability 
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and simplicity of the results obtained, - 
but the decision to use filled intervals 
is rather well motivated by results in 
verbal memory. 

Second, there is the problem of what 
judgment to use. One might consider 
various subsets containing two or more 
of the following responses: "same," 
"different higher," and "lower." 
Subjectively, there appear to be four 
and only four qualitatively different re- 
actions to a delayed comparison, cor- 
responding to the four responses listed 
above. There is nothing surprising 
about this, except the fact that Ss can 
react to the comparison tone as differ- 
ent from the standard tone without 
reacting to it as either higher or lower 
than the standard tone. 

Since there seem to be four subjec- 
tively different reactions, it might ap- 
pear reasonable to have a different 
response for each reaction. However, 
no theoretical framework exists for 
analyzing these four-category judg- 
ments. For that matter, there is no 
theoretical framework for the three- 
category judgment: “same,” “higher,” 
or “lower,” though a choice model of 
the type suggested by Luce (1959, 
1963) might be developed for it. 
Until recently there was a satisfactory 
method of analysis only for the two- 
category judgment: “higher,” “lower.” 
Now with signal-detection theory and 
the operating characteristic, there is an 
elegant way to analyze the other 
reasonable two-category judgment: 
“same,” “different.” In the latter case 
“different” includes “higher” and 
“lower” reactions as well as the anoma- 
lous “different” reactions that are 
neither “higher” nor “lower.” 

The present study uses “same-dif- 
ferent” judgments rather than “higher- 
lower” judgments, and the choice is 
well motivated. If the four qualita- 
tively different reactions specified above 
really exist, there can be no ambiguity 
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in how S partitions the four reactions 
among the two responses “same” and 
“different.” On the other hand, there 
is considerable ambiguity in the assign- 
ment of “same” and “different” reac- 
tions to "higher" and "lower" re- 
sponses, and unknown response biases 
could influence the results rather mark- 
edly. The only theory that justifies 
the use of "higher-lower" judgments 
in preference to “same-different” judg- 
ments is the theory that there are only 
"higher" and "lower" reactions. Even 
if this theory were correct (which in- 
tuitively it is not), “same-different” 
judgments would provide almost the 
same information as "higher-lower" 
judgments, albeit in a less convenient 
form. When “same-different” judg- 
ments are employed, it is less ambigu- 
ous to refer to the experiment as con- 
cerned with recognition memory for 
pitch. When the retention interval 
(ISI) is less than 1 min., the clear- 
est name for such an experiment would 
seem to be short-term recognition 
memory (STRM) for pitch. This con- 
vention will be observed in the rest of 
the paper. 1 

The third difficulty in delayed com- 
parison studies is concerned with the 
method of analysis. Previous studies 
have not found any simple law relating 
ISI to a measure of the memory trace 
for pitch. In early studies the trace 
measure was the percentage of correct 
judgments in various conditions. In 
later studies the trace measure has 
been the difference limen computed on 
"higher-lower" judgments, Signal-de- 
tection theory suggests an alternative 
analysis of two-category judgments in 
terms of the operating characteristic. 
This analysis can be applied to recog- 
nition-memory experiments in exactly 
the same way that it is applied to de- 
tection experiments (Egan, 1958; Nor- 
man & Wickelgren, 1965) to separate 
the effects of memory-trace strength 
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and response bias on the obser 
judgments. If the assumptions of 
theory are valid for recognition mi 
ory, then the operating characteri: 
yields a measure on an interval 
of the strength of the memory t 
(d'). The theory makes testable p 
dictions about the shape of the op 
ing characteristic, and the first task 
analyzing the present experiments 
to examine the validity of these 
sumptions. But even if the normali 
and equal-variance assumptions o! 
nal-detection theory are not valid. 
recognition memory of tones, so 
measure of the height of the opera! 
characteristic may serve as a relia 
measure of trace strength, indepe 
ent of response bias. The d' value 
sessed at the intersection of the op 
ing characteristic with the nega! 
diagonal (Pollack, 1959) or 

area under the operating character: 
(Pollack, Norman, & Galanter, 19 
have both been suggested as measu 
of trace strength in recognition men 
ory. Thus, the second task of th 
present study is to determine if so 
measure of trace strength has a 
tively simple relationship to the 
tions of the standard and interferen 
tones. 


METHOD 
Experiment I 


Procedure.—On each trial Ss listened 
a “ready” signal, followed by a stand 
tone (S tone) lasting 2 sec., followed 
mediately by an interference tone (I 
of 930 cps lasting 2, 4, or 8 sec., follow 
immediately by a comparison tone (C to 
lasting 2 sec., followed by a 4-sec. period 
which Ss decided whether the S and C t 
were the "same" or "different" and stated 
their confidence in that decision on a rating 
scale from “1” (least confidence) to “ 
(most confidence). : 

Design.—There were 3 I-tone duratii 
(f: —2, 4, and 8 sec.), 3 S tones (800, 8 
and 840 cps), and 3 C-S tone different 
(0, +10, and +15 cps). The zero-differem 
condition occurred as often as both pos 
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difference conditions put together. The de- 
sign was factorial and the order of condi- 
tions was randomized in blocks of 3X 3 X 
4-36 trials. There were four blocks in 
a l-hr. session (144 trials), 3 different ses- 
sions, and 4 replications of each session (12 
sessions altogether). 

Apparatus.—Two Hewlett-Packard wide- 
range oscillators (Model 200CD) and one 
Hewlett-Packard audio oscillator (Model 
201C) were used to produce the pure tones 
for recording on tape. Time intervals were 
controlled by Hunter decade interval timers 
(Model 111-C). The entire experiment was 
recorded on tape and played back over a 
loudspeaker at a comfortable listening in- 
tensity. 

Subjects.—The Ss were five volunteers re- 
cruited through the Massachusetts Institute 
of Technology student employment office. 
They were paid $1.50 per hr. for participa- 
tion in the experiment. 


Experiment IT 


The apparatus and procedure were the 
same as in Exp. I, except that the duration 
of the I tone was held constant at 8 sec. 
and the duration of the S tone (fs) was 
varied from 2 to 4 to 8 sec. The design 
was identical to the design of Exp. I, sub- 
stituting three different S-tone durations for 
different I-tone durations. There were five 
Ss recruited in exactly the same way as in 
Exp. I, with one S participating in both 
experiments (Exp. II first). 


RESULTS 


Form of the operating characteris- 
tics —The confidence-judgment method 
was used to obtain operating charac- 
teristics for each S for each condition 
in the experiment. Discussions of this 
method for determining operating char- 
acteristics in recognition memory can 
be found in several sources (Egan, 
1958; Norman & Wickelgren, 1965). 
In brief, the operating characteristic is 
a plot of the probability of correct rec- 
ognition as a function of the probability 
of false recognition. In the present 
experiments, correct-recognition prob- 
abilities are obtained from conditions 
in which the C tone is identical to the 
S tone (O-cps different) and false- 
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recognition probabilities are obtained 
from conditions in which the C tone 
is 10 or 15 cps greater than the S tone. 
Thus, for each of the five Ss in Exp. I 
we obtain 18 operating characteristics 
(by plotting the 0 vs. 10 and 0 vs. 15 
operating characteristics for each of the 
nine combinations of three S tones and 
3 trs). Similarly we obtain 18 op- 
erating characteristics for each of the 
five Ss in Exp. II. 

If one only obtained “same-differ- 
ent" judgments from an S, there would 
be only one probability value for each 
condition. When a confidence judg- 
ment is also obtained, one can order 
the responses on a unidimensional 
scale from a “most confident same" 
through the less confident same’s and 
different's to a “most confident differ- 
ent.” In the present experiment this 
means 10 responses on a scale: S5, S4, 
S3, S2, S1, D1, D2, D3, D4, D5. By 
assuming that the trace strengths nec- 
essary to elicit these responses are also 
ordered, we can make a cut between 
any two responses and determine the 
probability that S made any of the re- 
sponses to the left of the cut. Thus, 
nine probabilities are obtained for the 
condition, coming from what are pre- 
sumed to be nine different criteria on 
the underlying strength scale. Each 
of these probabilities of being to the 
left of some cut are called “recogni- 
tion probabilities," regardless of where 
the cut is located. Whether the rec- 
ognition probability is correct or false 
depends on whether it is obtained from 
a condition in which the C tone is the 
same as the S tone or different from it. 

When plotted on normal-normal 
probability paper, the 180 operating 
characteristics (for individual Ss in 
particular conditions) appear to be 
straight lines, exactly as required by 
the normality assumption of signal- 
detection theory. A few operating 
characteristics are markedly curvilinear 
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on normal-normal paper, but these de- 
viations from the general rule do not 
appear to be systematic in any way. 
Thus, it is reasonable to suppose that 
there is some (approximately) normal 
variation in either the trace strengths 
or the decision process that maps these 
trace strengths into recognition-mem- 
ory responses. For five Ss, the slopes 
of the best-fitting straight lines do not 
appear to deviate significantly from 
unity, which implies that the variances 
of the underlying normal distributions 
for correct and incorrect tones are ap- 
proximately equal. For the other five 
Ss, the slopes tend to be significantly 
greater than unity, indicating that the 
variance of the trace strength of in- 
correct tones is greater than that of 
correct tones. A selected sample of 
these operating characteristics is shown 
in Fig. 1 and 2. The number of points 
on each curve varies, up to a maximum 
of nine, depending on how many con- 
fidence categories an S chooses to use 
with any frequency. 

If the operating characteristics were 
consistent with both the normal distri- 
bution and equal variance assumptions, 
it would -be economical to summarize 
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Fic. 1. Operating characteristics for indi- 
vidual Ss in Exp. I with an S tone of 820 
cps an I tone of 930 cps, a C tone of 830 
cps, ts — 2 sec, fi—2 sec, and te=2 sec. 


WAYNE A. WICKELGREN 


NORMAL DEVIATE 
2 E LM 


+ +2 


95) 


jn 


a» 
NORMAL DEVIATE 


Pr (CORRECT RECOGNITION) 
FS 


2 


1 e is a L 1 
T5 1$ 74 5 3 95 
Pr (FALSE RECOGNITION) 


Fic. 2. Operating characteristics for indi- 
vidual Ss in Exp. II with an S tone of 
800 cps, an I tone of 930 cps, a C tone of 
815 cps, ts = 8 sec., tı — 8 sec., and to = 2 sec. 


each of them by a single number, d’, 
representing the difference in mean 
trace strength of the correct and in- 
correct C tones in units of the stand- 
ard deviation of the trace-strength 
distributions. Although the equal- 
variance assumption is not valid for 
half of the Ss, it is still quite reasonable 
to determine a single d' value for each 
operating characteristic, provided one 
determines the d' value at a point 
which is not very sensitive to violation 
of the equal-variance assumption. Such 
a point is the intersection of the op- 
erating characteristic with the negative 
diagonal. Tables 1 and 2 show d' 
values for each condition for each in- 
dividual in Exp. I and II, with each 
d' being determined for the point 
where the best-fitting straight line in- 
tersects the negative diagonal. An 
average over all Ss and all standard 
tones is also shown in both tables. 

. Interference tone duration.—Keep- 
ing in mind that larger d' values imply 
better recognition memory for the cor- 
rect comparison tone, it is clear from 
Table 1 that the longer the duration 
of the interference tone, the poorer 1$ 
recognition memory for pitch. Anal- 
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ysis of variance of Ss X Standard 
Tone Frequencies X Interference Tone 
Durations showed the effect of inter- 
ference tone duration on d’ for the 
comparison of C=S and C=S+10 
to be significant at the .001 level, F (2, 
8) = 23.55. A separate analysis of 
variance showed the effect of inter- 
ference tone duration on d’ for the 
comparison of C = S and C=S+15 
to be significant at the .05 level, F (2, 
8) —6.54. Exponential decay of the 
trace strength, d', provides a far better 
description of the facts than linear 
decay, but there is a systematic devia- 
tion from the exponential decay hy- 
pothesis in the direction of too little 
decay from 4 to 8 sec. compared to the 
decay from 2 to 4 sec. The trace 
strengths for pitch may be approaching 
an asymptote above zero strength or 
there may be some less rapidly decay- 
ing traces involved, which are never- 
theless decaying. We cannot distin- 
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guish between these hypotheses on the 
basis of the present data. 

Obviously, it is also not possible to 
distinguish between decay and inter- 
ference theories of the degradation of 
the memory trace for pitch in this 
experiment, and the word “decay” 
should be interpreted as implying noth- 
ing about the mechanism that produces 
the decay. Perhaps the 930-cps in- 
terference tone strengthens the in- 
correct comparison tone more than 
it strengthens the correct comparison 
tone. Perhaps the interference tone 
simply holds S's attention during the 
retention interval and permits a spon- 
taneous decay process to occur, Per- 
haps both occur. The 930-cps inter- 
ference tone was chosen because of a 
guess that it was far enough away so 
that its generalization gradient would 
be close to zero in the region of the 
standard and comparison tones, but 
close enough to prevent simultaneous 


TABLE 1 


DIFFERENCE IN TRACE STRENGTH OF CORRECT AND INCORRECT COMPARISON TONES AS A 
Function OF INTERFERENCE Tone DURATION (Exp. I) 


d'(S, S + 10) d'(S, S +15) 
t1(sec.) tı (sec.) 
S S Tone 
2 4 8 2 4 8 
JE 800 4.40 4.03 3.10 5+ 4.30 4.20 
820 3.26 1.88 1.25 3.76 2.56 1.68 
840 3.28 2.62 2.16 SF 3.63 2.80 
RR 800 2.80 2.16 1.05 3.28 2.62 1.51 
820 2.08 1.72 1.22 3.28 2.35 144 
840 3.14 2.56 1.94 344 3.10 2.56 
JK 800 1.68 1.05 51 2.35 1.83 t 
820 1.54 1.01 71 2.29 1.38 1.01 
840 1.61 1.05 17 1.98 1.90 .88 
AH 800 1.36 141 1.08 1.87 1.83 1.52 
820 61 01 .56 48 -26 51 
840 64 43 37 1.25 1.03 1.16 
DG 800 26 —.05 05 -19 .26 .08 
820 33 BE 08 .69 .33 42 
840 .85 54 60 1.51 1.16 1.16 
Avg. 1.86 1.35 1.03 2.46 1.90 1.45 
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TABLE 2 
DIFFERENCE IN TRACE STRENGTH OF CORRECT AND INCORRECT COMPARISON TONES AS A 
Function or STANDARD Tone Duration (Exp, I1) 


d'(S, S 4- 10) d'(S, S + 15) 
Ig (sec.) Is(sec.) 
S S Tone = ss 
2 4 8 2 4 8 
A8 1.44 1.32 1.16 1.86 2.32 
i 820 30 71 58 54 1.01 1.41 
840 1.64 1.51 2.16 2.80 2.56 3.93 
RR 800 .82 .85 1.11 1.28 1.83 1.52 
: 820 1.05 .85 1.08 1.16 1.61 1.41 
840 1.80 2.03 2.08 2.51 3.20 3.10 
LN 800 „51 17 1.14 17 1.48 1.68 
820 1.02 1.01 1.16 2.03 2.26 1.76 
840 1.16 1.64 1.58 1.48 1.61 2.08 
SH 800 48 17 1.22 1.25 1.05 1.05 
820 56 74 .91 1.05 1.08 1.28 
840 1.05 80 1.19 1.35 1.80 1.72 
KC 800 .08 .06 .09 —.05 26 .28 
820 .30 .69 48 77 17 -66 
840 1.08 1.11 1.01 1.32 1.83 1.90 
Avg. .82 1.00 1.14 1.29 1.61 1.74 


attention to the interference tone and 
the standard tone. If this guess was 
correct, the effect of interference tone 
duration was a relatively pure decay 
effect, but there is no way of deciding 
this matter on the basis of the present 
experiment. 

Standard tone duration—It is clear 
from Table 2 that increasing the dura- 
tion of the standard tone from 2 to 4 
to 8 sec. facilitates recognition memory 
lorpitch. Analysis of variance showed 
that the effect of standard tone dura- 
tion on d' for the comparison of C — S 
and C = S + 10 was significant at the 
01 level, F (2, 8) = 8.67. The effect 
of standard tone duration on d' for the 
comparison of C = S and C 2 S 4- 15 
was significant at the .05 level, F (2, 
8) =6.75. As might be expected, the 
effect of the duration of the standard 
tone was not large, but it was quite 
consistent. Thus, it appears that, up 


to 8 sec., the longer the standard tone 
is present, the stronger the memory 
trace for that tone becomes. Consoli- 
dation of the memory trace for pitch 
occupies a much longer period of time 
than that required to establish the most 
accurate perception of pitch, which is 
on the order of 1 sec. (Kénig, 1957). 

Standard tone frequency—Tables 1 
and 2 also indicate that the three dif- 
ferent standard tones were not equally 
well remembered. Almost all Ss 
showed large and consistent differences 
in the trace strengths for different 
tones, but the differences were not 
always consistent over different SS- 
On the whole, the 840 tone seems to 
have been remembered best with the 
800 tone being remembered slightly 
better than the 820 tone, but there were 
numerous exceptions to the latter re- 
lation. The effect of standard tone 
frequency was significant in two of 
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the four analyses of variance previ- 
ously mentioned, F (2,8) = 1.87, 2.88, 
14.11, and 6.72. 


Discussion 


There are four principal findings in 
the present study. First, the operating 
characteristics in recognition memory for 
pure tones 10-15 cps apart are straight 
lines on normal-normal probability paper. 
Second, filling the interval between stand- 
ard and comparison tones with an inter- 
ference tone produces a very reliable 
inverse relationship between interference 
tone duration and recognition memory 
for pitch. Third, increasing the dura- 
tion of the standard tone from 2 to 4 to 
8 sec. facilitates recognition memory for 
pitch. Fourth, the false-recognition rate 
for tones 15 cps above the standard tone 
is substantially less than the false-recog- 
nition rate for tones 10 cps above the 
standard tone. This is interesting be- 
cause it implies that there is a generaliza- 
tion gradient in STRM for pitch and it 
places a lower bound of 10-15 cps on 
the width of this gradient. 

There are two subsidiary findings. 
First, the memory trace for a standard 
tone appears to be decaying to a level 
substantially above zero, and second, the 
highest tone appears to be remembered 
better than the other tones. The last 
two findings can be given a common 
explanation in terms of relatively long- 
term traces for any characteristics that 
distinguish the set of standard tones from 
the set of comparison tones. The most 
obvious characteristic is tone height, 
since the comparison tones are on the 
average higher in pitch than the stand- 
ard tones. If we make the rather reason- 
able assumption that most Ss develop a 
general idea of the range of standard 
tones, then the incorrect comparison 
tones used for the 840 standard tone 
may be discriminable on this basis. Fur- 
thermore, there may be some gradient of 
long-term trace strength within the range 
from 800 to 840 cps such that lower 
tones have greater long-term trace 
strength than higher tones. This would 
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account for why the d’ values do not ap- 
pear to be decaying to zero, but to some 
positive base strength which is decaying 
more slowly, if at all. 

The observed shape of the operating 
characteristic could mean that the under- 
lying correct and incorrect trace strengths 
are normally distributed, which would 
be the obvious analogy to signal-detec- 
tion theory. However, the shape of the 
operating characteristic could also have 
resulted from a normal variation in the 
locations of the decision criteria for rec- 
ognition-memory responses. 4 

If the variance of a trace-strength dis- 
tribution is small compared to the vari- 
ance in placement of decision criteria, 
then it is possible to consider the under- 
lying trace strength to be a real vari- 
able rather than a random variable. 
The probabilistic character of recogni- 
tion-memory responses is then attributed 
to the decision-response system, rather 
than to the memory system, This is a 
key feature of Luce’s (1959, 1963) choice 
theory. However, it is quite possible to 
use the signal-detection decision rule 
after adding “decision variance” to the 
trace strength, much as Hull, Hovland, 
Ross, Hall, Perkins, and Fitch (1940) 
did by assuming “oscillation of the re- 
action threshold.” ^ Either way, one 
achieves a great simplification in the 
memory model since the laws of trace 
consolidation and decay are assumed to 
apply to a real variable, at a stage before 
that at which the principal source of 
variation exists. It is important to note 
that such a model can be an extremely 
close approximation to reality despite 
considerable variation in the consolida- 
tion or decay of an item in memory under 
constant stimulus conditions, provided 
that this variance is small in relation to 
the decision variance. 

Because of the likelihood that some 
long-term traces are contaminating the 
present findings in STRM for pitch, it 
is not possible to evaluate rigorously any 
mathematical models of memory for 
pitch. However, it is clear that such a 
model must specify that the trace strength 
(in either short- or long-term memory) 
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of the standard tone increases during 
presentation of the standard tone over 
a period lasting longer than 4 sec. and 
decreases during presentation of the in- 
terference tone over a period lasting 
longer than 4 sec, Furthermore, the 
difference in trace strength of the tones 
10 and 15 cps above the standard tone 
makes it necessary to assume that tones 
adjacent to the standard tone get con- 
solidated, to a lesser degree. These find- 
ings are consistent with a large variety 
of models, one example of which is de- 
scribed as follows. 

Let s(S, ts, tr) and s(S +x, lg, tr) 
stand for the strengths in STM of a 
standard tone S cps (lasting ig sec.) and 
an incorrect C tone of S +x cps, respec- 
tively, after an I tone lasting ty sec. Let 
a(S, S+ x, lg, tr) stand for the differ- 
ence between these two strengths in units 
of the standard deviation of the decision 
distribution, The d’ value of the empiri- 
cal operating characteristic is an estimate 
of the true d value. The assumptions of 
the model are as follows: 

1. Initial Trace Strength: s(X, 0, ty) 
=0 for all tones X. 

2. Linear Consolidation: There is a 
consolidation process active during pres- 
entation of a tone S, that acts to increase 
the strength of S and adjacent tones, 
S+#, at a rate, a(x), which is a mono. 
tonic decreasing function of v. 

3. Exponential Decay: There is a decay 
process active at all times which acts to 
decrease the strength of all tones in STM 
at a rate, Bs, which is proportional to 
trace strength. 

The linear consolidation process and 
the exponential decay process are both 
active during presentation of the stand- 
ard tone, so their combined effects are 
precisely described by the following dif- 
ferential equation: 


ds (ts) 
dig 


= a(x) — Bs(tg) [1] 
Under the boundary condition specified 
by the first assumption, this differential 
equation can be solved to yield the fol- 
lowing equation for the increase in 
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strength during the consolidation period: 


sq) = 2) (1 — coe) 


[2] 

During the interference tone the model 
postulates a pure exponential decay proc- 
ess, so the final equation for the strength 
of any particular tone in memory as a 
function of distance from the standard 
tone, v, duration of the standard tone, ty 
and duration of the interference tone, tp 
is as follows : 


s(S + x, ts, tr) 


= a(x) (1 — e-?8)e-?5 [3] 
8 
Because the location of the decision 
criteria is variable and dependent on un- 
known properties of the decision system, 
it is not possible to test predictions of 
the strength of individual tones. Rather, 
we must test predictions of the differ- 
ences in strength of correct and incorrect 
comparison tones, both of which are sub- 
ject to consolidation and decay. These 
expected differences, d, can be compared 
to the obtained differences, d^. Letting 
a(0) =a and a(x) =a’, we obtain the 
following : 


d(S, S + x, ts, tr) 


r 
= S. (1 — etsjeti 
Notice that the equation for d is iden- 
tical in form to the equation for s, and 
the form is quite simple considering the 
fact that consolidation, decay, generaliza- 
tion, and decision processes have all been 
included in some fashion. The model is 
computationally simple and makes quali- 
tative predictions consistent with _the 
facts. However, it must be emphasized 
that the quantitative predictions of the 
model cannot be tested by the present 
experiments. The model should be in- 
terpreted as an example of the rathet 
elegant strength models that can be con- 
structed for STRM of pitch; many other 
models would be equally consistent with 
the present findings. 


[4] 
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STIMULUS GENERALIZATION OF A POSITIVE CONDITIONED 


REINFORCER. 


IV: CONCURRENT GENERALIZATION OF 


REINFORCING AND DISCRIMINATIVE STIMULUS FUNC- 
TIONS FOLLOWING FIXED-INTERVAL TRAINING? 


DAVID R. THOMAS 4x» DONALD V. DEROSA 
Kent State University 


30 naive homing pigeons were trained to peck a _ translucent 
unilluminated key on an FI-1 schedule. 2 sec. before feeding, the key 
was illuminated by a light of 550 mu. Ss were tested in counter- 
balanced succession on 3 values of monochromatic light (550, 530, and 
510 my) programmed according to the original FI schedule. Measures 
of time and responses to extinction, and scalloping were taken. Times 
and responses to extinction showed significant decremental gradients. 
Scalloping was excellent to all 3 stimulus values but tended to be best 
following presentation of the 550-my stimulus. 


Thomas and Williams — (1963) 
trained pigeons to peck at a blank un- 
illuminated key for VI reinforcement 
with each feeding proceeded by a 2-sec. 
exposure of a green light on the peck- 
ing key. Greatest resistance to extinc- 
tion was shown when the original hue 
was programmed on a VI schedule, 
with successively less resistance to ex- 
tinction for hues farther removed from 
the original one, 

On a VI schedule the food serves 
two complementary roles, to reinforce 
the preceding response or responses 
(the reinforcing role) and to serve as 
à cue for returning to the key for addi- 
tional responding (the discriminative 
role)J. In a VI situation a stimulus 
paired with food may acquire both its 
reinforcing and discriminative func- 
tions. Both would have led to in- 
creased resistance to extinction of the 
key-pecking response in the "Thomas 
and Williams (1963) experiment. 

With fixed-interval reinforcement, 
however, the reinforcing and discrimi- 


1 This research was supported by Research 
Grant RO 1-HD-00903-04 under the direc- 
tion of the senior author. Gary Handler, 
William Simone, and Jerry Homick aided 
in the collection of the data. 


native effects of the food work in op- 
position to each other. Following re- 
inforcement the tendency to peck the - 
key is at its lowest ebb, gradually in- 
creasing in strength throughout the 
interval. Thus the discriminative func- 
tion of the food is to produce a scallop 
whereas the reinforcing function is to 
maintain responding. In this study 
we attempted to determine whether a 
stimulus associated with food in ag 
fixed-interval situation would acquire | 
both its reinforcing and discriminative 
roles and further whether both stimu- 
lus functions would generalize lawfully 
to other stimuli as a function of stimu- 
lus similarity. 


METHOD 


Subjects.—The Ss were 30 experimentally 
naive homing pigeons maintained at o 
proximately 75% of their ad lib weigl 
throughout the study. : f 

Apparatus—The apparatus consisted F 
two pigeon chambers described in detail by 
Hiss and Thomas (1963). i : 

Procedure.—The Ss were first trained in 
the apparatus to peck an unilluminated key 
for continuous reinforcement with each 3-sec 
operation of the feeder preceded by a 2-se% 
exposure of a light of 550 my on the E 
key with no responding required during E A 
2-sec. period. The light remained on 10 
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5 sec. the last 3 of which coincided with 
the presence of the food magazine. Fol- 
lowing this Ss were placed on a l-min. 
fixed-interval (FI-1) schedule with the 550- 
mu light again preceding and coinciding 
with each feeding. After 7 days of such 
training in 60-min. sessions, a condition was 
introduced whereby on 2 out of 5 min. 
(determined randomly) the 550-mz stimulus 
was presented but the food was not. The 
purpose of this procedure was to increase 
the resistance to extinction of the conditioned 
reinforcing function of the light. This effect 
of a partial pairing of the primary and 
conditioned reinforcer has been well docu- 
mented by Zimmerman (1957) and others. 
The training continued for another 10 days 
and then generalization testing in extinction 
was initiated. Ten Ss were randomly as- 
signed to each of three groups which were 
tested in extinction with the continued FI-1 
presentation of 550, 530, and 510 my, re- 
spectively. A record was taken of respond- 
ing at the 15-sec., 30-sec., and 45-sec. point 
in each 1-min. interval. The light was 
exposed on the key immediately following 
the first peck after 55 sec. of the interval 
had elapsed. Testing was continued until 
an extinction criterion of 1-min. without a 
single response was met. All Ss then re- 
ceived 1 additional day of training (with 
the 550-mp value) and then were retested 
in extinction with another stimulus value. 
After an additional training session, a third 
and final test was given on the remaining 
stimulus value. The order of testing con- 
ditions was completely counterbalanced. 


Resutts AND Discussion 


The results of this study may be 
reported separately with regard to the 
reinforcing and discriminative stimu- 
lus functions. Indication of the differ- 
ence in the reinforcing value of the 
three different stimuli is seen in Fig. 1. 
In this plot are two measures of resist- 
ance to extinction, time to extinction, 
and responses to extinction. The val- 
ues plotted are the means computed for 
all 30 Ss under each stimulus condi- 
tion. The form of both gradients is 
the same, showing the greatest decre- 
ment between the 530- and 510-mp 
values. Since resistance to extinction 
decreases with repeated testing, an 
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utes and responses to the extinction crite- 
rion, 


analysis of variance was performed 
which extracts variance attributable to 
the testing sequence. Both the dif- 
ferences in time to extinction and re- 
sponses to extinction attributable to 
stimuli proved to be significant, F (2, 
48) = 5.68, p < .01 for time, F (2, 48) 
= 3.64, p < .05 for responses, Thus 
this study utilizing an FI schedule with 
partial pairing of the conditioned and 
primary reinforcers replicates the origi- 
nal demonstration by Thomas and 
Williams (1963) and in addition ex- 
tends its generality. 

With regard to the discriminative 
effect of the monochromatic stimuli, 
the results are equally striking. Cu- 
mulative records indicated excellent 
scalloping to all three monochromatic 
stimuli on all three tests. The meas- 
ure of scalloping analyzed was the pro- 
portion of responses which occurred in 
the first 15 sec. of each 55-sec. re- 
sponse period; the lower this value the 
better the scalloping. Because scallop- 
ing was equally good in all three tests 
the data were pooled across tests and 
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the values obtained were 3.42% for 
550 my, 4.34% for 530 mp, and 5.47% 
for 510 my. The scalloping index var- 
ies in the expected direction, showing 
greatest scalloping (mainly determined 
by longer pausing) following presen- 
tations of 550 mp and least following 
510 mp, Although the overall F was 
not significant, F (2, 58) = 2.19, a t 
comparing the 510- and 550-mp values 
was significant at the .05 level, ¢ (58) 
= 2,08. It is clear that the discrimina- 
tive function of the primary reward in 
fixed-interval training is acquired by 
an associated stimulus and then gen- 
eralizes broadly to other stimulus val- 
ues. An additional confirmation of 
this comes from an observation made 
after this experiment was completed. 
Four of the Ss were retrained for one 
session with 550 mp and then were 
tested in the usual manner but with 
a white light rather than a monochro- 
matic stimulus programmed according 
to the original FI-1 schedule. Al- 
though resistance to extinction tended 
to be low, scalloping was noticeable, 
even with this entirely novel stimulus. 

The finding that scalloping tends to 
be better following presentation of the 
550 mp (ie, pauses are longer) yet 
resistance to extinction is greatest 
under this condition provides addi- 
tional evidence against an interpreta- 
tion of the Thomas and Williams 
(1963) study which refers only to the 
discriminative effect of the monochro- 
matic stimuli. A study by Caronite 
and Thomas (1966) utilizing a “new 
learning” rather than a “resistance to 
extinction” method for demonstrating 
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the generalization of conditioned re- 
inforcement also vitiates against such 
an interpretation, 

The comparison of the slope of the 
generalization gradient across stimulus 
functions can only be suggestive. Al- 
though the dimension of the stimulus 
which is varied in this study is, of 
course, the same for both stimulus 
functions the response measures are 
not. Itis surely possible that another 
measure of scalloping, for example 
latency, would yield generalization 
gradients of greater slope. 

The present study adds to the grow- 
ing body of knowledge regarding the 
stimulus control of behavior. In par- 
ticular it suggests that whatever the 
role (or roles) played by a stimulus 
in its relation to behavior, these roles 
are acquired by other stimuli on the 
basis of association, and further, that 
once this association is formed the ac 
quired functions of the associated stim- 
uli generalize lawfully to other stimuli 
on the basis of physical similarity. 
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ROLE OF STIMULUS COMPARISON IN CHILDREN’S 


DISCRIMINATIO 
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A study was made of the relative difficulty of simultaneous and 
successive presentation of stimuli in a discrimination-learning situation. 


Although simultaneous presentation 


allowed the children to make 


direct comparisons among the stimuli it also decreased the likelihood 
that the stimuli would be perceived as individual cues. The net effect 


was easier learning under successive 


presentation. When the unique- 


ness of the cues was enhanced by having them present at the time the 
response was made, simultaneous presentation led to faster learning. 
Both response-duration data and backward-conditioning curves pro- 
vided evidence that learning occurred in a sudden rather than gradual 


manner. 


The simultaneous discrimination 
problem traditionally involves the con- 
current presentation of two or more 
stimuli which differ along a given 
dimension. The S is required to re- 
spond to one of the stimuli regardless 
of its spatial position on any given trial. 
In the successive problem only one 
stimulus is presented on each trial and 
S learns to make a different positional 
response to each stimulus. A number 
of studies have compared the relative 
difficulty of these two discrimination- 
learning problems with both infrahu- 
mans and children. Studies with infra- 
humans have yielded conflicting results. 
Some experiments (North & Jeeves, 
1956; Spence, 1952) reported that the 
simultaneous problem was easier, while 
others (Bitterman & Wodinsky, 1953; 
Weise & Bitterman, 1951) found the 


1 The writer is grateful to H. M. Landrum, 
Superintendent of Schools, Spring Branch, 
Texas, for his generous assistance in making 
children and facilities available. Apprecia- 
tion is also expressed to the principal and 
teachers at Frostwood, Bendwood, Rummel 
Creek, and Bunker Hill elementary schools 
for their fine cooperation and assistance. 
A paper reporting the results of this study 
was read at the Biennial Meeting of the 
Society for Research in Child Development, 
Minneapolis, 1965. 
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opposite to be true. The findings with 
children are more consistent. Erickson 
and Lipsitt (1960), Jeffrey (1961), 
Perkins, Banks, and Calvin (1954), 
and Rieber (1964) all report that the 
simultaneous problem is learned more 
easily. However, a study by Lipsitt 
(1961) adds an important qualifier to 
these findings in that simultaneous was 
found to be easier than successive only 
in situations where the response was 
made directly to the stimulus source. 
When the apparatus called for a re- 
sponse to be made to a point removed 
from the stimulus, the successive prob- 
lem was learned more quickly. 

One of the difficulties inherent in 
comparisons of the two types of prob- 
lems is that they differ in not one, but 
three respects. First, the simultaneous 
method allows S to directly compare 
the stimuli employed while the succes- 
sive one does not. The importance of 
stimulus comparison has been stressed 
by Lashley and Wade (1946). They 
claim that when direct comparison is 
not possible, discrimination learning 
becomes more difficult because stimulus 
traces must be utilized. Second, the 
two tasks differ in terms of the num- 
ber of relevant stimulus dimensions. 
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In the simultaneous problem .$ has to 
attend only to the particular stimulus 
independent of its spatial characteristics 
whereas solution of the successive prob- 
lem depends upon cue-position pattern- 
ing, and both the spatial and stimulus 
dimensions are relevant. Finally, the 
two problems differ in the number of 
habits which must be learned in order 
to master them. The simultaneous task 
requires that a response be learned to 
only one of the two or more stimuli 
which are presented. With the suc- 
cessive problem, different responses 
must be learned to each stimulus. Just 
what role each of these variables plays 
is not clear. The present study will 
attempt to examine the importance of 
stimulus comparison in a situation 
where the number of relevant stimulus 
dimensions and the number of habits is 
held constant. 


METHOD 


Apparatus—The apparatus consisted of a 
gray metal cabinet 12 X 22 X 14 in. Located 
on the top front of the cabinet was a row 
of three illuminated pushbuttons separated 
from each other by a distance of 3 in. Each 
pushbutton consisted of a translucent 1x? 
in, plastic screen which covered three dif- 
ferent colored bulbs. A small indicator light 
was located 2 in. below the center push- 
button. The apparatus was wired so that 
any one of the three colors could be pro- 
jected on each screen and the stimuli could 
be presented either simultaneously or suc- 
cessively. The method of stimulus presen- 
tation is shown schematically in Fig. 1. 

In the case of simultaneous presentation 
for a given trial all three pushbuttons lit up, 
each with a different color, and stayed on 
for 3 sec. The lights then went off and 2 
sec. later the small indicator light came on. 
This indicator light signaled to S that he 
could then make his choice and push one of 
the three buttons. On the following trials 
the same three colors reappeared but in dif- 
ferent spatial arrangements. 

The successive presentation of stimuli 
took place as follows. On a given trial the 
left pushbutton lit up with a given color 
and stayed on for 1 sec. One and one quarter 
seconds following termination of the left 


MORTON RIEBER 


pushbutton the center one came on with a 


different color and stayed on for 1 sec. One 


and one quarter seconds following that, the 
right pushbutton lit up with the third color 
and stayed on for 1 sec. Two seconds later 
the indicator light came on signaling to § 
that he could make his choice. Under both 
simultaneous and successive methods a total 
Of 3 sec. was devoted to stimulus presenta- 
tion. Also, in both cases S had to withhold 
his response for 2 sec., which was the time 
lapse between the final termination of the 
stimulus and onset of the indicator light. 

The apparatus was also designed to pro- 
vide informational feedback to S. Under the 
Single condition, when .$ made his choice 
and pushed a particular button, the color 
which had previously been projected on the 
screen of that button reappeared and stayed 
on as long as he held the button depressed. 
In the All condition when S pressed the 
button, all three colors reappeared simul- 
taneously and stayed on as long as the but- 
ton was depressed. In the third, or None 
condition, when S pressed the button the 
lights remained off and there was no indi- 
cation as to which color had appeared on 
that button during its initial presentation. 
An elapsed time clock recorded the length 
of time S held the button depressed on each 
trial. Correct responses caused a chime to 
ring. 

Two studies are to be reported. They 
differed only in respect to the similarity 
among the colors employed. In Exp. I the 
colors utilized were red, blue, and green, In 
Exp. II the colors were orange yellow- 
orange, and red-orange. 

Subjects—The Ss for Exp. I consisted 
of 144 fourth-grade children from two ele- 
mentary schools in the Spring Branch, 
Texas, Independent School District. In Exp. 
II Ss consisted of 196 fourth-grade children 
from two other elementary schools in the 
same school system. In both experiments 
Ss were divided equally into six groups and 
a 2X 3 factorial design was employed. The 
simultaneous and successive problems repre- 
sented one factor and the three feedback 
conditions represented the other factor. 

Procedure—The Ss were brought into a 
semidarkened experimental room one at à 
time. The S' was told that he would play à 
game involving pushbuttons and lights. He 
Was seated in front of the apparatus and 
told that the pushbuttons would light up 
with different colored lights and that this 
would be followed by illumination of the 
indicator light. He was to wait until the 
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Fic. 1. Schematic representation of the two methods of stimulus presentation. 


latter came on and then push one of the 
pushbuttons ; when he made a correct choice 
a chime would ring. He was also advised 
that after a while he would be able to figure 
out how to make the bell ring every time. 

One of the three colors was arbitrarily 
designated as correct for each S. This 
assignment was made on a random basis 
with the exception that in Exp. II use of the 
color red-orange as a reinforced stimulus 
was discontinued after 70 Ss had been tested, 


This was done because it became apparent 
that there was both a strong initial pref- 
erence and a distinctiveness associated with 
this color, resulting in unusually rapid 
learning when it was the correct stimulus. 

The Ss were run for a minimum of 10 
trials or to a criterion of 6 consecutive cor- 
rect responses. Training was discontinued 
if Ss failed to reach this criterion within 42 
trials. The same procedure was used in 
both Exp. I and Exp. II. 
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REsuLTS 


The number of trials to criterion was 
used as one measure of discrimination 
learning. An analysis of variance was 
carried out on these data. Type of 
stimulus presentation—whether simul- 
taneous or successive—constituted one 
factor in this analysis, The second 
factor was the amount of stimulus feed- 
back provided at the time that S 
pressed the pushbutton. The third 
factor compared the role of stimulus 
similarity as it differed in the two ex- 
periments. Similarity of colors had no 
effect on speed of learning the discrimi- 
nation task under any of the conditions. 
Evidence for this lies in the fact that 
there was no significant main effect for 
experiments, F (1, 330) — .22, nor did 
that factor interact significantly with 
method of stimulus presentation, F (1, 
330) — 1.00, or amount of feedback, 
F (2, 330) = 1.90. In view of these 
findings the data for the two studies 
were combined in the remaining analy- 
Ses. The mean numbers of trials to 
criterion for the two experiments com- 
bined are given in Table 1. 

Two effects were significant. The 
amount of stimulus feedback, F (2, 
330) = 6.44, p < .001, and the Feed- 
back x Method of Presentation interac- 
tion, F (2, 330) = 3.64, p < .05. The 
latter indicated that the effect of in- 
formational feedback was dependent 
upon the manner in which the stimuli 
were presented. Separate analyses of 
variance comparing None with Single, 
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TABLE 2 


PERCENTAGE OF Ss WnHo FAILED TO 
REACH CRITERION 


Feedback Conditions 
Method of 
Presentation 
All Single None 
Simultaneous 12.3 14.0 49.1 
Successive 28.1 28.1 31.6 


and None with All yielded significant 

Feedback X Presentation interactions, 

F (1, 224) = 4.90, p <.05; F (1, 224) 

= 5.75, p < .05, respectively. Under 

no stimulus feedback the successive 

presentation resulted in faster learn- 

ing, but with either Single or All con-- 
ditions simultaneous presentation was 1 
superior. In a comparison of Single 

vs. All the Feedback x Presentation 

interaction was not significant, F (1, 

224) = .05. However, the main effect 

of presentation was reliable, F (1, 224) 

= 422, p < .05, which demonstrates 

that under either of these conditions 

the simultaneous method led to faster. 

learning. 

Table 2 contains the percentage of ! 
children who failed to reach criterion in 
each of the subgroups for both experi- 
ments combined. The relationship 
among the groups is identical to that 
found with the other measure of learn- 
ing. These data were analyzed by 
comparing the three conditions of stim- 
ulus feedback, two at a time, using à 


TABLE 3 


MEAN RESPONSE DURATION (IN SECONDS) ON 
TRIALS PRECEDING START OF 
CRITERION RUN 


Feedback Conditions 


Method of 
Presentation 
All Single None 
Simultaneous | 14.2 15.6 26.3 
Successive 18.8 19.4 20.7 


Feedback Conditions 


Type of — 
Response 
All Single None 
Correct 0.59 0.59 0.54 
Incorrect 0.74 0.78 0.66 
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method suggested by Snedecor (1956, Feedback x Method of Presentation 
p. 338) for evaluating the significance — interaction is significant with the Sin- 
of interactions in binomial data. Just  gle-None comparison and the All-None 
as in the trials to criterion data, the comparison (p< .001). There was 
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Fic. 2. Mean response duration on trials preceding and during criterion run. 
(Those Ss who began their criterion run before 5 trials or who failed to reach 
criterion in 42 trials were excluded.) 


268 


no significant interaction in the All- 
Single comparison. 

The measure of response duration, 
as determined by length of time that 
the child held the button depressed, 
provided some interesting data. Table 
3 presents the mean duration on correct 
as compared to incorrect responses for 
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each of the three feedback conditions, 
These means are based on the average 
response duration on all trials preced- 
ing the criterion run for each S. Only 
Ss who made at least five responses 
before starting criterion were included 
in these data. Regardless of feedback 
condition, response duration during in- 
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five trials immediately preceding 


(Percentage of correct responses on the trial preceding cri- 
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correct responses was longer than dur- 
ing correct responses. 

Although response duration was 
clearly related to whether or not the 
response was correct, there was reason 
to suspect that it was also related to 
S’s utilization of the information con- 
tained in stimulus feedback. If so, then 
the duration should increase gradually 
as the child approached criterion and 
drop sharply once he had reached cri- 
terion or learned the task. 

Response durations on each of the 
five trials preceding the start of cri- 
terion and on each of the six criterion 
trials are presented in Fig. 2 For Ss 
in the All and Single conditions under 
both simultaneous and successive pres- 
entations, instead of a gradual decrease 
the durations rise sharply on the trial 
preceding start of the criterion. In an 
analysis of four groups combined, the 
duration on Trial —1 is significantly 
longer than on Trial —2, ¢ (114) = 
2.89, p < .01. 

The failure to find a gradual increase 
in response duration but instead a sud- 
den increase on the trial preceding the 
criterion run, suggested that learning 
itself was sudden rather than gradual. 
In order to check this out a tabulation 
was made of the percentage of correct 
responses on the five trials preceding 
criterion. These data, which are pre- 
sented in Fig. 3, clearly indicate the 
sudden manner in which learning took 
place. 

Discussion 


The procedure used in this study was 
such that it required S to withhold his 
response for 2 sec. following offset of 
the stimulus lights. It was necessary 
to introduce this response delay in order 
to evaluate the two methods of stimulus 
presentation under conditions where 
everything but the opportunity to make 
direct comparisons was held constant. It 
may be that response delay itself inter- 
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acts with method of stimulus presenta- 
tion. The only data available on this 
point is the experiment by Perkins, 
Banks, and Calvin (1954) which re- 
ported that a 5-sec. response delay had 
no differential effect on the learning of 
the traditional simultaneous and succes- 
sive problems. 

Because of the delayed-response method 
employed in this study, responses were 
not made in the presence of the actual 
stimuli but were made to stimulus traces. 
For the successive group, the colors 
came on one at a time and the traces 
probably consisted of three separate 
stimuli (colors). This tended to em- 
phasize color as the relevant dimension. 
For the simultaneous group, however, 
the trace could be regarded as a single 
configuration. composed of three colors 
side by side. If responses were made 
to such a configuration there would be 
confusion on the part of S as to just 
which stimulus dimension was relevant, 
as well as an increased tendency to at- 
tend to the irrelevant positional dimen- 
sion or some complex color-positional 
cues, Although simultaneous presenta- 
tion provided S with the opportunity to 
make direct comparisons among the three 
colors, this would be of little benefit if 
he failed to perceive them as unique 
stimuli, which apparently was the case 
with the simultaneous-None group. 

Under single-feedback conditions the 
pushbutton that S chose lit up when he 
responded. Presence of the single color 
cue at the time of the response empha- 
sized the uniqueness of the colors, Once 
the stimuli were attended to in this man- 
ner the simultaneous group could take 
advantage of the opportunity to make 
direct comparisons among the stimuli. 
This may account for the superior per- 
formance of the simultaneous group under 
the single-feedback condition. 

An assumption which has been made 
so far is that without feedback, Ss 
learning under simultaneous were less 
likely to perceive the stimuli as three 
distinct colors. In view of this, it may 
be argued that under the All condition, 
the simultaneous group should not have 
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been helped, as all three stimuli were 
presented together at the beginning of 
each trial and at the time the response 
was made and therefore were seen as a 
configuration both times. It seems more 
likely, however, that S was attending to 
the particular button he was pushing 
and the color projected on that button. 
The fact that the performance of simul- 
taneous groups is essentially the same 
under the All condition as under the 
Single condition provides some evidence 
for this. 

Response duration was found to be 
longer on incorrect responses than on 
correct ones. A sharp increase in dura- 
tion was also found on trials preceding 
the start of criterion. It is important 
to note that by definition this trial in- 
volved an incorrect response for all Ss. 
In view of the finding that incorrect 
responses were longer in duration it is 
possible that this explains the longer 
duration on the trial preceding criterion. 
There is, however, a control group con- 
tained in these data; the group labeled 
None which received no stimulus feed- 
back. If the rise on the trial immediately 
preceding the start of criterion was due 
to the fact that all responses were in- 
correct at that point, then durations 
under the None condition should also 
Show an increase on this trial. They did 
not. The fact that durations increased 
on the trial immediately preceding the 
criterion run suggests that the greatest 
utilization of the information provided 
by the feedback occurs on that trial. 

A recently reported study by White 
and Plum (1964) measured eye move- 
ments during children’s discrimination 
learning. These investigators found that 
there was a sharp increase in movements 
during the four or five trials immediately 
preceding the start of criterion, They 
also compiled a considerable amount of 
evidence to indicate that discrimination 
learning is not a gradual process. The 
Present findings of increased response 
durations immediately preceding learn- 
ing and White and Plum’s similar find- 
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ings with eye movements lend support 

to Zeaman and House's (1963) notion 

that discrimination learning is a function 
of the child attending to the relevant 
stimulus dimensions and their compo- 
nents, 
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VERBAL DISCRIMINATION REVERSAL AS A FUNCTION OF 


OVERLEARNING AND PERCEN 


TAGE OF ITEMS REVERSED * 


COLEMAN PAUL 
Adelphi University 


To study overlearning effects upon discrimination reversal, Ss received 


either high or low degrees of trainin, 
and then received reversal training. 


g on a verbal discrimination task 
In reversal training either 25, 50, 


75, or 100% of the original 12 pairs were reversed. Results suggest a 
nonmonotonic effect of percentage of reversal which interacts with 


degree of training. 


Reid (1953) reported that overlearn- 
ing trials facilitated discrimination re- 
versal in rats. His report has been 
followed by a number of experiments 
which were designed to reproduce and 
to elucidate his findings, While some 
investigators have found a facilitating 
effect of postcriterial trials (PCT) 
upon reversal, others have not (Paul, 
1965). Experiments wtih human Ss, 
however, suggest that with the excep- 
tion of “pre-verbal” children (Gollin, 
1964), PCT facilitates both discrimina- 
tion reversal (Youniss & Furth, 1964) 
and concept reversal (Caul & Ludvig- 
son, 1964; Erlebacher & Archer, 1961 ; 
Ludvigson & Caul, 1964). 

The mechanism(s) by which PCT 
facilitates reversal learning has not yet 
been specified, though the majority of 
reports seem to rely on the notion that 
the PCT increase the discriminability 
between original and reversal learning 
conditions, hence facilitating at least the 
initiation of the reversal shift. Differ- 
ences between overtrained and control 
groups in discriminating shifts in ex- 
perimental conditions are typically re- 
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at the Eastern Psychological Association 
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Walter for their assistance in various phases 
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sponse inferred; that is, inferred from 
measures of the extent to which pre- 
shift behavior perseverates during post- 
shift conditions. In cases where the 
overtrained group makes fewer per- 
severative responses than the control, 
the magnitude of the difference often 
seems too small to account for total 
differences between groups in reaching 
reversal criterion. Within the context 
of animal learning Paul (1965) sug- 
gested that reversal learning may be 
conceptualized as consisting of at least 
two phases; first is the extinction of the 
original response and second is acquisi- 
tion of the new response. 

The present experiment examines the 
two-phase notion of reversal learning 
under conditions of overtraining and 
partial reversal. The Ss were pre- 
sented with 12 pairs of items until the 
correct member of each pair was 
learned. Following either 0 or 50% 
postcriterial trials discrimination-re- 
versal conditions were instituted. For 
various subgroups of Ss, 100%, 75%, 
50%, or 25% of the 12 items received 
reversed-reinforcement contingencies. 


METHOD 


Subjects —One hundred and sixty Adelphi 
University students served as Ss to fulfill 
an introductory psychology course require- 
ment. Any S that did not reach original 
criterion within 50 trials was eliminated and 
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another S was run. There were six such 
replacements. 

Procedure—Lists: The following pairs of 
words were presented for 3 sec/pair, in 
serial order, on a Stowe memory drum: 
fast-slow,  big-little, rough-smooth, black- 
white, hot-cold, hard-soft, fat-thin, few- 
many, dim-bright, far-near, square-round, 
and sweet-sour. There were four random 
orders of the 12 pairs and each member of 
a pair appeared equally often as the right 
and left member. For original learning, one 
member of each pair was designated, by 
random selection, to be reinforced (correct). 
A row of asterisks separated the random 
orders. 

All Ss were given instructions appropriate 
for the verbal discrimination task; they 
were informed that the “correct” word was 
arbitrarily selected and that if they guessed 
correctly E would say “correct,” but if 
they guessed incorrectly E would not say 
anything. The Ss were assigned so that 
each cell contained n Ss before any cell 
contained »--1 Ss. The Ss in the criterion 
group learned to a criterion of one errorless 
trial and then received one of the four 
percent-reversal treatments. Overlearning 
group Ss achieved the one errorless-trial 
criterion in original learning and then re- 
ceived 50% additional trials. If the cri- 
terion trial was an odd number (eg. 9) 
then overtraining was taken as 50% of the 
next higher even number (eg. 10). 

The experimental transition from original 
learning to reversal learning was continuous 
(e.g, no lists had to be changed) and sim- 
ply started with the appearance of the first 
item on the next order of the list. No 
indication of the onset of reversal condition 
was given. Correct items in reversal were 
determined as follows: For the 25% reversal 
condition three randomly selected items of 
the 12 items received reversed-reinforcement 
contingencies (e, the formerly incorrect 
member of a pair, when given as a response 
was followed by nonreinforcement—in re- 
versal, it was reinforced). In the 50% re- 
versal condition three items, in addition to 
the three reversed in the 25% condition, 
were reversed. In the 75% condition three 
more randomly selected items from the 
remaining six were reversed, and in the 
10076 reversal condition all the items were 
reversed. 


RzsurrTs 


Although the trials and errors to cri- 
terion data were positively skewed for 
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each of the subgroups, analyses of vari- 
ance were used. 

Simple randomized-design analyses 
of variance indicated that the eight sub- 
groups of the reversal condition did not 
differ with respect to trials to original 
criterion, F (7, 152) < 1, or with re- 
spect to errors to original criterion; F 
(1, 152) <1. The trial F's ranged 
from 11.7 to 16.7 and the error 2's 
ranged from 39.4 to 64.1, Analysis of 
trials to reversal criterion suggested 
the Overlearning and the Control 
groups did not differ; F (1, 152) = 
3.18,.05 < p < .10. The respective 7's 
were 9.59 and 11.61. The main effect 
of Percentage of Items Reversed was 
significant; F (8, 152) = 172 p< 
001. The z's for the 25, 50, 75, and 
100% conditions were 8.83, 15.90, 
12.65, and 5.02, respectively. The in- 
teraction was not significant. 

In the analysis of errors to reversal 
criterion the main effects of the Over- 
learning-Control groups and Percent- 
age of Items Reversed were each sig- 
nificant, F (1, 152) = 7.98 and F (3, 
152) = 21.76, p<.01 and p< .001, 
respectively. The interaction was also 
significant, F (3, 152) = 4427, p< 
001. The error mean square was 583. 
Figure 1 suggests that the Control 
group made more errors than did the 
Overlearning group under all but the 


70: 
LJ 
E so. 
o 
2 
$ «o 
H 30 
* 
20 
w 
2s 30 75 100 
PERCENTAGE ITEMS REVERSED 
Fic. 1. Mean number of errors to reversal 


criterion, 


OVERLEARNING AND VERBAL DISCRIMINATION REVERSAL 


REVERSED ITEMS 


X NO. ERRORS PER ITEM 


273 


NONREVERSED ITEMS 


PERCENTAGE ITEMS REVERSED 


Fic. 2. Mean number of errors, per item, in reversal learning for 


reversed and nonreversed items. 
figure.) 


25% reversal one, where the two 
groups are virtually equal. 

One might argue that the nature of 
the Percentage-Items-Reversed varia- 
ble is such that the probability of errors 
expected in one condition (e.g, the 
25%) is less than that in another (e.g., 
the 50%). But if this argument com- 
mits one to some simple uniprocess 
position than it is contraindicated by 
the nonmonotonicity of both the trials 
and errors to reversal-criterion data. 

The errors to reversal criterion can 
be broken down into errors on re- 
versed items and on nonreversed items. 
Figure 2 indicates that as the percent- 
age of items reversed increases up to 
75% there is an increase in the num- 
ber of errors on the nonreversed items. 
However, on the reversed items, errors 
decreased after a 50% reversal. It 
should be noted that the 25% and 50% 
reversal groups made more errors on 
the reversed items than on the non- 
reversed items; while the 75% group 
made more errors on the nonreversed 
items than on the reversed ones. This 
finding suggests that the 75% group 
responded as if a 100% reversal shift 
were in effect. Such responding would 
of course decrease the errors to the 


(Ns for each point are given in the 


reversed items and increase the errors 
to the nonreversed items. 

To assess whether or not overtrain- 
ing had an effect early in reversal 
training, analyses were performed on 
the total number of errors on the first 
reversal trial and on the first four re- 
versal trials combined. In both analy- 
ses the Percentage of Items Reversed 
was the only significant effect (p < 
.01 in each analysis). The rank order 
of the means for the four percentage 
conditions was similar to the order for 
total errors (see Fig. 1). This re- 
sponse measure is not a satisfactory 
indicant of perseverative responding be- 
cause it includes errors on nonreversed 
items as well as on reversed items. A 
more sensitive indicant of persevera- 
tive responding is the number of times 
that the originally correct responses 
are given successively to the reversed 
items, or more simply, the number of 
trials to the first correct response to 
each of the reversed items. For each 
S the mean of the trials on which 
reversed items were first emitted cor- 
rectly was determined. For the analy- 
sis reported the data for four Ss in the 
100% reversal condition (two over- 
learning and two control) were omitted 
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because of extreme deviancy. The n 
for this condition is 36 rather than 40. 
Analysis of variance indicated that the 
only significant difference was attrib- 
utable to Percentage of Items Re- 
versed; F (3, 148) = 49.8, p < .001. 
The #’s for the 25, 50, 75, and 100% 
reversal groups were 4.09, 2.94, 1.91, 
and 1.61, respectively. The decreas- 
ing order of the means suggests that 
the smaller the percentage of items 
reversed, the greater the number of 
trials to the first correct response to 
the reversed items. 


Discussion 


The results suggest that under some 
conditions PCT facilitate verbal discrim- 
ination reversal. In the absence of an 
appropriate control group it cannot be 
stated unequivocally whether the facilita- 
tion represents positive transfer, reduced 
negative transfer, or some combination 
of the two. In any case the data may be 
examined for clues to the locus of the 
facilitating effects of PCT, 

The analysis of perseveration responses 
suggests that the smaller the percentage 
of items reversed the greater the number 
of perseveration errors, There are at 
least two interpretations possible for this, 
One is that the percentage of items re- 
versed affects the discriminability of the 
onset of shift conditions; and second is 
fhat discriminability of the shift onset 
does not differ with percentage of items 
reversed but that the greater the per- 
centage of nonreversed items the stronger 
the generalized tendency to give the old 
correct responses, This latter interpre- 
tation is similar to notions of interlist 
differentiation. If the first interpretation 
holds then the number of PCT given 
did not seem to affect the discriminability 
of the shift onset. The second interpre- 
tation would not be inconsistent with the 
perseveration data. 

The facilitating effects of PCT could 
not be attributed to the first phase (ex- 
tinction of old responses) of reversal 
learning, but rather seems to be in the 


COLEMAN PAUL 


second or postextinction phase. Many 
accounts of the PCT facilitation effect 
postulate the acquisition, during PCT, of 
a response which transfers to and facili- 
tates reversal learning (e.g., Reid, 1953); 
Paul (1965) speculated that in at least 
some reversal studies PCT decreased 
negative interference rather than in- 
creased positive transfer. In the present 
study the Overlearning and Control 
groups made the same number of errors 
to the first correct reversal response, 
Yet the two groups differ with respect 
to the total number of errors. Hence, it 
appears that differential interference (re- 
sponse oscillation) following the initial 
Correct response accounts for the differ- 
ence in total errors between the two 
groups. Manipulation of interference 
effects by introducing a temporal delay 
(retention interval) between original and 
reversal learning might be a more direct 
test of the reduced interíerence notion 
(see Youniss & Furth, 1964). 
Examination of the Percentage of Items 
Reversed variable suggests that there was 
systematic conceptual mediation factors 
in reversal learning. For the Overleart- 
ing group in the 100% reversal condition 
the median number of trials to reversal 
criterion was two. The typical S in this 
cell made about six consecutive errors 
on the first reversal trial, made no fur- 
ther errors on that trial and reached 


criterion on the next trial. That $ never - 


erred on several items during revers 
learning suggests that reversal learning 
was conceptually mediated rather than 
accomplished by “trial and error.” The 
previously mentioned observation that the 
7595 reversal group made more errors 
on the nonreversed items than on the 
Teversed ones seems consistent with b 
notion that conceptual mediation playe 
a role in reversal learning. Under the 
75% reversal condition such mediation 
would facilitate learning the reverse 
items and interfere with “learning” the 
nonreversal ones. 

This result suggests that at least tw? 
Potential sources for positive and nega 
tive transfer were present and diffe 
tially so for the various percentage % 
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reversal conditions. One source was as- 
sociative due to the association within 
pairs and the other was “conceptual” due 
to the relation, antonymity, among the 
pairs.? 

Interpair relations, like other asso- 
ciative relations, may lead either to facili- 
tation or interference depending upon the 
characteristics of the transfer task. A 
more general assumption, which could 
have the same effect, is that Ss learn a 
"concept of reversal" The data suggest 
that the greater the percentage of items 
reversed the more likely it is that Ss will 
form a concept of reversal and reverse 
all of their responses. This assumption 
does not require that pairs of items bear 
a verbalizable relation to one another. 
In the absence of nonantonym pairs the 
above suggestions are quite speculative. 


2 When the relation between the members 
of a pair is easily verbalized (e.g, an- 
tonymity) and is the same for a number of 
such pairs, there may be interference on 
a transfer task which calls for intralist dif- 
ferentiation of the pairs (e.g, as in the 50% 
reversal condition). If this conjecture has 
empirical validity, then the use of certain 
mixed-list designs may interact with type of 
transfer task—perhaps in paired-associate as 
well as in discrimination learning. 
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In any case, the PCT did not seem to 
effect conceptual activity. 
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TRANSFER OF VERBAL DISCRIMINATIONS BASED ON 


DIFFERENTIAL REWARD MAGNITUDES! 


W. K. ESTES 
Stanford University 


A group of 40 human Ss was first given training in a modified verbal 
discrimination situation in which the 2 stimuli in each pair had dif- 
ferent associated reward values, then was tested on various new 
combinations of the training stimuli. On the basis of a final series of 
single-stimulus tests, gradients were constructed for the distributions 
of probabilities that Ss would tend 


reward values upon presentation of 
gradients, together with a scanning 
erated relatively accurate prediction: 


test combinations. 


to predict each of the possible 


each of the stimuli singly. These 
model for choice responding, gen- 
5 of the proportions of choices to 
A comparison of these data with those from a 


similar experiment conducted with monkeys by Meyer, LoPopolo, and 
Singh (1966) tends to Support a common interpretation of the 


operation of reward. 


The purpose of this study was to 
obtain a relatively direct check on cer- 
tain assumptions of a Scanning model 
proposed by the writer (Estes, 1962) 
for the interpretation of effects of dif- 
ferential reward magnitudes on multi- 
ple-choice learning. The gist of the 
theory can be expressed in two princi- 
pal assumptions: (a) In any situation 
in which different responses are fol- 
lowed by different rewards with fixed 
probabilities, an S’s tendency to expect 
a given reward O; following any par- 
ticular response 4; is assumed to be 
learned according to the same process 
that governs acquisition of overt pre- 
dictive responses in simple probability 
learning (or response members of 
items in ordinary  paired-associate 
learning). (b) Upon presentation of 
the stimulus at the beginning of each 
trial, S is considered to scan (implic- 
itly) the set of available response 
alternatives, generating for each alter- 


1The work reported here was supported 
in part by the Personnel and Training 
Branch, Office of Naval Research. Repro- 
duction in whole or in part is permitted for 
any purpose of the United States Govern- 
ment. 


native a prediction of the reward that 
Will be received if the response is made 
and finally making the response which 
he predicts will yield the largest 
reward. 

Preliminary application of the model 
to data from probability-learning ex- 
periments with different monetary pay- 
offs for correct and incorrect responses 
yielded rather promising results (Es- 
tes, 1962). Further elaboration of the 
model will be needed before one can 
expect to handle the full range of : 
experiments of this type, but a more 
Pressing problem is to obtain addi- 
tional sources of evidence concerning 
the hypothesized process of scanning 
implicit predictive responses. To this 
end, a modified verbal discrimination 
Situation has been contrived in which 
a different reward is associated with 
each member of each stimulus pair. 
After Ss have learned always to choose 
the high-reward member of each of à 
number of stimulus pairs, test trials are 
given in which the stimuli are re-paired 
in new combinations and finally test 
trials on which the stimuli are pre- 
sented alone and Ss are required to 


276 


DISCRIMINATION TRANSFER AND REWARD MAGNITUDE 


attempt to recall the associated reward 
values. It is anticipated that the single- 
stimulus tests will eventuate in gen- 
eralization gradients around the true 
reward values, and that with these 
gradients in hand it will be possible by 
means of the scanning model to predict 
a priori the choice proportions to vari- 
ous test pairs. Further, by appropriate 
choice of reward values for the original 
training series, it is possible to arrange 
matters so that in some instances pre- 
dictions from the scanning model are 
quite opposed to those that would arise 
from theories assuming that rewards 
function in a more direct manner as 
modifiers of response strength or habit 


strength. 


METHOD ? 


Subjects and apparatus.—The Ss were 40 
undergraduates who participated in order to 
fulfill a course requirement in elementary 
psychology. 

The apparatus comprised a panel shielding 
the stimulus materials from S except during 
exposures and a sectional rack, located on 
E's side of the panel, to hold the stimulus 
cards so that they could be shown in any 
order prescribed by the protocol for a given 
S. Stimulus materials were 30 4X6 in. 
white index cards, each having two non- 
sense syllables typed on the face. The non- 
sense syllables used were: DAX, FIH, MOJ, 
GUV, TYB, and CEF. 

Design and procedure.—The 40 Ss were 
assigned to 10 subsets of 4 Ss each. For 
each subset the six nonsense syllables were 
assigned to three pairs and the six training 
cards presented each of these pairs in each 
of the two possible left-right arrangements. 
The 24 test cards for each subset of 4 Ss 
included the 12 remaining possible pairings 
of the six syllables, again each pair appear- 
ing in each of the two left-right arrange- 
ments. The assignments of reward values 
of one, two, and three points to syllables 
were determined randomly for each subset 
of 4 Ss, but always according to the follow- 


2 Joan SeBreny assisted in conducting this 
experiment, which was conducted in the psy- 
chological laboratories of Indiana Univer- 
sity during the academic year 1961-62. 
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ing paradigm, in which for convenience the 
syllables are represented by capital letters. 


Reward 


3 
1 


Stimulus Pair 


"O0 unm mp 
Eas no 


At the beginning of the training series 
it was explained to S that he would see 
a series of cards each containing two non- 
sense syllables and that each syllable had 
a point value. On each trial S was to 
select one of the two syllables and read 
it aloud, choosing the one that he thought 
would pay off the larger number of points. 
The S was told, 


The object is for you to accumulate as 
many points as possible. The length of 
the session depends on your ability to earn 
points by learning the best response to 
each card. As soon as you have reached 
the necessary point total, the experiment 
will be over. 


The training proceeded with a series of cy- 
cles, each of the six training cards appearing 
once in each cycle in a random order, the 
randomizations being different for each trial 
and different for each subset of 4 Ss. The 
training continued until S met a criterion 
of four consecutive errorless cycles (an 
error being defined as a choice of the lower- 
paying alternative on any card). On each 
trial E exposed the stimulus card; as soon 
as S indicated his choice of syllable, E gave 
the value of that syllable, then removed the 
card, recorded the response, and selected the 
card for the next trial Except in rare in- 
stances stimulus exposures varied from 1 to 
3 sec. and trials ran off at a rate of about 
six per minute. 

After meeting the training criterion, S 
was given the test series on new combina- 
tions, being shown each of the 24 test cards 
once in a random order. A new random 
order was determined for each subset of 
4 Ss. At the beginning of these tests S 
was told simply, "Now we will continue in 
the same manner except that I will not tell 
you how much the syllable you pick is worth 
on each trial" Finally S was given the 
single stimulus tests, being presented with 
each stimulus syllable singly with instruc- 
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tions to indicate how many points he be- 
lieved it to be worth. 


RESULTS 


Choice proportions for each of the 
12 new pairs of test stimuli are pre- 
sented in Table 1. Since the syllables 
of each test pair were presented twice, 
once in each left-right position, each 
entry of the table is based on a total 
of 80 observations. To begin with the 
salient qualitative features of these 
data, it is apparent first of all that 
Whenever the two members of a test 
pair had been associated with differ- 
ent reward values during training, Ss 
made the appropriate choice (that is 
the one associated with the high-reward 
value) with high probability. When 
the high-reward member of one pair 
was combined with the low-reward 
member of another pair, the appro- 
priate choice probabilities approached 
unity, and, when both members of a 
new pair had been the high-reward 
members of their original pairs, though 
with different absolute values (A vs. 
C or B vs. C), the choice proportions 
were a little lower. A substantial dec- 
rement appeared when both members 

of a new pair were the low-reward 
members of the original pairs (eg., 
D vs. E). 

The second main type of test brings 

together stimuli with the same absolute 


TABLE 1 


CHOICE PROPORTIONS (ROW OVER CoLuMN) 
ON Test TRIALS FOR HUMAN Ss 


Observed Theoretical 

B CDE EF BCDE 
A 54 .98 .98 — 1.00 | .52 .98 99 — 1, 
B .90 — .98 1.00 93 — .97 39 
c 81.85 — 81.85 — 
D 462 .78 61.85 
E -60 46 
Note.—Training payoffs: A:E = 3:1; B:D = 3:2; 

C:F = 2:1, 
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reward values. The result for C vs. 
both of which carry a reward val 
of two but the former of which was 
high and the latter the low-rew 
member of its original training p 
will probably not prove incomp 
with expectations from any type 
theory. The most obvious interpr 
tion might be simply that Ss iñ 
type of test were choosing the stim 
which they had selected and avoid 
the one that they had not selec 
during the final training trials. 
alternative interpretation will be 
below which fits better with the r 
of the other two types of tests on 
reward stimuli. On the A vs. B t 
there was only a slight and nonsig 
cant deviation of choice probabi 
from .5 in favor of the former, 
though A had been paired with 
stimulus carrying only one unit where 
B had been paired with a stimul 
carrying two units of reward, A la 
deviation from equality, and one 
may seem to many to be in a no 
intuitive direction, occurs in the ¢ 
of E vs. F. Both of these stim 
carried one-unit rewards during ral 
ing but E had been paired with 
stimulus carrying three and F wit 
a stimulus carrying only two; tht 
one might expect any sort of "contra: 
effect" or "frustration effect" to ope! 
ate in favor of the latter. f 

The interpretation of the test result 
in terms of the scanning model is base 
on the results of the single-stimulu: 
test, Shown in Table 2. Each row i 
the table gives the proportions of c 
in which Ss recalled the reward value 
associated with the given stimulus dt 
ing training to have been one, two, 0f 
three; thus for example, in the case 0 
Stimulus A (which had actually 
tied three-unit reward during 
ing) Ss never gave its reward 
on the test as one, but gave it as 
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TABLE 2 


PROPORTIONS OF REWARD PREDICTIONS ON 
SiNGLE-STIMULUS TRIALS 


Reward Value 

Stimulus 

1 2 3 
A (3:1) 0 125 875 
B (3:2) 050 .075 .875 
C (2:1 .050. .825 .125 
D (2:3 425 475 .100 
E (1:3 575 .325 .100 
F (1:2 .800 .200 0 


in the proportion .125 of cases and as 
three in the proportion .875 of cases. 
In all instances the correct value is 
given with highest probability, and 
generalization gradients around the 
correct value align as would be ex- 
pected on the basis of learning rates 
on the three training problems. 

For Problems C vs. F, B vs. D, and 
A vs. E, mean total errors (choices of 
the low-reward alternative) during 
training were 118, 89, and 54, and the 
trials beyond which no errors occurred 
were 19, 9, and 7, respectively. Since 
Ss were choosing the higher-paying 
member of each pair quite uniformly 
during the later training trials, there 
was little opportunity for forgetting 
of the associations of these cues, A, B, 
and C, with their reward values. Con- 
sequently the entries in the first three 
rows of Table 2 peak sharply at the 
correct reward values. 

Since the low-reward stimuli of the 
three training pairs were selected much 
less frequently than the high-reward 
members during training, and not at all 
during the final four criterion cycles, 
we should expect the probabilities that 
the correct reward values would be 
associated with the low-reward stimuli, 
D, E, and F, to be much lower than 
for A, B, and C, and for generalization 
to be greater. This expectation is 
borne out very clearly for Stimuli D, 
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and E, less so for F, the low-reward 
member of the training problem which 
was learned most slowly. 

The values in Table 2 are used for 
predictions of the double-stimulus test 
results by means of the scanning model 
in accordance with the following for- 
mula 


s c(d +e) + bd 
4" 1 = (ad + be + d) 


where py denotes the probability that 
Stimulus i is chosen over Stimulus j 
on a test trial and the entries a-f on the 
right-hand side denote the entries in 
the corresponding two lines of Table 2, 
in accordance with the following nota- 
tion 


1 2 3 
i a b c 
j d e f. 


The numerator of the expression for 
py is simply the probability that S, 
when considering the stimulus pair i: j 
and generating predictions of the re- 
ward values associated with each mem- 
ber, will tend to predict that i carries 
a higher reward value than j; the 
denominator is 1 minus the probability 
of a tie. The theoretical values com- 
puted by means of the formula are 
entered in the right-hand side of Table 
1. The gist of the theoretical vs. ob- 
served comparison would seem to be 
that for all three of the high-reward 
stimuli, A, B, and C, the predictions 
could scarcely be improved upon; that 
for Stimulus D, the low-reward mem- 
ber of three vs. two, the predictions are 
quite satisfactory; and that for the 
comparison of the lowest reward (one 
unit) stimuli, E vs. F, the model, 
rather than intuition, is qualitatively 
correct. The only significant disparity 
between theoretical and observed val- 
ues occurs in the case of the last com- 
parison, the preference for the cue 
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formerly paired with a three over the 
cue formerly paired with a two being 
somewhat overpredicted. 


Discussion 


These data provide independent sup- 
port for two conclusions concerning the 
mode of operation of reward in human 
learning that had been suggested by two 
independent preceding studies: (a) Over 
a series of trials on which Ss receive 
different rewards following different 
members of a set of response alternatives, 
associations are formed between stimuli 
and reward values according to the same 
process governing simple paired-associate 
learning (Keller, Cole, Burke, & Estes, 
1965). (b) On test trials given after 
such learning has occurred, S scans, 
covertly, the available response alterna- 
tives to a stimulus, generating an implicit 
prediction of the reward value associated 
with each stimulus-response combination, 
then makes overtly the response which 
carties the highest anticipated reward 
value (Estes, 1962). 

A major limitation on the evidence pres- 
ently available in support of the scan- 
ning model for reward learning is that 
all of the relevant studies had been con- 
ducted with human Ss in situations per- 
mitting the formation of verbal associa- 
tions. To have some basis for judging 
the generality of the model with respect 
to functioning of reward in nonverbal 
learning, it would be extremely desirable 
to secure data from analogous experi- 
ments conducted with animal Ss. With 
this end in view, several attempts have 
been made in the Writer's laboratory to 
conduct experiments parallel in design to 
the present one but using rats in standard 
discrimination learning situations; all 
have failed utterly owing to the difficulty 
of training rats in Several concurrent dis- 
criminations without use of punishment 
for incorrect responses, Fortunately, the 
capacities of monkeys in this regard are 
not so severely limited and through the 
cooperation of Donald R. Meyer it has 
been possible to secure relevant data 
from an experiment paralleling the pres- 
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ent one in the major features of training 
and testing.* 

In Meyer's study, eight monkeys were 
first trained to discriminate between four 
pairs of stimuli with procedures paral- 
leling those of the verbal discrimination 
experiments so far as possible. However, 
the monkeys were given forced trials to the 
stimulus not chosen on regular training 
trials, so as to equate their overall experi- 
ence with each alternative, The pairs of 
reward values for the monkeys (calibrated 
in currants rather than points) were 8: pt 
8:4, 4:2, and 2:1. After the monkeys 
had met a relatively strict criterion of 
discriminative performance they were 
given test trials on new combinations of 
the stimuli in the same fashion as the 
human Ss of the present experiment, 
except that in the case of the monkeys 
the various stimuli continued to carry 
their old reward values on the test trials. 
The one limitation in Securing a com- 
pletely parallel experiment was that there 
seemed to be no feasible way of con- 
ducting single-stimulus tests at the end 
comparable to those on which the human 
Ss were asked to recall the reward values 
associated with the various individual 
stimuli. 

Since no data were available for the 
monkeys comparable to those of Table 2 
for use in predicting responses to the 
double stimulus test trials, the hypo- 
thetical gradients of reward-prediction 
probabilities in Table 3 were constructed 
by starting with the values in Table 2 
and modifying these on the basis of 
rough rules of thumb derived from the 
model and overall inspection of the 
monkey data. For a first approximation 
each row of Table 3 was filled in simply 
by taking the entries from the most 
nearly comparable row of Table 2 and 
distributing these over the appropriate 
columns, so that the first approximation 
in the top row was to enter .125 under 
four and .875 under eight, for the second 
Tow .05 under two, .075 under four, .875 
under eight, etc. Then all of the values 
RUDI S 

* Personal communication; see Meyer, Lo- 
Popolo, and Singh (1966). 
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TABLE 3 


HYPOTHETICAL PROBABILITIES OF REWARD 
PREDICTIONS ON SINGLE-STIMULUS 
TRIALS BY MONKEYS 


Reward Value 

Stimulus 

1 2 4 8 
A (8:1)| 0 0 10 90 
B(8:4)| 0 04 06 90 
C (4:2)| 0 10 80 10 
D (4:8)| 0 10 | .80 10 
E (2:1)| .05 ‘80 | .15 0 
F ae 15 75 10 0 
G(1:8)| .60 30 | 10 0 
H (1:2) | .70 30 0 0 


were adjusted slightly to take account 
of the observation that overall the mon- 
keys' responses on test trials showed a 
more precise carryover of appropriate re- 
sponding from training to testing on the 
basis of absolute reward values. That is, 
it appeared that if the monkeys could 
have been asked to respond to single 
stimuli they would have yielded steeper 
gradients of generalization around the 
correct reward values than did the human 
Ss. Thus, for the monkeys the peaks of 
the generalization gradients in the rows 
of Table 3 are in general higher than 
for the human Ss. The only appreciable 
deviation from this trend occurs in the 
case of the 2:1 combination, for which 
the monkeys did not transfer responses 
on the basis of absolute reward values 
quite as precisely as did the human Ss. 
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It should be emphasized that the particu- 
lar numerical values in Table 3 should 
not be taken too seriously; all that can 
be said is that we have constructed a 
table which conforms to the spirit of the 
model and in which the overall pattern 
is quite similar to that of the human Ss 
except for the generally steeper generali- 
zation gradients on the part of the mon- 
keys. In any event, using the values of 
Table 3 for purposes of prediction in 
order to gain a rough idea of the degree 
to which double-stimulus choices of the 
monkeys conformed to expectations on 
the basis of the model, we have calcu- 
lated the theoretical values, shown in 
the right-hand side of Table 4 for pur- 
poses of comparison with the observed 
proportions of choices to the various test 
combinations. The principal conclusions 
to be drawn from Table 4 would seem 
to be, first, that the monkeys responded 
more precisely in terms of transfer of 
response according to absolute reward 
values than did the human Ss and, sec- 
ond, that the overall pattern conforms 
to expectations on the basis of the scan- 
ning model, 

Table 5 provides a more compact com- 
parison of the performance of human 
Ss with that of monkeys, the choice 
proportions being averaged for all tests 
on each main type of comparison, When 
the two members of a test pair differ in 
absolute reward value, the cases repre- 
sented in the first four rows of the 
table, the monkeys exhibit striking preci- 


TABLE 4 


CHOICE PROPORTIONS (Row OVER CoLuMN) on Test TRIALS BY MONKEYS 


Observed Theoretical 
B ©: D E F G H B Ç D E F G H 
A APES E E E AS P 50 72— .99 51 .99 :99 1:00 1.00 — 1.00 
B .95 — 1.00 1.00 1.00 1.00 95  —  .99 1.00 1,00 1.00 
G 46 26.92) 4199-5 .99 :90 599398113 rh ee 991511200 
D 95 ..95 ..99 1.00 SO a A00. 
E 05 85 m 1608 5.86  — 
F 19. 88 .80 .93 
G 74 70 


Note.—Training rewards—A: 


G =8:1; B:D = 8:4; C; F = 4:2; E:H = 2:1. 
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TABLE 5 


COMPARISON or CHOICE DATA ON TEST 
TRIALS FOR HUMAN SS AND 
Monkeys 


X—¥ x-x |y -y | Human | Monkeys 


T m us 960 979 
-938 -964 

ch ji t -698 .922 

ze 73. 5e zx -950 

AEE r 2s, |, ae 

* m hi es $ 

= a = -400 -260 


Note,—X from X: X' ys, Y from Y : Y", 
Gr) zn - v) 


Sion in responding appropriately even 
when choice of the higher valued stim- 
ulus on a test involves selecting the one 
that had been avoided during training 
and rejecting the one that had been se- 
lected during training. This last type 
of test, shown in the fourth row of the 
table, was unfortunately not included in 
the design for the human Ss, but on the 
basis of the results in the third row for 
cases in which the two unequal-valued 
members of a test pair both were the 
avoided members of their training pairs, 
there can be little doubt that the human 
Ss would have fallen distinctly short of 
the monkeys had such a test been given. 
We have no basis for judging whether 
this difference in performance may be 
simply a result of the human Ss having 
learned the discriminations more rapidly 
and thus had less training on stimulus- 
reward associations involving the low- 
reward members of training pairs. Pos- 
Sibly the human Ss would in any event 
tend to transfer more than the monkeys 
on the basis of the relative values asso- 
ciated with stimuli during training rather 
than on the basis of absolute reward 
values. The substantial difference in 
choice proportions for the two types of 
Ss in Row 5, for the test pairs involving 
stimuli of equal absolute value but with 
one having been the high and the other 
the low-reward member of its training 
pair, rather suggests the latter interpreta- 
tion. The counterintuitive prediction 
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from the scanning model for the test in- 
volving the two stimuli with minimum 
absolute rewards, shown in the last row 
of the table, is confirmed even more 
strongly for the monkeys than for the 
human Ss. 

Finally, let us consider the extent to 
which the transfer data could be handled 
by formulations alternative to the one 
specifically under test, It seems clear, 
first of all, that traditional formulations 
of the law of effect are inadequate simply 
owing to a lack of specificity; that is, 
no predictions at all can be generated for 
some of the kinds of comparisons avail 
able in this study. Further, in most 
variations of law of effect or reinforce- 
ment theory, predictions about the trans- 
fer tests would have to be made in terms 
of carry over of approach and avoidance 
tendencies built up during training. A 
conception of this sort may seem to be 
supported if we look at certain test com 
binations. For example, in the human 
S experiment the high-reward member 
of a 2:1 pair was chosen over the low- 
reward member of a 3:2 pair on transfer 
tests over 80% of the time. Thus one 
might say that in the new situation $$ 
continued to approach the stimulus. that 
they had learned to select during training 
and to avoid the one they had learned 
to reject during training. Similar tests 
with the monkeys were inconsistent: The 
animals preferred the high member x 
2:1 pair to the low member of a 4 
Pair, but on the other hand chose the 
low member of a 8:4 pair slightly more 
than half the time over the high member 
ofa 4:2 pair. Much more strongly nega 
tive for an approach-avoidance notion 8 
the result that the monkeys exhibited 7 
95% preference for the low member 0 
an 8:4 pair over the high member of 4 
2:1 pair on transfer tests. : 

The elaboration of reinforcement the- 
ory by Spence (1960) and his associates 
in terms of concepts of habit strength, 
incentive, and anticipatory goal E 
frustrative responses, could doubtless 
modified to handle our transfer results: 
However, some of the principal concepts 
of that type of theory, as presently for 
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mulated, are not particularly in harmony 
with the present data. For example, one 
would surely expect stronger conditioned 
frustration responses to the low member 
of a 3:1 pair than to the low member 
of a 2:1 pair and to the low member of 
an 8:1 pair than to the low member of 
a 2:1 pair; yet human Ss and monkeys, 
respectively, chose the former over the 
latter in each case with significantly 
greater than .50 probability on transfer 
tests. Similarly, notions of frustration 
effect, contrast, and habit strength all 
would lead to the expectation that the 
high member of a 3:1 pair would be 
chosen over the high member of a 3:2 
and the high member of an 8: 1 would be 
chosen over the high member of 8:4 on 
transfer tests; yet the observed results 
on both human Ss and monkeys deviated 
insignificantly from chance (the observed 
proportions being .54 and .52, respec- 
tively). On the other hand, the ideas 
developed by Logan (1965) concerning 
the interpretation of behavior under dif- 
ferential reward as a variety of decision 
process seem qualitatively in harmony 
with the present findings. The scanning 
model which accounts for the quantita- 
tive details of these data goes beyond 
Logan's formulation principally in offer- 
ing a more detailed account of the way 
in which decisions are generated in par- 
ticular transfer situations. 

An important question which remains 
largely unanswered is that of just what 
response is acquired when the organism 
appears to have learned to predict that 
a particular reward value will follow a 
particular response. In the case of the 
human Ss, there seems no reason to 
doubt that appropriate verbal associations 
are formed much in the manner of ordi- 
nary paired-associate learning. However, 
verbal associations are evidently not nec- 
essary, for the monkeys exhibit transfer 
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on the basis of absolute reward values 
even more precisely than the college stu- 
dent Ss. Translating one’s account of 
the animal’s behavior into terms such as 
anticipatory-consummatory responses ap- 
pears cumbersome and somewhat implau- 
sible; it is hard to imagine that, for 
example, the different anticipatory-con- 
summatory responses associated with 
different numbers of currants by the 
monkeys would be sufficiently sharply 
differentiated to mediate transfer behav- 
ior with high precision. There appears 
to be no simpler account of the behavior 
available than to say that the animal 
learns to anticipate a specific reward 
stimulus following a particular stimulus- 
response combination. In conventional 
terminology, we appear to be led toward 
a theory involving stimulus-stimulus, 
rather than stimulus-response, condition- 
ing together with the scanning model for 
response selection. 
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LEARNING AND TRANSFER IN THE MONKEY AS A 
FUNCTION OF DIFFERENTIAL LEVELS OF INCENTIVE* 


DONALD R. MEYER, MILDRED H. LoPOPOLO, AN» DEVENDRA SINGH 
Ohio State University ? 


Monkeys were trained on 4 concurrent visual discrimination problems 
in which reinforcement ratios were 1:2, 2:4, 4:8, and 1:8 pellets. 
The cues employed were then re-paired, and transfer measures taken, 
and the outcomes are at variance with expectations based upon an 
intraproblem contrast theory of the value of incentives. 


This investigation is concerned with 
the effects of differential levels of in- 
centive upon the establishment of sim- 
ple visual habits and their transfer in 
the Rhesus monkey, 


METHOD 


Subjects and apparatus—The Ss were 
eight young, test-wise, female Rhesus mon- 
keys (Macaca mulatta mulatta). The basic 
apparatus was a WGTA equipped with a 
test tray with three foodwells. The dis- 
criminative cues were eight English letters 
of eight different colors that were made 
of wood and mounted upright on thin 3.5-in. 
Squares of masonite. These squares served 
as foodwell covers, but themselves. could 
not be pushed aside by pressure on their 
edges; rather, they were fitted. into U-shaped 
slots of metal which enforced responses to 
the cues (cf. Meyer, Treichler, & Meyer, 
1965). 

Procedure —The animals initially were 
given training with a set of four concurrent 
discrimination problems in which all alterna- 
tives were always reinforced, but with dif- 
ferent numbers of dried currants. The ra- 
tios of the quantities were 8:4, 4:2, 2:1, 
and 8:1. The Specific quantities were 
chosen as a set of reasonably equidistant 
values (Fay, Miller, & Harlow, 1953; Har- 
low & Meyer, 1952). 

On a given training trial, a given pair of 
letters first appeared above the outside food- 
wells. When the monkey made its choice, 
the test tray was retracted and the forward 
Screen was briefly lowered; next, the letter 
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and incentive that had not been chosen 
presented in the midposition. This was do 
to equalize the animals' exposures to he 
different levels of incentive, and was deem d 
essential since the reinforcement context 
powerfully affects incentive value (Lea j 
1958; Meyer, 1951; Schrier, 1958). Ina 
daily session there were 20 standard trials 
and 20 single-letter presentations. There 
were 5 of each of these for each concurrent 
problem, which were intermixed in rando p 
order. The positions of the letters on thë 
standard trials were governed by a Geller 
mann series. 

Training trials continued until 5 had met 
criterion of 18/20 responses to the letters 
associated with the two incentives in 
four concurrent training problems. The 1 
trials involved in this were 5-trial blocks 
presented over 4 successive days of Le 
Afterwards, 5 days of overtraining Es 
were given prior to a re-pairing of the i 
ters. There were 24 such possibilities in a 
and each appeared within a transfer er i 
the remaining 16 trials within a transit i 
session were devoted to initial pairings 
Each of the re-pairing sessions was set 3 | 
by two retraining sessions with initial bs l 
ing, and the study ended when each anim | 
completed 10 of the re-pairing transfer ses 
sions. 


Expectations 


With respect to learning of the E | 
Concurrent problems, we expected 8 di | 
to be superior to the other three cond! g 
tions and that these would be acquir 
at roughly equal rates. But, because 
incentive values are a function of b 
Set of quantities available, we did E 
expect to find that these relationships 
would be apparent in the early stages: 


DISCRIMINATION TRANSFER AND REWARD MAGNITUDE 


That is, monkeys have to learn that 
there are different payoffs for ostensi- 
bly the same behaviors if these differ- 
ent payoffs are to be reflected in the 
levels of performance they support 
(Meyer, 1951). Therefore, we pre- 
dicted that, at first, the learning curves 
would not be ordered with respect to 
ratios, but that in the final stages the 
performances with 8:1 would rise 
above the rest. 

With respect to choices when the 
letters were re-paired, we thought that 
contrast would determine outcomes in 
those situations where initial values of 
the re-paired letters were equivalent. 
Thus, re-pairing of two letters with the 
value 1 from problems 2:1 and 8:1 
should lead, from our conception, to 
preponderant selection of the letter 
from the former pairing. Similarly, 
letters of the value 2 that first were 
paired with letters of the value 1 
should be preferred to letters of the 
value 2 that first were paired with let- 
ters of the value 4. Here, however, 
and where 4’s were re-paired after 
having first been paired with either 8’s 
or 2’s, we predicted that effects of con- 
trast would be less than in the 1:1 
comparison. 

We were not so confident concern- 
ing our predictions for the re-pairs of 
unequal value. Nonetheless, it seemed 
to us improbable that changes in the 
values due to training ratios would be 
so large as to override a two- or three- 
step difference in incentive quantities. 
Hence our expectation was that ani- 
mals would tend to choose the proper 
letter in those re-pairs in which letters 
of the value 4 were paired with 1’s, 
and 8’s were paired with either 1’s 
or 2's. 

With respect to re-pairs in which 
quantities were different by a single 
step along the scale, we expected some 
reasonably frequent choices of the 
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lower-valued letters. Generally, our 
supposition was that these would be 
related to how much the training con- 
trast would reduce the differences be- 
tween incentive values as established 
by the quantities. For example, errors 
made in 2:1 re-pairs where 2 was 
paired with 4 and 1 with 2 should be 
less common than the errors made 
when 2 and 1 had earlier been paired 
with 4 and 8. Furthermore, within 
the set of 2:1 re-pairs, we thought the 
fewest errors would be made when 
letters with the value 2 that had been 
paired with 1’s were re-paired with 
the 1’s from 8:1. 

With respect to other one-step re- 
pairs, we predicted that the fewest 
errors would be made in 4:2 re-pairs 
in which the 4 was paired with 8 and 
2 was paired with 1. Further, since 
the shifts of values in this set should 
lead to smaller final differences in 
value, we expected that the monkey 
would make more mistakes in these 
than in the 2:1 re-pairs. Finally, the 
8:4 re-pairings were expected to pro- 
duce the most divergent outcomes, with 
the order of conditions from the best to 
worst being 8(1) vs. 4(8), 8(1) vs. 
4(2), and 8(4) vs. 4(2). 


RESULTS 


The initial training was accom- 
plished in from 1.9 to 5.2 mean daily 
sessions in addition to the 4 successive 
days required for presentations of cri- 
terial trials. There were no significant 
effects of training ratios upon rates of 
learning of the habits, but, as shown in 
Table 1, the 8:1 condition was supe- 
rior during overtraining. This same 
table gives performance levels for the 
four initial problems during transfer 
testing; in this phase, the animals con- 
tinued to respond appropriately to the 
training pairs. 

Table 2 presents the nature of re- 
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TABLE 1 


PERCENTAGE OF CORRECT RESPONSES 
DURING OVERTRAINING 


Ratio 
Cond. 


2:1 | 4:2 | 8:4 | 8:1 


95.5 | 94.0 | 95.5 | 100,0 
98.9 | 98.1 
98.3 | 98,5 | 98.9 


—— asl 
Initial Overtraini 
Transfer Session Trials 
Interposed Overtraining 


sponses made by individual Ss to those 
re-pairs that involved two letters that 
were reinforced with equal quantities 
of currants during the initial learning 
sessions. Here our expectations were 
that animals would choose 1(2) over 
1(8), 2(1) over 2(4), 4(2) over 
4(8), and 8(1) over 8(4). They were 
mot confirmed except for 2(1) vs. 
2(4), and the largest difference ob- 
tained was that for 1(2) vs. 1(8), 
where the choices ran directly opposite 
to the prediction. 

"The data for the unmatched re-pairs 
where incentives differed by two steps 
or more in quantity can readily be 
summarized because the entire group 
of monkeys made but four mistaken 
choices in 880 presentations. As we 
thought, the errors were more frequent 
when the re-pairs differed by one 
quantitative step, but, again, the error 
tates were very low indeed, so low 
that we will merely give the totals, 


TABLE 2 


NUMBER oF RESPONSES TO THE CUES IN 
MarcHED RE-PAIRS 


S 110-16) | 207-26) | 4(2)-4@) 8(1)-8(4) 
PTS ESO GERE 


1) 2 8|8 2|2 BIER 
3] SWS 8:172! |» vor wig 7 38 
5 EGSASUUIEA4. 6 Ag Gli 5.005 
5| 4 6| 6 4 A TOT. TT 
GS suma Sensing 5| 8 2 
18| 1,9 ]|07. 3 5.5 1119 
19! 0 10| 6 4 3. 5| 8 2 
J} 0 10| 6 4| § 5 872 
Z| 21 59| 52 28 | 37 43 | 42 38 
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In the 2:1 re-pairings, 39 mistakes 
appeared within 240 re-pairs, and 
there was no evidence that these mig 
takes related to the nature of training 
ratios. In the 4:2 and 8:4 compari 
sons, the error rates were even les 
than this, namely, 9 and 14 errors for 
the 8:4 and 4:2 re-pairs, respectively, 


Discussion 


Errors for the unmatched re-paif 
were not numerous enough for prope 
tests of some predictions, The results 
of 1(2) vs. 1(8) cannot be interp tet 
in terms of contrast theory, but they af 
exactly as had been predicted by Estes 
(1966) prior to their collection. We do 
not consider it appropriate for us to 
to summarize his thinking for him; i 
will give his reasons and related hi KE 
data in another paper in this journal. 

# Personal communication, W. K. Estes 
966. 
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SYMMETRY IN PAIRED ASSOCIATES * 


HARALD R. LEUBA 
Johns Hopkins University 


In this study, experimental conditions are established in which it is 
possible to separate the effort required for element learning from that 
required for the learning of associations per se. 110 Ss in 5 groups 
learned a list of 8 paired associates. Each of the several measures of 
performance employed indicated (p <.10 to p «.00001) that groups 
required to practice bonds in both directions were required to learn 
more than groups required to practice bonds in only 1 direction. The 
relative magnitudes involved suggest that unidirectional practice 
produces backward associations perhaps 70-9096 as strong as forward 
ones. These results are consistent with a view which separates 


associations from recall or availability. 


The central issue in the controversy 
surrounding the relative strength of 
“forward” and “backward” associates 
has been one of controlling “availa- 
bility,” i.e., of controlling the amount 
and kind of experience S has with all 
the stimuli. If S is asked to practice 
A>B, A>B, AB,... his per- 
formance on a B — A test can be a 
function either of his failure at having 
learned «— or of his inability to pro- 
vide Element A. If S is asked to 
practice A — B, B> A, A B, B> 
A,... the increased time required to 
learn may be due to having to learn 
to use A elements, or it may be due to 
S's having to learn the "extra" back- 
ward bonds. 


iThis research, during which the author 
was a National Science Foundation Grad- 
uate Fellow, is based on a dissertation sub- 
mitted to Johns Hopkins University in par- 
tial fulfillment of the requirements for the 
PhD degree. The author is grateful to 
James Deese for his sensitive guidance and 
tactful criticism and to him and to Stewart 
Hulse for their responsible reading of sev- 
eral manuscript versions. Computer time 
and card punching were supported by funds 
from a research grant (MH 06550-03) from 
the National Institute of Mental Health, 
Public Health Service. Permission is granted 
for reproduction, translation, publication, use, 
and disposal in whole and in part by or for 
the United States Government. 
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The study reported here was struc- 
tured so that these ambiguities could 
be settled. Four experimental groups, 
and a control group (added as an after- 
thought) were used to investigate the 
relationships between availability (as 
Asch has invoked it), recall (as usually 
measured), and association. 


METHOD 


Subjects—The Ss were 110 male under- 
graduates at the Johns Hopkins University. 
Materials —Sixteen minimum association 
value CVC trigrams were used (Glaze, 
1928). Each syllable was used once in each 
of four eight-pair, paired-associates lists. 
Each S was given a 32-page booklet on 
each page of which all 16 of the syllables 
were written eight times. Table 1 is a 
typical panel from a page in the booklet. 
The syllables were written in a different 
order on each page (balanced for all possi- 
ble pairs), but in the same order in each 
of the eight panels on the page. A space 
placed next to each syllable was for the 
possible entry of a response by S. It was 
on the left for one half of the Ss and on 
the right for the other half. 
Procedure.—The S's task was to learn an’ 
eight-pair list of paired associates, The tri- 
grams were presented to S via audio tape 
as C—V—C at a rate of one letter per 
sec. The S was required to hear a syllable, 
to anticipate ? its successor, to scan his book- 


2This process may not be literally fol- 
lowed. An S may choose to use the scan- 
ning operation to recognize the proper non- 
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TABLE 1 


TYPICAL PANEL FROM THE RESPONSE 
BookLET FoR Groups III anp IV 


——KYH  ——MEQ -——CEF  ——YUD 
—VAF  ——JIC  ——sy —~zyG 
——WUH  ——QOB  ——LAJ  ——PYB 

TOV DAX GOQ XIP 


Note.—Eight panels per page; 32 pages per booklet. 


let for some syllable, to write a syllable, 
and to hear the tape (E's voice) read the 
second syllable of the pair, 5 sec. after the 
first syllable. The tape then announced 
"next pair" (2-sec. interpair interval) and 
S moved down the page to the second panel 
of syllables. This process continued until 
all eight pairs were presented, whereupon 
the tape announced “turn the page” and a 
new trial began (12-sec. intertrial interval). 
All Ss were run for 15 training trials and 
2 test trials (presentation of all eight pairs 
defined a trial). 

Experimental design—The Ss were sepa- 
rated into four experimental groups (24 Ss 
each) and a control group (14 Ss). The 
experiment was designed so that with re- 
spect to "backward" associations it would 
be possible to separate the amount of effort 
required for element learning from that re- 
quired for the learning of associations per se. 

The Ss in Group I performed the task in 
the standard, one-directional anticipation 
manner (A— B, AB, a owe di ESL 
According to the usual interpretation, Group 
I was required to learn eight forward bonds 
and eight elements. 

The Ss in Group II anticipated in the 
standard manner (like Group I), but for 
them the roles of first and second elements 
alternated from trial to trial (A> B, 
B>A, A>B, HAT Thus, they 
had to learn 16 bonds and 16 elements. 

In Group III S heard the first element 
of the pair, anticipated the second element, 
scanned his booklet for the second element, 
wrote the first element, and heard the sec- 
,Ond element. In this group the anticipation 
was in only one direction, like Group I, 
but S participated actively with all of the 
stimuli, like Group IL These Ss had to 
learn eight forward bonds, 16 elements, and 
the “novelty” of writing the A elements, 

Finally, in Group IV $ heard, scanned, 
anticipated, and wrote as in Group III, but 
had the roles of first and second element 
alternated from trial to trial as in Group IL. 


sense syllable rather than trying to recall 
it. In any case he has an opportunity to 
anticipate if he chooses to do so, 
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Group IV Ss learned 16 elements and 16 
bonds and the "novelty" of writing A cle- 
ments. 

In Groups III and IV, where S wrote 
the "stimulus"element, the space in the book- 
let was to the left For Groups I and II, 
the space was to the right. Thus, the visual 
consequence of what S wrote was consistent 
with the implied order of what he had 
heard. 

On the sixteenth and seventeenth trials the 
pairs for Groups I and III were reversed, 
in order to allow a direct assessment of 
B—»A performance. Groups II and IV 
simply continued alternating for 2 more 
trials. 

Several Ss in Groups I and III expressed 
surprise at the reversal after they had fin- 
ished their sessions, Because of this, Group 
C was added. Group C was run like Group 
I, but was also given instructions alerting 
them to the fact that they would be given 
a reversal test after 15 trials. This group 
can be used to test for the compound effect 
of (a) the surprise at the reversal and (b) 
the possibility of directional mnemonics. An 
S who did not know that he would be 
tested in both directions might develop asso- 
ciations and plans in his learning which 
were directional. He might mediate the task 
with self-instructions which were apropos 
the training, but not apropos the test. 

For purposes of control and balance, each 
experimental group was administered as 
eight subgroups of three Ss each, Thus, it 
was possible (a) to use different lists of 
paired associates, (b) to have pairs per se 
vary within experimental conditions, but be 
constant across conditions, (c) to have sys- 
tematic variation in the ordinal position of 
the pairs on each trial, and (d) to balance 
between time-of-day-day-of-week and the ad- 
ministration of conditions, (Group C was 
run as one half of Group I plus an additional 
subgroup of two Ss.) 

Twelve seconds after Trial 17 S was told 
to turn his booklet over and to write down, 
without regard to pair relationships, all of 
the nonsense syllables he could recall. He 
was given 90 sec, Then he was asked to 
open his booklet to page 32 and to rank 
order the 16 syllables on the basis of their 
familiarity, Familiarity was defined by 
“such things as: How easily can you recog- 
nize it? How long will you remember it? 
How easy will it be to learn the next time 
you are asked to learn it?” 


RESULTS 


Table 2 presents summary data on 
several performance criteria for the 
five groups. While these measures are 
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TABLE 2 
SUMMARY OF PERFORMANCE CRITERIA 
Group 
Performance Measure 
I II III IV c 

Average No. of Correct Responses 

Trials 1-15 65.12 59.29 66.04 58.88 66.00 

Trial 16 5.65 6.42 6.21 6.79 5.72 

Trial 17 6.21 7.01 7.12 7.21 6.15 
Median No. of Trials to Criterion 

Each Pair 4.58 4.81 4.45 5.20 4.65 

1 Pair 1.80 1.90 1.40 1.56 1.36 

2 Pairs 2.83 3.00 2.64 3.29 2.28 

3 Pairs 4.15 4.13 3.70 4.55 4.05 

4 Pairs 5.80 5.87 5.70 6.00 5.23 

5 Pairs 7.40 7.43 6.23 7.10 6.36 

6 Pairs 9.00 9.50 8.00 8.60 7.62 

7 Pairs 10.10 11.40 9.75 10.30 9.01 

8 Pairs 11.30 13.70 11.90 14.00 11.40 


not independent, they are not perfectly 
correlated. Furthermore, they are not 
subject to the same distributional and 
sampling properties. Therefore, they 
may imply different things and they 
may be taken collectively as stronger 
evidence than they would be separately. 

Acquisition—The average number 
of correct responses during acquisition 
(Trials 1-15) is a statistically depend- 
able measure of performance. It is 
normally distributed with homoscedas- 
tic variance (s = 21.14). If the ex- 
periment is viewed as a 2 X 2 design, 
the factor corresponding to one- vs. 
two-directional acquisition has a sig- 
nificant effect, MS = 1,014.0, F (1, 
92) = 228, p < .10, for this measure. 
There is no evidence for a significant 
effect for the factor corresponding to 
writing the “stimulus” vs. the “re- 
sponse” (MS=1.5) nor for the in- 
teraction (MS = 107). In this analy- 
sis, however, there is no way to assess 
the effects of element learning. 

The distribution of trials to criteria 
is censored (because some Ss do not 
learn) and log-normal (with homosce- 
dastic variance, s = .234 log,, units). 
Using normal-curve statistics it is pos- 
sible to place confidence intervals on 
estimates of the number of trials re- 
quired to learn associative elements 


vs. the number of trials required to 
learn associative bonds. The four ex- 
perimental groups are distinguished on 
the basis of what they might have been 
required to learn, Thus, Groups II, 
III, and IV each had to learn 16 ele- 
ments (e), Groups II and IV had to 
practice bonds in two directions (b), 
and Groups III and IV had to contend 
with the novelty of writing the “stimu- 
lus" element of each pair (n). If 
I, Il, III and IV represent the 
performance of the corresponding 
group, then solution of the linear model 
implied by this analysis yields: b — 
IV — III; e — II + III — I — IV; and 
1-IV-—IIL Using the values in 
Table 2, the following trends develop: 
(a) the estimates for e and n are posi- 
tive and negative and none of them 
differ significantly from zero, (b) all 
of the estimates of b are positive, but 
only the one for “Each Pair" is sig- 
nificantly different from zero (z= 
2.68, remaining 2’s = .33-1.4). 

On Trial 1, all Ss received identical 
experience: exactly one trial of for- 
ward practice on eight pairs. On Trial 
2, these pairs were reversed for half of 
the Ss (probability of a correct re- 
sponse is .102) and repeated for the 
other half (probability of being correct 
= .120). Using a conservative sim- 
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ple-randomized design, with the num- 
ber of pairs correct per S as the per- 
formance measure, this difference is 
significant at the .05 level, F (1, 110) 
= 4.85. 

Transfer —Many Ss in each group 
got all eight pairs correct on Trials 16 
and 17 and the resulting distributions 
of scores appear to be normal distri- 
butions censored at eight, but with 
homoscedastic variance (523). Ac- 
cording to chi-square, number of cor- 
rect responses on Trials 16 and 17 for 
Groups I, III, and C is significantly 
below that for Groups II and IV, 
x*(1) = 5.65, 400, p<.05, If this 
analysis is repeated with partial and 
blank responses, this result is more 
pronounced, y*(8) = 27.80, p < .001. 
On Trial 16, each group is like the 
others in its class, but Group III im- 
proves measurably on Trial 17 and is 
no longer individually below the per- 
formance of Groups II and IV, though 
according to the extrapolated learning 
curves, it is still below where it would 
have been had the pairs not reversed 
on Trial 16. 

Effect of "surprise" at reversal.— 
When performance on Trials 16 and 17 
is examined on a pair-by-pair basis, 
the following results are observed : 
Groups II and IV, continuing alterna- 
tion, seem to improve linearly over all 
16 pairs (2 = 1.17, 1.30). Group C 
(warned of the reversal) does not 
exhibit any improvement over the 
16 pairs (g= -10), though it is sig- 
nificantly higher than Group I at Pair 
1l@= 1.63) and not significantly dif- 
ferent from Group I at Pair 16 (am 
48). Groups I and III both demon- 
strate significant improvement over the 
first 6 pairs (z = 2.34, 1.96) and none 
over the last 10 pairs (z = 43, .13). 

Unaided recall of elements. —Table 
3 collects summary data on the un- 
aided recall of elements, Elements are 
assigned the designation A or B on 
the basis of the role they played on 
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Trial 1; thus, for Groups I, III, and 
C, A elements were stimulus elements 
and B elements were response elements 
—at least during the training trials. 
For Groups II and IV, A elements 
were stimulus elements on 9 trials and 
response elements on 8 trials; B ele- 
ments were response elements on 9 
trials and stimulus elements on 8 trials, 

As the first element to be recalled, 
Group I prefers to recall A elements 
(x? = 42, p < .05) ; the other groups 
demonstrate no significant preferences, 
though the relative percentages in 
Group C are quite like those in Group 
I and probably miss significance only 
because of the small number of cases 
observed. 

For the second item to be recalled, 
all Ss tended to recall the pair mate 
for the first item recalled. The theo- 
retical probability for this event, if 
recalls are independent, is 1/15. The 
Observed probability exceeds 2/3 for 
all groups. A difference this large 
would occur by chance one in 1.32 
X 10" times, "Throughout the recall 
Process Ss in every group recalled 
pair mates consecutively, significantly 
more often than the hypothesis of in- 
dependent recall would predict (every 
Z exceeds 9.5), 

An S confronted with this recall task 
could recall either or both of the ele- 
ments of a pair and he could recall 
them in any order. The last five lines 
of Table 4 present the data required 
to test the relationships between the 
recall of A and the recall of B and be- 
tween the recall of A — B and the 
recall of B— A, Groups I, III, and 
C recall A more easily than B (2— 
2.33, 1.94, 3.86) and recall B. given A 
More often than they recall A given 
B, x°(1) = 6.76, 7.40, 16.60, p < .01. 
Neither of these trends is present in 
Groups II or IV (z—.10, 21 and 
X: = .003, 008). 

Possible indexes of availability.— 
After the experiment was completed, 
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TABLE 3 
Data on UNAIDED RECALL 


Group 
Performance Measure 
I u IH IV c 
No. of Ss Whose ist Recall 
Wasan A 17 13 12 13 9 
Wasa B 7 11 12 11 5 
2nd Recall Was Pair 
Mate for 1st 16 18 16 19 11 
Probability of Consecutive 
Recall of Pair Mates 
"Theoretical .150 148 168 147 146 
Observed 3572 594 565 588 588 
No, of Pairs Recalled 
A before its Pair Mate 78 70 85 73 57 
B before its Pair Mate 47 69 61 70 23 
A Only 28 13 10 10 8 
B Only 27 11 10 14 5 
Total No, of Elements 
Recalled 305 302 312 310 173 
E counted the number of times each S asa stimulus or a response, all but one 
participated with each syllable in each correlation is significant at at least the 


of several ways (see the row headings 
in Table 4). These numbers were 
correlated with (a) S's familiarity 
rankings and (b) his rank order of 
recall. Table 4 presents the average 
correlations within each Group of Ss, 
but it presents only the correlations 
involving the familiarity rankings 
(those involving rank order of recall 
show no departures from chance ex- 
pectations). Here, except for usage 


.05 level. (Significance is tested by 
applying a z transform to each corre- 
lation and testing the average 2). In 
general, the correlations for Groups II 
and IV are higher than those for 
Groups I, III, and C. Furthermore, 
being used in a correct bond correlates 
better than just using the element in 
any bond. Finally, the correlation with 
the trial number of the first correct 
response is not uniform across groups. 


TABLE 4 
CORRELATIONS BETWEEN USAGE OF ELEMENTS AND FAMILIARITY RATINGS 


Group 
Manner in Which Element Was Used 
I I III IV G 

Feet 798 | 02. | 9, | 92 | i 
Response -03 1 4 à al 
‘Transcribed eases am —a1* se 13% | 22884) |. —.18* 
Any Mark Made Next toa 

"Element e —.11* =.26** | —.13* —.21** | —.08 
Transcribed Correctly in a Bond 19** | —.58** | —.27** | —.47** | —.25** 
Typed Correctly in a Bond .24** | —,57** | —.23** | —.43** | —.23** 

i . ich 1st Correct 
TEL Pee .36** 40** i bed Eis shee 
Rank Order of Recall> igit* .28** 129** aet aste 


* d correctly are scored as 17 + 2/2, when nis the number of elements not used correctly. 
5 Blements mot recalled are scored as 16 + 27, where m is the number of elements not recalled. 


*p « .05. 
Fd <.01. 
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For Groups I, III, and C, this corre- 
lation is higher than those for usage 
in a correct bond; it is lower in Groups 
II and IV. 

It is possible that being correct on 
the first trial would tend to “lock-in” 
that response. Since the probability 
of being correct on the second trial is 
independent of whether the first re- 
sponse was correct (9/37) or incorrect 
(112/843), there is no evidence for 
this. 


DiscussioN 


The concept of availability has an im- 
pact on all of the research in this area. 
Studies which report stronger forward 
than backward associations (Feldman & 
Underwood, 1957; Houston, 1964; Rich- 
ardson, 1960; Underwood & Keppel, 
1963; Underwood, Runquist, & Schultz, 
1959) invariably can be interpreted as 
containing elements with different avail- 
abilities. On the other hand, studies re- 
porting equally likely recall of A and B 
elements (Asch & Ebenholtz, 1962a, 
1962b; Asch & Linder, 1963; Horowitz, 
Brown, & Weissbluth, 1964; and Horo- 
witz, Norman, & Day, 1966) have meas- 
ured these recalls in situations which 
are not the same for all elements. The 
S is asked to recall more than one ele- 
ment. If later elements are recalled as 
a function of earlier recalls (and the 
evidence here strongly suggests that this 
is the case), then recall does not reflect 
availability. Just as it can be argued 
that recall contaminates associations, so 
it can be argued that association con- 
taminates recall. 

The four experimental groups here 
were structured so that the issue of avail- 
ability could be factored out. Group IT 
practiced in the A+B, B A, A— B, 
B>A, .. . manner. They required 
more trials to learn and got fewer cor- 
rect responses than either Group I or 
Group III. Since both Groups II and 
III had to learn all the elements, the in- 
creased difficulty cannot have been due 
to extra elements. Neither can it be 
due to the facilitation Group III might 
have had from the novelty of writing 
the stimulus element. If that were the 


problem, Group IV would have per- 
formed as well as Group III, and it did 
not. Performance during learning, both 
rate and number of correct responses, 
suggests that associations are asym- 
metric. 

Group III practiced A — B, A — B, 
A—B,... but had to learn the A ele- 
ments. According to the free-recall data, 
Group III actually learned the A ele- 
ments better than the B elements. Fur- 
thermore, Groups I and C were also 
better at recalling the A elements than 
the B elements. If associations are sym- 
metric, these groups should be able to 
reverse their pairs without a decrement 
in performance; they cannot. This poor 
performance is not due to unequal avail- 
ability; the A elements are more avail- 
able than the Bs. It is not due to the 
unexpected reversal task; Group C also 
had the performance decrement. Per- 
formance on the reversal task suggests 
that associations are asymmetric. 

To some extent Ss chain in their re- 
calls, so the recalls are influenced by 
the pair structure of the learning task. 
The second member of a pair to be re- 
called can be thought of as contingent 
upon the first element and not recalled 
solely because of its own availability. 
In Groups I, III, and C where "stimulus" 
element had a functional meaning, Ss 
recalled A elements more easily than B 
elements. Thus, A elements are more 
available than B elements. But these 
same Ss experienced difficulty in recalling 
these A elements if the Bs were recalled 
first. Recall sequences imply that asso- 
ciations are asymmetric, and this is true 
in spite of the fact that these Ss have had 
2 trials of backward training before re- 
call begins. 

The analyses above are convincingly 
consistent. Table 5 summarizes these 
analyses, and some others; the conclusion 
that associations are asymmetric seems 
defensible. However, asymmetry does 
not imply that "backward" associations 
are somehow in addition to "forward" 
ones. The data here suggest that the 
relative strengths are comparable. In 
particular: Group III performed as well 
at the end of Trial 17 as did Groups II 
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TABLE 5 
PATTERN OF RESULTS 


Data Support Notion 
Measure of That Associations Are 
Performance Used 
Symmetric] Asymmetric 
Trials 1-15 
Cumulative Density, No. 
of Correct Responses 
per S p <.10 
No. of Correct Responses 
per Tria! P < 00001 
Avg. Total No. of Correct 
Responses per S p <.05 
Trials to Criterion 
Each p «.01 
N Pairs nonsig. 
trends 8 
out of 8 
values 
Probability of Correct Re- 
sponse on Next Trial after 
One Prior Correct Re- 
sponse p «.003 
Trials 16 and 17 
No. of Correct Responses 
on Trial 16 p < 02 
No. of Correct Responses 
on Tri 7 p «.05 
Pattern of Partial 
Responses p «.0001 
Sequence of Elements in 
Recall b <.01 
Performance on Trial 2 b «.05 


and IV. (And they had more total cor- 
rect responses for all 17 trials, thus im- 
plying that the method is an efficient way 
to learn.) On the basis of total number 
of correct responses "backward" asso- 
ciations slow S down only 11%. On 
the basis of trials to criteria, they slow S 
down from 5% to 19.5%. On the basis 
of probability of being correct following 
one prior correct response, backward as- 
sociations are 84.595 as strong as for- 
ward ones. On the basis of performance 
on Trial 2, backward associations are 
85% as strong as forward ones. And 
on the basis of recall ordering, backward 
associations are 89-97% as strong as for- 
ward ones. 

Availability has been useful in inter- 
preting these results and it has been 
logically consistent throughout. All of 
the trends in Table 4 are consistent, and 
there is a significant correlation between 
the total number of correct responses and 
the number of elements recalled (e.g., 
r=.747 for Group I). It seems rea- 
sonable to suppose that availability (the 
probability of recall) is a function of the 
number and kinds of experience an $ 
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has had. Furthermore, it seems that S$ 
can directly perceive availability. 
Association and availability are inti- 
mately connected, but they are not equiv- 
alent. An S may be able to associate 
elements which he has not yet managed 
to make a functional part of his response 
repertoire, It may be that he is using 
only small pieces of the elements in these 
associations. On the other hand, it has 
been demonstrated many times that S 
can recall (ie. he has available) ele- 
ments which he is unable to use correctly. 
Recall follows availability—by definition 
an S cannot recall an unavailable item— 
but availability does not guarantee recall 
performance which also requires associa- 
tion. 
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72 rats were run in a 2 X 3 factorial experiment combining 2 condition- 
ing procedures (1-way and shuttle responding) and 3 shock intensities 
(1.0, L5, and 2.5 ma.). Conditioning was much faster with the 1-way 
procedure than with the shuttle procedure. Rate of conditioning 
decreased with shock intensity with the shuttle procedure, but not with 
the 1-way procedure. Escape latencies did not vary systematically 
with shock intensity. The data indicate that the slower learning 
rates with high-intensity shock in the shuttle condition are not due 
to the disruption of the escape response by high-intensity shock. 
Rather, it appears that with the shuttle procedure the decreased rate of 
conditioning is primarily due to the fact that high-intensity shock 
increases the resistance to extinction of a response which is incom- 
patible with the avoidance response. The results of a 2nd experiment 
indicated that handling of S during the intertrial interval contributed a 
small amount to the difference between the 1-way and shuttle 


conditions. 


Moyer and Korn (1964) have pub- 
lished data on shuttle avoidance con- 
ditioning in rats which indicate that 
rate of learning and level of perform- 
ance are inversely related to shock in- 
tensity. They attribute the relationship 
to greater and greater disruption of 
the instrumental escape response by 
higher and higher shock intensities. 
In spite of these results, there is an 
extensive literature on aversive con- 
ditioning in a wide variety of test 
situations which indicates that learning 
and asymptotic performance are in- 
creasing functions of shock intensity 
(eg. conditioned suppression, Annau 
& Kamin, 1961; shock postponement, 
Boren, Sidman, & Herrnstein, 1959; 
and wheel-turning avoidance, Kimble, 
1955). Moyer and Korn (1964) sug- 
gest that the differences between their 
data and these data are due primarily 
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to the fact that in these situations, the 
instrumental escape response is less 
susceptible to disruption by shock than 
the shuttle-type response. 
Examination of the shuttle proce- 
dure suggests an alternative hypothe- 
sis, namely, that the learning of a shut- 
tle response is an extremely complex 
discrimination with at least one strong 
competing behavior available to S. 
Consider the early trials of shuttle 
avoidance conditioning. During the 
periods before shock onset, the rat has 
a very strong tendency not to run back 
into the compartment where he re- 
ceived shock on the previous trial. The 
S does run back only when he is forced. 
to do so by reapplication of shock. 
For brevity, this tendency not to re- 
turn to the previous shock compart- 
ment will be called the "staying" re- 
sponse. It is obvious that the staying 
response is incompatible with the avoid- 
ance response and thus competes with 
it. It is logical to expect that the 
Stronger the shock on the previous 
trial, the lower the probability that S 
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will return to the compartment where 
shock was received on the previous 
trial, Extinction of the staying re- 
sponse must occur before S can per- 
form the avoidance response. If re- 
sistance to extinction of the staying 
response is directly related to shock 
intensity, then rate of avoidance con- 
ditioning will be inversely related to 
shock intensity. 

In the experimental situations men- 
tioned earlier in which performance is 
positively related to shock intensity, 
there is a distinct lack of a compelling 
competing response. To have a com- 
peting response for the wheel-turning 
shock-avoidance or bar-pressing shock- 
postponement situations comparable to 
that in the shuttle situation would re- 
quire two manipulanda with the con- 
tingency that S alternate his responses 
to the manipulanda on successive trials 
of the experiment. 

It is possible to obtain avoidance 
conditioning free of a strong competing 
response in a shuttle-type apparatus 
simply by requiring S to respond in 
one direction rather than having him 
shuttle back and forth. With this pro- 
cedure, one compartment is always the 
shock compartment and the other al- 
ways the safe compartment. Theios 
and Dunaway (1964) found that 
rats trained with a one-way procedure 
made an average of only 3.4 errors 
before reaching a criterion of 10 con- 
secutive correct responses. On the 
other hand, rats trained under the same 
conditions with a shuttle procedure 
made an average of 31.1 errors, or 
nine times more errors. If decreased 
performance with increased shock is 
due to a competing tendency not to 
return to the previous shock compart- 
ment, then it would be expected that 
increased shock should not decrease 
performance in the one-way avoidance 
situation which is free of this type of 
competing tendency. 
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ExPERIMENT I 


The present experiment tested the 
competing response hypothesis with a 
2 X 3 factorial design. Rats were run 
with either a shuttle or a one-way pro- 
cedure under a low, medium, or high 
level of shock. According to the hy- 
pothesis there should be (a) a signifi- 
cant effect due to type of conditioning 
procedure with the one-way Ss learn- 
ing faster than the shuttle Ss and (b) 
a significant Procedure X Shock Level 
interaction such that high shock re- 
tards learning rate with the shuttle 
procedure but does not retard rate 
with the one-way procedure. 


Method 


Subjects—The Ss were 80 naive albino 
rats obtained from the Holtzman Company 
of Madison, Wisconsin. They were between 
90 and 100 days of age when the experiment 
began and had been maintained on ad lib 
food and water. Eight Ss were dropped 
from the experiment due to apparatus fail- 
ure leaving 72 Ss which were randomly 
assigned to the six experimental conditions 
of the 2X 3 factorial design. 

Apparatus—The conditioning box con- 
sisted of two 14-in. long, 8-in. wide, and 
6-in. high compartments separated by a 
motor driven, clear Plexiglas guillotine door. 
The raising and lowering of the door took 
approximately .1 sec. The end of each com- 
partment consisted of a curved, clear Plexi- 
glas wall, behind which was located a 15-w. 
glazed light, diffused through a milk-glass 
plate. The floor of each compartment con- 
sisted of j&-in. brass rods spaced H in. 
apart, positioned parallel to the door, in a 
hinged frame. The weight of a rat depressed 
a microswitch mounted beneath each frame. 
The apparatus was painted flat black and 
had hinged Plexiglas tops covered with 
black cloth mesh which permitted observa- 
tion of S but which mitigated his possible 
distraction by E. A 1-in, audio speaker was 
centered in the top of each compartment. 
The apparatus rested on foam-rubber cush- 
ioning in a modified horizontal freezer chest 
which had been entirely lined with acous- 
tical tile for sound attenuation. All stimulus 
presentation, timing, and response recording 
was programmed by Foringer, Grason-Stad- 
ler, and Lehigh Valley Electronics modules. 
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The CS was the complex consisting of the 
opening of the door between the compart- 
ments, the offset of the light in the occupied 
compartment, the onset of the light in the un- 
occupied compartment, and the presentation 
of wide-band white noise which increased the 
sound level in the compartments from 77 
db. to 90 db. (SPL, C scale of General 
Radio sound-pressure meter). The UCS 
was scrambled electric shock delivered by a 
Lehigh Valley Electronics LVE 1531 con- 
stant current shocker. Response latency in 
-10-sec. units and type of response (error or 
avoidance) were recorded by a Grason- 
Stadler E4600 print-out counter. 
Procedure.—The CS-UCS interval was 5 
sec, and the intertrial interval was 30 sec. 
The experiment was begun by placing S into 
one of the two compartments, chosen at 
random. Pretest trials then followed and 
consisted of the presentation of the CS alone 
for 11.5 sec. During this period shock was 
never presented, and S’s responses had no 
effect on the termination of the CS. At the 
end of the 11.5 sec. the CS was terminated, 
the door lowered, and the S was trapped in 
the compartment then occupied. Each S 
was given pretest trials until he reached 
a criterion of five successive failures to 
leave the occupied compartment within 5 
Sec. after the presentation of the CS. 
Training began on the next trial An 
avoidance response was defined as leaving 
the occupied compartment within 5 sec. after 
the presentation of the CS, If S took longer 
than 5 sec, to leave, he was shocked until 
he did leave. Responses to shock were 
scored as errors, Avoidance responses ter- 
minated the CS, and escape responses (er- 
rors) terminated the CS and the UCS. 
Each S was run to a criterion of 10 con- 
secutive avoidance responses. Half the Ss 
were trained with a shuttle procedure, and 
half the Ss were trained with a one-way 
Procedure, Shuttle Ss were never touched 
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during the experiment. One-way Ss were 
not touched during the pretest, but 15 sec. 
after the end of each training trial they were 
picked up and replaced in the compartment 
in which they had originally received shock. 
Thus, for the one-way Ss, one of the com- 
partments was a shock compartment and 
the other was a safe compartment. Within 
each procedure, subgroups of 12 Ss each 
had the shock intensity meter of the con- 
stant current shocker set at either 1.0, 1.5, 
or 2.5 ma. These values of shock intensity 
were selected because Moyer and Korn 
(1964) found that the interesting effects of 
shock were asymptotic below 1.0 ma. and 
above 2.5 ma. 


Results 


Table 1 presents the mean total er- 
rors and the mean trial of the last 
error before the criterion run for the 
six groups of the experiment. One- 
way Ss perform significantly better 
than shuttle Ss; F (1, 66) = 49.75, p 
< .001, for total errors and F (1, 66) 
= 58.67, p < .001, for the last error. 
In both measures there is a significant 
Procedure X Shock Intensity inter- 
action; F (2, 66) = 7.31, p < .01, for 
total errors, and F (2, 66) = 6.30, p 
< .01, for last error. The interactions 
are such that increasing shock inten- 
sity slows down learning rate in the 
shuttle condition, but has little effect 
on learning rate in the one-way condi- 
tion. There are also significant ef- 
fects due to shock intensity; F (2, 66) 
= 7.10, p< 01, for total errors, and 
F (2, 66) = 5.12, p < .01, for trial of 
last error, Inspection of Table 1, how- 


TABLE 1 
Mean TOTAL Errors (T) AND TRIAL OF THE LAST ERROR (L) BEFORE CRITERION 


Shock Intensity 


Cond, 1.0 ma. 1.5 ma. 2.5 ma. 
F L T L T L 
Shuttle 32.3 (54.2) 47.8 63.6 
E . 100.3 s 
One-Way 5.0 (8.2) 5.4 (03) 4.7 03) 
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TABLE 2 


Mean TIME (SEC.) TO ESCAPE SHOCK OVER 
ALL SHOCK TRIALS 
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TABLE 4 


SUMMARY OF ANALYSIS OF VARIANCE 
ON AVOIDANCE LATENCIES 


Shock Intensity 


Cond. 
1.0 ma. | 1.5 ma. | 2.5 ma. Mean 
Shuttle 1.2 6 112 1.0 
One-Way 1.0 1.2 1.0 1.1 
M 1.1 3 1.1 1.0 


ever, reveals that the intensity effects 
are due only to the highly significant 
decremental effect which increasing 
shock has on learning rate in the shut- 
tle condition. 
Response latencies —Table 2 presents 
the mean time to escape shock over all 
the shock trials. None of the differ- 
ences even approach significance. 
Thus, with the apparatus and settings 
used, increasing shock intensity neither 
retards nor facilitates escape behavior. 
Table 3 presents the mean latencies 
of all the avoidance responses. A sum- 
mary of an analysis of variance on the 
avoidance latencies is given in Table 
4. The analysis indicates that there 
is a significant effect between condi- 
tions, with one-way Ss responding 
faster than shuttle Ss. There is also 
a significant effect due to shock in- 
tensity, when the 2.5 and 1.5 ma. 
groups are combined and compared to 
the 1.0 ma. groups. There is no Con- 
ditions X Shock Intensity interaction. 


TABLE 3 
MEAN OF ALL AVOIDANCE LATENCIES (SEc.) 


Shock Intensity 


Cond. 
1.0 ma. | 1.5 ma. | 25 ma. | Mean 
Shuttle 2.4 21 2.0 2.2 
One-Way 1.5 1.4 ils 1.4 
M 2.0 18 | 16 


Source df F 
Conditions 1 45.87 *** 
Shock Intensity (2) 

1.5 vs. 2.5 1 13 
1.0 vs. (1.5 and 2.5) d 4.00* 
Conditions X Shock 2 .52 
Within MS, 66 (.23) 
*p <.05. 
A S001. 
Discussion 


The results indicate that rate of avoid- 
ance conditioning is depressed by high- 
intensity shock only if a shuttle-type pro- 
cedure is used.  High-intensity shock 
does not depress rate of conditioning 
when a one-way procedure is used. The 
results suggest that retardation of condi- 
tioning with high-intensity shock should 
not be interpreted as due to greater and 
greater disruption of the escape response 
by higher and higher shock intensities. 
Rather, a necessary condition for retarda- 
tion of learning rate by high-intensity 
shock seems to be the initial presence of 
a high-strength response to the CS which 
is incompatible with the avoidance re- 
sponse. In the shuttle box, the high- 
strength competing response is the tend- 
ency for S not to enter the compartment 
where he was just shocked. This com- 
peting response or inhibition is not pres- 
ent in the one-way avoidance training 
procedure, and in fact the learning rates 
of the one-way Ss are not retarded by 
high-intensity shocks. 

The competing-response hypothesis also 
accounts for the huge differences between 
rates of learning in the shuttle and one- 
way conditions which were found in this 
study and by Theios and Dunaway 
(1964). In order to learn the avoidance 
response, shuttle Ss have to extinguish 
the competing tendency of not returning 
to the compartment in which shock was 
received on the previous trial. That this 
response is highly resistant to extinction 
is evidenced by the fact that mean num- 
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bers of shocks before the first avoidance 
response were 16, 26, and 41 for the 1.0-, 
1.5-, and 2.5-ma. shuttle groups, respec- 
tively. By contrast, corresponding means 
for the one-way groups were only 3, 3, 
and 4. 

The latency data also suggest that dis- 
ruption of the escape response by high- 
intensity shock is not the reason for the 
lower learning rates with high-shock in- 
tensities in the shuttle condition. With 
the Lehigh Valley Electronics constant 
current shocker, we found that settings 
of 1.0, 1.5, and 2.5 ma. did not result 
in systematically different shock escape 
latencies. Thus, even though the differ- 
ent shock intensities resulted in different 
learning rates in the shuttle Ss, no evi- 
dence was found that escape responses 
were differentially disrupted by the dif- 
ferent intensities of shock. We would 
have to interpret these results as indicat- 
ing that settings from 1.0 to 2.5 ma. on 
the Lehigh shocker deliver electrical 
stimulation to the rat which effectively 
do not differentially inhibit motor co- 
ordination, 

It is trivially true that over the entire 
domain of possible shock intensities, 
learning rate will be a nonmonotonic 
function, at first increasing and then de- 
creasing, For at very low shock intensi- 
ties, the organism will not be affected by 
the shock, and learning rate will be zero 
or near zero. With extremely high 
shock intensities, the organism will be 
either tetanized, convulsed, or killed, and 
learning rate again will be zero or near 
zero. The point that seems clear in our 
data, however, is that within the optimal 
range of shock intensities found by 
Moyer and Korn (1964), 10-25 ma., 
there is no differential interference of 
the escape response. However, the re- 
sistance to extinction of the initial com- 
peting response to the CS is an increas- 
ing function of shock intensity, 


EXPERIMENT II 


Operationally, there are two differ- 
ences between the procedure of the 
shuttle and one-way Ss. First, the 


J. THEIOS, A. D. LYNCH, AND W. F. LOWE, Jn. 


shuttle Ss were not handled during the 
intertrial interval whereas the one-way 
Ss had to be handled. Second, the 
shuttle Ss eventually had to learn to 
turn around and face the door before 
each trial, whereas the one-way Ss 
were replaced in the shock compart- 
ment facing the door. The one-way 
Ss, therefore, did not have to learn 
to turn around. Although it appears 
that the primary variable determining 
the difference between the shuttle and 
one-way conditions is an inhibition 
against entering a place where shock 
has just been received on the previous 
trial, it is possible that handling per se 
and learning to turn around may ac- 
count for some of the differences be- 
tween the two conditions. To assess 
the relative effects of these variables a 
second experiment was performed. 


Procedure 


The general method, apparatus, and Ss 
were similar to those of the 1.0-ma, condi- 
tion of Exp. I. There were 40 Ss assigned 
equally to four groups. Three of the groups 
were required to shuttle from one compart- 
ment to the other on each trial, The first 
group was an unhandled control group which 
was treated just as the 1.0-ma. shuttle group 
in Exp. L The Ss of the second group 
were picked out of the apparatus during the 
intertrial interval just as were the one-way 
Ss of Exp. I. However, Ss of the present 
group were replaced in the compartment 
from which they were picked up rather 
than the other compartment as were the 
one-way Ss. In addition, the present Ss 
were replaced facing away from the door. 
A performance difference between the first 
two groups of Exp. II would be due to 
the effects of handling per se. The Ss 
of the third group were picked out of 
the apparatus during the intertrial interval 
and replaced in the same compartment fac- 
ing toward the door. A performance dif- 
ference between the two handled groups 
would be due to the effects of the direction 
of replacement. The Ss of a fourth group 
Were given one-way training, but were re- 
placed in the shock compartment facing 
away from the door. A comparison of this 
group and the 1.0-ma. one-way group of 
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Exp. I would also give an estimate of the 
amount of difference due to having to learn 
to turn around before the next trial. 


Results and Discussion 


To demonstrate the reliability of the 
conditioning technique, a comparison 
will first be made between the 1.0-ma. 
shuttle groups of Exp. I and II, which 
were treated the same, but which were 
run at different times by different Es. 
The mean total errors was 32.3 for the 
Exp. I group and 33.7 for the Exp. II 
group. The difference is far from ap- 
proaching statistical signifiance, F (1, 
38) < 1.0. The error term in the anal- 
ysis of variance was based on data from 
all four of the 1.0-ma, shuttle groups. 

Consider now the shuttle Ss of Exp. 
II who were handled during the inter- 
trial interval. The mean total errors 
was 17.2 for the group replaced facing 
away from the door and 18.0 for the 
group replaced facing toward the door. 
The difference is not significant, F (1, 
38) < 1.0. This indicates that whether 
the organism was placed facing toward 
the door or away from the door had 
no effect per se on performance. 

The overall mean total erors for the 
22 nonhandled shuttle Ss was 30.7, 
whereas the overall mean for the 20 
handled shuttled Ss was 17.6. The 
difference is statistically. significant, 
F (1, 38) = 4.81, p < 05. Thus, han- 
dling Ss during the intertrial interval 
does result in faster learning and prob- 
ably did contribute to the difference 
between the one-way and shuttle con- 
ditions of Exp. I. The effects of han- 
dling, however, should be relatively 
constant across shock levels. From a 
theoretical standpoint it is difficult to 
see how handling could differentially 
interact with shock intensity to pro- 
duce the interaction which was found 
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in Exp. I. From observation of Ss, 
it is our opinion that handling provides 
S with a very dynamic stimulus to 
which he can condition an orienting 
response. After the avoidance re- 
sponse appeared, Ss who were handled 
and replaced facing away from the 
door were very likely to turn around 
just after being replaced in the com- 
partment. The use of the stimulus 
feedback from handling as a cue to 
turn around or orient could account 
for the lack of a difference between 
the two-directional placement condi- 
tions. This interpretation is further 
strengthened by the lack of a large dif- 
ference in mean total errors between 
the 1.0-ma. one-way Ss of Exp. I, who 
were replaced facing toward the door 
of the shock compartment, and the one- 
way Ss of Exp. II, who were replaced 
facing away from the door. There was 
a directional difference in that $s who 
faced the door had a lower mean (5.0) 
than did Ss who faced away from the 
door (5.8), but the difference is not 
significant, F (1, 20) = 70. 
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BRIGHTNESS ENHANCEMENT IN INTERMITTENT LIGHT: 
METHODS OF MEASUREMENT? 


GERALD S. WASSERMAN 2 


Massachusetts Institute of Technology 


Brightness enhancement was measured under 4 conditions of judgment 
(simultaneous or successive comparisons and in the presence or 
absence of an adapting field) in a 2X 2 factorial design. Brightness 
enhancement was greater for successive comparisons than for simul- 
taneous comparisons. Enhancement was also favored when the 
adapting field was removed during the comparisons, The greatest 
amount of enhancement, then, was found when the intermittent light 
was isolated in space and time from all other stimuli, thus contradicting l 
the suggestion that brightness enhancement might be an interaction 
artifact. However, these results also demonstrate that brightness 
enhancement is sensitive to the presence of other stimuli. Intro- 
spections of Os suggest that subjective colors were produced to a 


greater degree during simultaneous comparisons than during successive 


comparisons. 


When a light is flashed intermit- 
tently it may look brighter than on 
continuous exposure, even though less 
total flux reaches O's eye from the 
intermittent light. This enhancement 
of brightness can be observed either 
with a single flash (Broca-Sulzer 
effect) or with a train of flashes 
(Brücke-Bartley effect). In another 
paper (Wasserman, 1966), it was re- 
ported that these two enhancement 
effects are different manifestations 
of the same process; they will be so 
considered in the present report. 
The term “intermittent light” will be 
used to describe either experimental 


1This research was done during the au- 
thor’s term as a National Institutes of Health 
predoctoral fellow (F 1-MH-22, 408-01) and 
was supported by a National Aeronautics 
and Space Administration grant (NsG 496) 
awarded to H.-L. Teuber, The computations 
for this report were done on an IBM 7090 
computer at the Massachusetts Institute of 
Technology Computation Center. I would 
like to express my indebtedness to my 
teacher, Hans-Lukas Teuber, for the advice 
and encouragement he has always given me. 

2 Now at the Ophthalmology Branch, Na- 
tional Institute of Neurological Diseases and 
Blindness, Bethesda, Maryland. 


300 


paradigm. The classical method of 
measuring this effect presents O with 
two illuminated fields, one intermit- 
tent and the other steady, and the 
luminance of one field is adjusted until 
a brightness match is obtained. The 
ratio of the luminances of the two fields 
is then taken as a measure of the phe- 
nomenal brightness of the intermittent 
field; this ratio is often called a bright- 


ness index. Most Es have made these 


brightness matches by presenting both 
fields simultaneously although Mc- 
Dougall (1904), Martius (1905), 
Flügel and McDougall (1909), Haas 
(1923), Aiba and Stevens (1964), and 
Wasserman (1966) have presented 
two fields successively with interstimu- 
lus intervals of up to 30 sec. Both 
techniques produced a measurable ef- 
fect. 

Recently, Raab, Fehrer, and Hersh- 
enson (1961) presented data which 
Suggested that brightness enhancement 
in intermittent light might not be due 
to intermittence at all, but entirely to 
the juxtaposition of an intermittent 
and steady light in O's field. Bright- 
ness enhancement would then reduce 
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to a special case of contrast. This 
suggestion that brightness enhance- 
ment might be an interaction artifact 
is puzzling in view of the above-men- 
tioned demonstrations of an effect 
when the two fields are clearly sepa- 
rated in time. Moreover, Raab (1962) 
himself later demonstrated an enhance- 
ment effect with a single, isolated flash 
of light when Stevens' method of mag- 
nitude estimation was used. 

Informal observations made in our 
laboratories by Hans-Lukas Teuber 
suggested the rather opposite conclu- 
sion, that the presence of other stimuli 
hindered the production of brightness 
enhancement. An intermittent light 
which was surrounded by a steady 
light seemed to exhibit little or no en- 
hancement. Moreover, an examination 
of the literature revealed large differ- 
ences in the amount of brightness en- 
hancement produced by various stimu- 
lus configurations. The amount of 
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brightness enhancement at the optimal 
flash duration as measured by the 
brightness index is shown in Fig. 1 as 
a function of luminance. The stimulus 
configurations used are shown next to 
the appropriate curves. It is true, of 
course, that these experiments differed 
in many other respects, but there isa 
strong suggestion here that brightness 
enhancement, far from being an inter- 
action artifact, is a primary effect 
which is sensitive to the presence of 
other stimuli. This suggestion is re- 
inforced by the data of Ball (1963) 
and Aiba and Stevens (1964) which 
show that brightness enhancement is 
affected by a change in the state of 
adaptation. As an extensive investiga- 
tion of brightness enhancement was 
planned, it seemed wise to determine 
which stimulus conditions were most 
favorable for the production of the 
effect. 


Luminance (db re | mL) 


Dependence of brightness 


Fic. 1. 
different con 


experiments using three 
field, F — flash field, and S 


enhancement on luminance for three 
figurations as shown. (A — adapting 


— steady field.) 
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Fic. 2. Maxwellian view apparatus assembled for this study. 


METHOD 


Apparatus.—The Maxwellian-view system 
used in these experiments is shown in Fig. 
2. This system has been described in de- 
tail elsewhere (Wasserman, 1966), and only 
the salient features will be described here. 
Three independent light channels were pro- 
vided, and the appearance of all three illu- 
minated fields to O is shown in Fig. 3. 
Aperture stops formed artificial pupils with 
maximum diameters of 1.6 mm, The color 
of the adapting field was visually matched 
to that of the daylight-quality stimulus fields. 
Neutral filters were used for gross attenua- 
tion only and were not changed during the 
course of this experiment, The intensities 
of all three fields were varied in a spec- 
trally nonselective manner, Luminance cali- 
brations were made with a Macbeth il- 
luminometer using a haploscopic matching 
technique. 


Fic. 3. Appearance of the stimuli used in 
the experiment, 


Observers.—Four members of the depart- 
ment served as Os in addition to myself. 
Three were naive as to the purpose of the 
experiment while one (DWP) was not. 
All read the H-R-R pseudoisochromatic 
plates with no errors and had essentially 
normal vision, except for refractive errors. 

Procedure.—The experimental design is 
illustrated in Fig. 4. Matches were made 
simultaneously or successively and with the 
Presence of the adapting field constant or 
variable in a 2x2 factorial design. The 
interstimulus intervals were always 1 sec. 
to eliminate masking effects and the stimuli 
Were presented for 2 sec. The intertrial 
interval was controlled manually by myself 
and was several seconds, Matches were 
made by a modified method of limits except 
for my data which were obtained by the 
method of adjustment, One ascending and 
one descending trial were averaged to de- 
termine a match. Each O made five matches 


Lae pe is 
—H. mpm -— 
E [stes ee Oe [—] 
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3 Fre. 4. Design of experiment. (Each 
line indicates whether a field is illuminated 
at any given time. A field = adapting field, 


F field = flickering field, and S field = steady 
field.) 
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Fic. 5. Uncorrected results given as the luminance of a steady light 
required to match a flickering light as a function of the repetition rate. 
(Numbers in the upper right corner of each cell indicate the number of 
data points showing brightness enhancement.) 


303 


at five repetition rates for each of the four 
cells of the design. The data points were 
presented in a random order determined by 
a table of random numbers. Experimental 
sessions lasted 1 hr., and each O was given 
i hr. of practice before data collection was 
begun. Several minutes were spent at the 
start of each session during which time 
O was adapting to the 1-mL. adapting field. 
The flickering light was presented at a 
luminance of 200 mL., and the luminance of 
the steady field was varied. Since large 
constant errors are common under such re- 
duced conditions, matches were made be- 
tween two steady lights as a control In 
addition, repetition rates of 3%, 6$, 133, and 
663 per second were used with the light- 
dark ratio always kept at 1:2. The Os were 
instructed to base their judgments on the 
appearance of the stimuli some time after 
stimulus onset, thus minimizing enhancement 
effects at the onset of the steady light, as 
well as boundary effects in the train of 
flashes (cf. Raab & Osman, 1962). 


RESULTS 


The luminances required by each O 
to match the flickering light are pre- 


sented in Fig. 5 along with a mean 
trend line. Table 1 records the means 
and standard deviation for each O and 
for the grouped data. The differences 
due to the experimental treatments are 
quite clearcut and indicate that the 
presence of other stimuli reduces the 
brightness enhancement produced by 
a flickering light. Successive presenta- 
tion of the stimuli favors the produc- 
tion of brightness enhancement (F = 
217.50, p < .001) and the absence of 
the adapting field also favors the 
production of brightness enhancement 
(F =18.10, p «.001) although the 
effect of this treatment is less pro- 
nounced. The anticipated constant er- 
rors are present and the amount of 
constant error differs from cell to cell. 

The F ratios and probabilities given 
above were obtained by performing an 
analysis of variance on data which had 
been corrected for the appropriate con- 
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TABLE 1 


MEAN LUMINANCES REQUIRED TO MATCH A 200-ML. FLICKERING LIGHT 


Repetition Rates 


Os Simultaneous Successive 


Steady 3.33 6.67 13.3 66.7 Steady 3.33 6.67 13.3 66.7 


Variable 
GW 162 184 162 87 71 142 205 245 100 53 
(14) | (42) | (28) | (14) | (10) | (3) (33) | (38) | (10) | (15) 
TD 134 171 169 89 61 157 184 265 191 57 
(13) | (30) | (29) | (14) (9) | (18) | (44) | (15) | (42) (9) 
HB 159 179 119 81 70 154 174 149 117 71 


DWP 145 128 86 7 51 128 154 108 91 60 
(39 | (43) | (27) | (38) (10) | (11) (9) | (44) (7) (8) 
M 155 156 128 80 61 148 173 186 127 60 
(25) (37) | (40) (19) | (15) (19) (34) (69) (42) | (11) 

Constant 
EFIE | | gb | e as e | gy 
LT D (3) (34) (22) IG T 181 {3 fo 6 
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HB 173 160 120 7. Ed (o fen {op ie K? 
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Fic. 6. Effect of mode of comparison after Fic. 7. Eff i 
; 2 ect of mode of adaptation after 
correction for constant errors and after Correction for constant pict ani after 
averaging over the two levels of adaptation, averaging over the two levels of comparison. 
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stant error. Each brightness match 
was expressed as a ratio to the mean 
steady match found in the same cell 
of the experimental design. Figures 
6 and 7 illustrate the effects of each 
of the experimental treatments. after 
normalization for constant errors and 
after averaging over the two levels of 
the orthogonal treatment. This nor- 
malization is not perfect as can be seen 
by the fact that the brightness index 
at fusion (663 per second) is some- 
what higher than the value of .33 re- 
quired by Talbot's law. Nevertheless, 
this first-order correction enables us 
to see the treatment effects clearly. 
In both cases, isolation of the flicker- 
ing light increases brightness enhance- 
ment. 

Subjective colors.—During the course 
of the experiment, it became obvious 
that the mode of comparison was in- 
fluencing the production of so-called 
subjective colors. All Os complained 
that simultaneous comparisons were 
difficult when brightness enhancement 
was present because of a color differ- 
ence which appeared between the two 
fields. Four Os saw this color differ- 
ence as a reddening of the flickering 
light, while one saw the steady light 
become green. On the other hand, no 
O complained of this problem when 
successive matches were being made. 
Upon interrogation, Os reported that 
there might be a color difference in the 
successive mode but they could not be 
certain. It is clear that there is a 
dissociation of the effect of the mode 
of comparison on brightness enhance- 
ment and on subjective colors; the pos- 
sibility exists that subjective colors are 
not seen when an isolated flashing light 
is observed. 


Discussion 


In the interpretation of these results, 
it is essential that a clear distinction be 
drawn between first- and second-order 
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contrast effects. A first-order contrast 
effect is a change in brightness due to 
a juxtaposition of two lights in O’s field. 
A delay in the onset of this first-order 
contrast effect might produce a time- 
dependent change in the brightness of 
an intermittent field which would be 
formally identical with the change one 
calls brightness enhancement. From this 
point of view, however, the phenomenon 
would be a reduction of brightness due 
to contrast rather than an enhancement 
due to intermittence itself. 

The results of this experiment agree 
with those of earlier experiments which 
demonstrated the existence of brightness 
enhancement when the intermittent field 
is isolated in space and time from all 
other stimuli. Furthermore, at the lumi- 
nance used. in this experiment, an iso- 
lated intermittent light shows the greatest 
amount of enhancement. It is not likely, 
therefore, that first-order contrast effects 
play any role in producing brightness 
enhancement as such. 

The effects of other stimuli on bright- 
ness enhancement would seem to be easily 
understood as second-order manifesta- 
tions of contrast. The juxtaposition of 
a contrast-inducing field with a test field 
of a given luminance is in many respects 
equivalent to reducing the luminance of 
the test field. Indeed, the reduction of 
luminance of a comparison field required 
to produce a brightness match with such 
a test field is a common method of meas- 
uring contrast. Brightness enhancement 
depends on luminance, as Fig. 1 showed. 
It would seem reasonable, therefore, that 
the reduction of brightness enhancement 
by the presence of other stimuli is the 
result of an effective reduction in the 
luminance of the intermittent light due 
to contrast. If this explanation is cor- 
rect, successive contrast would also be 
expected to reduce brightness enhance- 
ment, That is, if the two fields to be 
matched were presented successively in 
the same retinal location we would ex- 
pect less enhancement than if they were 
presented successively to separate retinal 
locations. This explanation also predicts 
the reversal of the effect of contrast- 
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inducing stimuli when an extremely in- 
tense intermittent light is presented since 
increasing luminance eventually begins to 
produce less brightness enhancement (cf. 
Fig. 1). 

The dissociation of these brightness 
enhancement phenomena from subjective 
colors is intriguing in view of the belief 
that these two effects are closely linked 
(cf. Aubert, 1865; Brücke, 1864; Cohen 
& Gordon, 1949). Moreover, these re- 
sults are not what one would expect if 
Fechner's (1838) generally accepted ex- 
planation of subjective colors were true. 
For Fechner, subjective colors were a 
result of certain hypothetical differences 
in latency in the color systems of the eye. 
The observations reported here would 
suggest that subjective colors are partly 
(perhaps entirely) dependent on the 
presence of contrast-inducing stimuli in 
the visual field. These observations are 
consistent with the historical fact that 
complex rotating patterns have been the 
preferred stimuli for the production of 
subjective colors. A careful colorimetric 
investigation of this problem would seem 
to be worthwhile. 
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4 groups of 11 rats each were trait 
factorial design 


immediately rew: 


10-sec. delay-of-reward trial (D). 
both the Ist an 


increased by nonreward on 
being greatest to least in the or 
results were interpreted 
while being incompatible with al 


partial reward. Consideration w: 


variability in extinction. 


It appears possible to employ both 
nonrewarded trials and delay of re- 
ward trials in the same schedule of 
reward in a manner which will allow 
a convincing and critical test of several 
current formulations dealing with de- 
layed reward and/or partial reward. 
First of all, assume that habit strength 
is a negatively accelerated decreasing 
function (logarithmic, exponential, or 
some other form) of the interval of 
delay of reward and falls to a low or 
a near zero value at about 10 sec. delay 
of reward. Assume further that the 
aftereffect of 10 sec. delay (S?,o) is 
rather similar to the rewarded (85) 
aftereffect and rather dissimilar to the 
nonrewarded (SN) aftereffect (Ca- 
paldi & Poynor, 1966). Taken to- 
gether, these assumptions contain an 
implication tested in this investigation, 
that the separation of a nonreward 
irial from a subsequent immediately 
rewarded trial by a 10-sec. delay-of- 
reward trial will produce relatively 
rapid extinction. 


1 This investigation was supported in part 
by National Institute of Child Health and 
Human Development Research Grant HD 
00949-02 to the first author. 


ned in the runway employing a 2 X 2 


in which the 1st and 2nd of 2 trials preceding an 
arded trial could be either nonrewarded (N) or a 


Resistance to extinction was 
d 2nd trials, resistance 
der NN, DN, ND, and DD. The 


as supporting an aftereffects hypothesis 
lternative hypotheses of delay and/or 


as given to the role of stimulus 


METHOD 


Subjects —The Ss were 44 naive male 
albino rats about 95 days old when pur- 
chased from the Holtzman Company, Madi- 
son, Wisconsin. 

Apparatus.—The runway was 82 in. long, 
4 in. wide, with 9-in. high sides, covered 
with hinged, 3-in. hardware cloth. An 8-in. 
long floor treadle, suspended ove: a micro: 
switch, constituted the initial portion of the 
alley and when the treadle was depressed 
by the rat, whose front paws were always 
placed at its extreme forward edge, the first 
clock (.01 -sec.) was started. Interrupting 
an infrared photobeam 2 in. beyond the 
treadle stopped Clock 1 (start time) and 
started Clock 2. Interrupting a photobeam 
52 in. beyond the first photobeam stopped 
Clock 2 (run time) and started Clock 3. 
Interrupting a photobeam 154 in, beyond 
the second photobeam and 23 in, in front 
of a brass 2X 4X 14 in. food cup covered 
by a tightly fitting, sliding, metal lid stopped 
Clock 3 (goal time) and depending on the 
settings on a series of Hunter timers opened 
the lid covering the food cup either im- 
mediately or 10 sec. later. A guillotine 
door, 14 in. from the end of the alley, was 
manually lowered behind S after the third 
photobeam was broken. On every trial S 
was allowed 60 sec. to traverse each section 
of the alley. If 60 sec. elapsed in one sec- 
tion of the alley, S was guided to the next 
section, and from the goal section S was 
guided to the goal box. The elapsed time 
on the start, run, and goal clocks was added 
and is termed total. 
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Experimental design.—All Ss received five 
trials each day. The experimental design is 
a2X2 factorial. Let us refer to those trials 
preceding an immediately (I) rewarded trial 
which could receive either nonrewarded (N) 
or delayed (D) reward as the A and B trials, 
The 2 X 2 factorial results from the A trials 
and the B trials being either D or N trials. 
Group NN was trained under the five-trial 
schedules (a) I, I, N, N, I, or (b) I, N, N, 
I, I; Group DN under the schedules (a) I, 
I, D, N, I, or (b) I, D, N, I, 1; Group ND 
under the schedules (a) I, I, N, D, I, or 
(b) I, N, D, I, I; and Group DD under the 
schedules (a) I, I, D, D, I, or (b) I, D, D, 
I, I. For each group the a schedule oc- 
curred on odd-numbered training days, the 
b schedule on even-numbered training days, 

Habituation—On arrival at the laboratory 
Ss were placed in individual cages and for 
3 days given ad lib food and water. Be- 
ginning on Day 4 Ss were fed Rockland 
pellets in the home cage for only 1 hr. each 
day, water always being available, Hand- 
ling occurred on Days 14, 15, and 16 and 
on Day 17 each S explored the apparatus 
for 5 min. Two 50-sec. goal-box placements 
separated by a 20-sec. interval Occurred on 
Days 18, 19, and 20. On these days the 
goal cup was plentifully supplied with J. P, 
Noyes .045-mg. pellets and the lid covering 
the goal cup was in Open position. On Days 
21 and 22 each S received three goal-box 
placements separated by a 20-sec. interval, 
the lid covering the goal cups being in the 
closed position, opening and making a char- 
acteristic slight noise as E's hand and S’s 
body swept past the infrared beam as S was 
placed in the goal box, On these days the 
reward on each placement consisted of six 
-045-mg. pellets, 

Acquisition training —On immediately re- 
warded trials and on 10-sec. delay-of-reward 
trials $ received six .045-mg. pellets and was 
removed from the goal box after the pellets 
Were consumed. On nonreinforced trials the 
lid covering the empty food cup opened im- 
mediately and $ was removed 15 sec. after 
goal-box entry, Each $ was run individually 
and at the same time each day, the order 
of running Ss being randomized beforehand 
so that no two Ss from the same group were 
Tun consecutively. There were 10 days of 
acquisition training. The intertrial interval 
Was 20 sec. and § was fed the I-hr. daily 
ration about 10 min. following the last trial 
of the day. 

Extinction —There Were 5 days of extinc- 
tion training at four trials each day. On 


each trial the lid covering the empty food 
cup opened immediately and 15 sec. later § 
was removed from the goal box and given 
the 20-sec. intertrial interval. 


RrsurTS 


All times were converted to recipro- 
cals. Figure 1 shows the mean median 
total speed for each group on each day 
of acquisition and of extinction. Total 
speed is quite characteristic of the 
speeds obtained in each of the individ- 
ual alley segments, Analysis of vari- 
ance on the total speeds on Day 10 
of acquisition indicated a nonsignificant 
difference for the A trial, F < 1, the 
B trial, F<1, and the A x B inter- 
action, F < 1, 

The extinction results are quite or- 
derly with Group NN being most and 
Group DD being least resistant to 
extinction and Group DN being more 
resistant to extinction than Group ND. 
Analysis of variance on the median 
total speeds over all 5 days of ex- 
tinction indicated that the A-trial effect 
was not significant, F (1, 40) = 3.14, 
P < .10, that the B-trial effect was 
highly significant, F (1, 40) 2 23.33, 
$ <.001, and that the A x B inter- 
action was not significant, F (1, 40) 
= 2.28, p<.10. A subsequent Dun- 
Can's range test. indicated that all 
groups except NN and DN differed 
from each other at the .05 level or bet- 
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Fic. l. Mean median speed for each group 
on each day of acquisition and extinction. 


DELAYED REWARD AND THE AFTEREFFECTS HYPOTHESIS 


ter. Analysis of variance over the last 
2 days of extinction indicated signifi- 
cant differences for the A trial, F (1, 
40) = 5.27, p < 05, and the B trial, 
F (1, 40) = 25.00, p < .001, but not 
for the A X B interaction, F < 1. A 
subsequent Duncan’s range test indi- 
cated that all differences between 
groups were significant at the .05 level 
or better. 


DISCUSSION 


The theoretical principles contained in 
two previous papers (Capaldi, 1964; 
Capaldi & Poynor, 1966) plus the as- 
sumption that habit strength is a nega- 
tively accelerated decreasing function of 
the length of delay of reward are re- 
quired to deduce the present results 
from an aftereffects hypothesis. First, 
the relatively great resistance to extinc- 
tion of Group NN follows from the stimu- 
lus modification assumption (Capaldi, 
1964). According to this assumption a 
single nonreinforced trial produces the 
aftereffect S% while two successive N 
trials produce the aftereffect Sha The 
aftereffect S"« supplies more generalized 
habit strength than S% to the higher 
valued SNs which occur in extinction, 
eg, S", S"» etc, and therefore re- 
sistance to extinction is greater following 
two successive nonreinforced trials than 
following a single nonreinforced trial. 
It should be noted, too, that the stimulus 
modification assumption expects that the 
greater resistance to extinction of Group 
NN relative to Group DN should develop 
mainly in the late extinction trials, a 
result obtained in this investigation. This 
is because the proportionality between the 
generalized habit strength supplied by 


2 The modification of the aftereffect as a 
function of successive trials will be repre- 
sented as S~“; (one nonreinforced trial), S" 
(two successive nonreinforced trials), etc. 
The effect of confinement duration will be 
represented as SD, (10-sec. delay trial). 
According to this system, a complete repre- 
sentation of the aftereffect of 15 sec. confine- 
ment on a single nonreinforced trial would 
be Sis. 
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S* (S*H) and that supplied by S^ 
(S*H) is greater, the higher the value 
of S“ under consideration, eg, SWH/ 
SMH at S& « S&H/SSH at S'» (Ca- 
paldi, 1964) . 

According to the present hypothesis, 
Group DD was least resistant to extinc- 
tion because the aftereffect of 10 sec. 
delay is quite similar to the aftereffect 
of reward and therefore the delay after- 
effect supplies the SNs which occur in 
extinction with relatively little general- 
ized habit strength (Capaldi & Poyner, 
1966). Group DN was more resistant 
to extincton than Group ND because the 
SX in Group DN was followed by im- 
mediate reward while in Group ND the 
aftereffect S*: was followed by 10 sec. 
delay of reward. Since habit strength 
is a decreasing function of the length of 
delay, it follows that the S^ of Group 
DN had more habit strength than the 
SX of Group ND, and therefore the 
higher valued SNs occurring in extinc- 
tion received more generalized habit 
strength in Group DN than in Group 
ND. 4 

From the viewpoint of the aftereffects 
hypothesis, the greater resistance to ex- 
tinction of Group ND relative to Group 
DD could be due to either one or both 
of two mechanisms: (a) the S^ in 
Group ND could have been supplied with 
some habit strength on the 10-sec. delay 
trial, or (b) in Group ND the after- 
effect S%: was not completely replaced by 
the delay aftereffect and on the subse- 
quent immediately reinforced trial this 
stimulus, a combination of the non- 
rewarded and delayed aftereffects, was 
conditioned to the instrumental response. 

A major theoretical usefulness of a 
blocking procedure is that it is capable 
of producing experimental results which 
provide a clear choice between intratrial 
and intertrial theories. Intratrial theo- 
ries attempt to deduce increased resist- 
ance to extinction on the basis of events 
occurring on the nonreinforced trial per 
se. Thus, for such theories it makes 
little difference if the block occurs before 
the N trial (Group DN) or after the N 
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trial (Group ND). Intertrial theories 
such as the one considered here suggest 
that between-trial events regulate per- 
formance. It seems doubtful that the 
ordering of the four groups in extinction 
in the present investigation could be ac- 
counted for on the basis of the frustra- 
tion hypothesis (Amsel, 1958; Spence, 
1960), the y, hypothesis (Logan, Beier, 
& Kincaid, 1956), the dissonance hy- 
pothesis (Lawrence & Festinger, 1962), 
or, for that matter, on the basis of 
any other available intratrial hypothesis. 
Moreover, even an intertrial hypothesis 
such as the discrimination hypothesis 
(Gonzalez & Bitterman, 1964) seems un- 
able to account for the present results 
in their entirety. According to the dis- 
crimination hypothesis, resistance to ex- 
tinction should decrease as the organism 
becomes better able to discriminate the 
acquisition series of events from the ex- 
tinction series of events. There seems 
to be no obvious reason why it should be 
easier to discriminate acquisition from 
gon for Group ND than for Group 
N. 
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PREFERENCE AND RECALL 


HARRY MUNSINGER ? 
University of Illinois 


The ability to recal 
and the relation of this ability 
investigated. The result: 
direct relation 
stimulation and pr 


We have shown that adults express a 
preference for an intermediate amount of 
variability of strings of words and letters 
(Munsinger & Kessen, 1964). In the pres- 
ent study, Ss stated their preferences for 
eight levels of variability of letters and 
words in a complete paired-comparison de- 
sign and then recalled as many of the strings 
of letters and words as they could imme- 
diately following the preference judgments. 

We were interested jn the relation among 
three observations: (a) the limitation on 
the ability of human Ss to recall strings of 
letters and words (Miller & Selfridge, 
1950), (b) the differential preference of 
human adults for these strings of letters and 
words, and (c) the relation between recall 
and preference. 

Method.—Thirty-two students from intro- 
ductory psychology classes at the University 
of Illinois served as Ss in this study. 

Eight levels of variability (sequential pre- 
dictability) of strings of letters and words 
(ranging from redundant letters through 
prose to random letters) were presented to 
all Ss in a complete paired-comparison de- 
sign. These stimuli were samples of the 
differing orders of approximation to Eng- 
lish used by Miller and Selfridge (1950) in 
their studies of the effect of context on free 
recall of verbal materials. The Ss were 
asked to state their preference for one mem- 
ber of each pair. The pairs were projected 
for 10 sec. with a negligible interval be- 
tween pairs. The Ss were told that they 
would be asked to recall the items following 
the preference judgments. After making the 
judgments, Ss were given 10 min. in which 
to write down as many of the strings of 
words and letters as they could remember. 

Results—Figure 1 shows the percentage 
of strings of letters and words correctly 


1 This research was supported in part iii a grant 


from the Carnegie Corporation of New Yor! 
i San Diego. 


2Now at University of California, 
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OF STIMULUS VARIABILITY ! 


WILLIAM KESSEN 
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AND 


of differing variability 
to preference for the strings were 
the expectation that there is a 
f Ss to recall variability of 


recalled. The ordering of stimulus varia- 
bility along the abscissa assumes an ordinal 
relation of increasing variability from re- 
dundant letters through prose to random 
letters. The curve indicates high recall for 
redundant strings and prose, and then a 
decreasing recall of the strings from prose 
to random letters. The discontinuity was 
statistically stable, F (7, 217) = 29.2; P< 
001 for cubic trerd. 

Figure 2 Shows the results of a complete 
paired-comparison scaling analysis of Ss’ pref- 
erence for the strings of letters and words. 
As we have shown earlier, Ss preferred an 
intermediate amount of variability and did 
not like strings of letters and words well 
below or well above this intermediate range. 
Moreover, a positive relation exists between 
the recall of strings of high variability and 
the stated preference of Ss for these mate- 
rials. The Ss who prefer high-variability 
materials recall them better, x (1) = 32.2; 
p< 01. 


These results support two propositions 
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Fio, 2. Scale scores of erie for strings of 
letters and words di! ering in variability. 


concerning the way in which human beings 
approach environmental variability. We had 
assumed earlier that there was a limit on the 
ability of human beings to store and process 
environmental variability (Miller, 1956; 
Munsinger & Kessen, 1964) ; this assumption 
received support from the relation found 
between Ss' free recall of strings of letters 
and words and the variability of the stimuli. 
Recall was almost perfect for redundant and 
prose materials but dropped precipitously 
and systematically as variability increased. 
A second assumption of earlier work was 
that Ss' Preference for variability is deter- 
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mined by their ability to Process variability. 
Variability which is easily processed (re- 
dundant strings) is not highly preferred. 
Variability at or near Ss’ optimal process- 
ing ability (prose) is highly preferred. Pref- 
erence for variability above this optimal 
point will be directly related to Ss' ability 
to process variability. These expectations 
were supported in the present study. Of 
particular interest are two findings relating 
preference and processing ability for mate- 
rials of high variability. First, the curve 
relating preference and variability above 
prose and the curve relating recall and 
variability above prose are parallel. This 
finding is congruent with earlier results 
relating preference to Ss’ ability to estimate 
numbers of turns of random shapes (Mun- 
singer & Kessen, 1966). The relation be- 
tween processing ability and preference was 
also shown in the significant relation of 
contingency between the ability of Ss to 
recall high-variability materials and their 
stated preference for these materials. 
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ADULTS AND THE INTERMEDIATE SIZE PROBLEM: 


MICHAEL D. ZEILER anp JO LANG 
Wellesley College 


In a recent study of the intermediate size 
problem, Zeiler (1964b) found that adults 


1 This investigation was supported in whole by 
Public Health Service Research Grants MH 08818-01 
and MH 08818-02 from the National Institute of 
Mental Health, 


chose the test-set component most similar 
in size to the positive training stimulus when 
the training and the test sets were similar. 
They transposed when the two sets were 
quite different from each other. Other experi- 
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ments showed that children transposed when 
the training and the test sets were similar 
and failed to transpose with increased differ- 
ences between the two sets (Zeiler, 1963; 
Zeiler & Gardner, 1966). Despite the plausi- 
bility of an explanation of these opposed 
findings in terms of developmental changes, 
the differences found between these experi- 
ments may have been due to variables in the 
experimental situations. Whereas the study 
with adults used five stimuli and a correction 
procedure in the training phase, both in- 
vestigations with children employed three 
stimuli and noncorrection. To determine 
whether the behavior of adults was due to 
these factors, the adults in the current ex- 
periment had either a three- or a five- 
stimulus problem and were trained with 
either a correction or a noncorrection pro- 
cedure. 

Method.—Except for the particular stim- 
ulus sets and the manipulation of the correc- 
tion-noncorrection variable, the apparatus, 
procedure, and instructions were identical to 
that of the simultaneous discrimination con- 
ditions of Zeiler's (1964b) study. Eleven 
square unpainted 4-in. Masonite blocks were 
prepared so that the smallest, 1, had a sur- 
face area of 2.86 sq. in., and each succeeding 
block increased in area by a factor of 14. 
Experimentally naive female undergraduates 
(N = 96) were randomly assigned to one of 
eight groups of 12 Ss. The variables of 
number of stimuli (three or five), training 
technique (correction or noncorrection), and 
the distance of the training from the test set 
(one or four steps) were combined in a 2 X 
2 X 2 factorial design. 

Four groups were trained with the three- 
element set composed of Stimuli 4-5-6, and 
four groups learned with the five-element 
set made up of Stimuli 3-4-5-6-7. Two of 
the four groups that received each training 
set were given a correction procedure: S was 
asked to keep choosing stimuli until the cor- 
rect block was selected. The other two 
groups that had each training set had a non- 
correction procedure in which S was allowed 
a single choice on each training trial. Upon 
attainment of the criterion of five successive 
first choices of the correct block, Stimulus 
5, there were five trials with the test set 
substituted for the training set with all re- 
sponses reinforced. The two groups that 
had each training set with the same training 
procedure were given different test sets. 
Following training with 4-5-6, one was tested 
with 5-6-7 and one was tested with 8-9-10. 
Following training with 3-4-5-6-7, one group 
had Test Set 4-5-6-7-8 and the other had 
7-8-9-10-11. 


Results Each presentation of the training 
set that terminated with a correct choice was 
considered to be a single trial for Ss given 
the correction procedure. An F test showed 
that although none of the interactions were 
significant, the main effects of the number of 
stimuli and the training procedure produced 
differences in both the number of trials and 
first-choice errors required to learn the train- 
ing discrimination ($ «.01). The mean 
trials to criterion, including the five criterion 
trials, were: three-stimulus correction prob- 
lem — 9.3; three-stimulus noncorrection prob- 
lem = 15.0; five-stimulus correction problem 
—109; five-stimulus noncorrection problem 
= 20.9. 

The Ss either transposed or chose the 
stimulus most similar in physical size to the 
positive training stimulus on the first test 
trial Since neither the correction-noncor- 
rection variable nor the number of stimuli 
produced significant differences in the test 
stimulus chosen, the eight groups were com- 
bined into two groups classified by the num- 
ber of steps between the training and the test 
sets. The percentage of Ss that transposed 
on the first test trial for each group of 48 Ss 
is shown by the filled data points in Fig. 1. 

There was no significant difference in the 
frequency of absolute choice and transposi- 
tion on the first test trial in the near test, 
x (1) =2.08, $ > .05. There was, however, 
significant transposition in the far test, x” 
(1) — 24.08, p«.001. The first trial be- 
havior was maintained consistently through- 
out the five test trials; when categorized as 
"same choice in five trials" vs. “not same 
choice in five trials," almost all of the Ss 
were in the first category. 

Discussion.—The choice data were insensi- 
tive to differences in the speed of the original 
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Fic, 1. Percentage of Ss that transposed on the 
first test trial. (The filled data points refer to the 
current study.) 
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learning as well as to the training situation 
and the number of stimuli. Almost all of the 
Ss transposed on the far test, whereas a non- 
significant majority responded absolutely on 
the near test, and no individual S chose ran- 
domly in the five test trials. This experi- 
ment replicated Zeiler’s (1964) study in 
showing that there was more transposition 
than absolute choices with dissimilar training 
and test sets and more transposition on a far 
than on a near test. 

Figure 1 shows the data from studies with 
adults (Zeiler, 1964b; current study) and 
with children either 4-5 (Zeiler, 1963) or 
7-8 (Zeiler & Gardner, 1966) yr. of age. 
The relationship between transposition and 
the similarity of the training to the test sets 
substantiated the fact of distinctly different 
behavior in the intermediate size problem as 
a function of developmental level. With 
children, the amount of transposition de- 
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creased with larger differences between the 
training and the test set, whereas with adults 
transposition increased as the between-set 
differences increased. How these findings 
are related to those of Kendler and Kendler 
(1962) and Zeiler (1964a) on developmental 
changes in discrimination remain to be 
evaluated. 
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UNAVAILABILITY AND ASSOCIATIVE LOSS IN RI AND Pl: SECOND TRY? 


JOHN CERASO ann ANN HENDERSON 
Yeshiva University 


Following an earlier study the present experiment attempted to 
measure associative loss, and availability loss as they might occur 
in RI or PI. Experimental groups were tested for 1st- and 2nd-list 
recall by either matching- or aided-recall tests given shortly aíter 
interpolated learning or 24 hr. later. Control groups were given 
equivalent tests. Association was measured by correct A-B pairing 
in the matching test, while availability was measured by the total 
number of B terms given in aided recall. RI was demonstrated for 
both the associative and availability scores, but spontaneous re- 
covery occurred only for association. PI was demonstrated only for 
the availability score at 24-hr. recall. The results suggested that 
only unlearning produces associative loss, while recall interference 
produces only availability loss. The results were also related to 


theory which states that PI is produced by a growing interaction 


of List 1 and List 2 over time. 


In a recent paper (Ceraso & Henderson, 
1965) a distinction was made between for- 
getting of an A-B association due to unayail- 
ability of the B term, or due to a disruption 
of the A-B association. Two different meas- 
ures were devised in an attempt to index 
these aspects of recall. 

The present study derives from a dissatis- 

1These experiments were supported by Grant GB 


1604 from the National Science Foundation to the 
senior author. 


faction with the associative measure pre- 
viously employed. The previous measure 
expressed association as the ratio of number 
of B terms correctly associated to the total 
number of B terms given, the latter number 
serving as the availability measure. What 
this measure gets at is the associative level 
of B terms which are ayailable. What is 
wanted, however, is a measure of association 
for all items, available or not. Secondly, a 
B term was only counted as correctly asso- 
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ciated when it was paired with both the 
appropriate A and was correctly assigned 
to list. 

In order to deal with these problems the 
present study utilized a matching test to 
measure association. In the matching test S 
is given all A and B terms and has only to 
demonstrate his ability to correctly pair 
them. The S's ability to pair A and B can 
be measured even for Bs which would be 
unavailable in a recall test. To meet the 
second objection one counts as correct an 
appropriate A-B pairing even if S confuses 
the list membership of B. 

Method.—Experimental and control Ss 
were tested for recall by either an aided- 
recall test or by a matching test. These tests 
were given immediately after interpolated 
learning (or after a comparable interval for 
control Ss) for the Immediate-Recall groups 
(IM), or 24 hr. later for the Delayed- 
Recall groups (D). The learning procedures 
for all groups were exactly as described in 
Ceraso and Henderson (1965), as were the 
materials used, with the exception that 8 
instead of 10 pairs comprised each list. 

The aided-recall test was administered by 
presenting each A term and asking S for 
both Bs learned to that A (or for control S's 
just the one B) with S being given up to 15 
sec. in which to recall. Next, S was asked 
if there were any Bs he might remember 
even if he could not recall the A with which 
they had been paired. This test (free recall) 
was aimed at eliciting Ds which were avail- 
able but not paired with an A. The avail- 
ability score is the total number of B terms 
produced by 5. 

For the matching test a sheet was prepared 
which had all 16 Bs listed in alphabetical 
order. Each A was presented and S asked 
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to indicate verbally the two words which 
had been learned to A (S was free not to 
choose). The S choices were recorded by 
E. It was hoped that leaving the matching 
sheet unmarked would minimize the effects 
of guessing. Again, up to 15 sec. were al- 
lowed for matching each A. The associative 
score is the total number of B terms cor- 
rectly given to an A term. 

Undergraduates at Pratt Institute 
Brooklyn served as paid volunteers. 
scheme of random numbers was devised so 
that Ss were placed into one of the eight 
groups in their order of appearance at the 
laboratory until each group had 12 Ss. 

Results.—Table 1 gives the mean number 
of trials to criterion for all lists. An analy- 
sis of variance performed for all eight first- 
learned lists (OL and CL) showed that the 
eight subgroups were comparable in learning 
ability, F (7, 88) = .84. 

The first recall analysis compared avail- 
ability scores for First List (OL) and Con- 
trol List (CL) at IM and D recall points. 
The only significant, F (1, 44) = 16.6, p< 
001, result of this analysis is that 
showed poorer recall than CL. The data 
show a slight rise in availability for both OL 
and CL over time but this effect is not sig- 
nificant, nor is there a significant interaction 
between lists and recall points. Therefore, 
while there is clearly RI for the availability 
score, there is no evidence for spontaneous 
recovery either absolute or relative to the 
control group. 

The analysis comparing availability scores 
for the second list (IL) and the CL at the 
IM and D recall points yielded significant F 
ratios for IL vs. CL, F (1,4) =97, b 
01; IM vs. D, F (1, 44) 215.67, p < 001; 
and the interaction of lists and recall points, 


in 


TABLE 1 
MEAN LEARNING AND RECALL SCORES FOR ALL Lists 


Associative Recall Groups Availability Recall Groups 
List Learning Recall Learning Recall 
M SD M SD M SD M SD 
OL 
IM 9.00 3.83 5.58 1.27 11.08 3.89 6.08 1.40 
D 10.33 3.18 6.17 1.15 10.75 4.00 6.50 1.19 
IL 
IM 8.25 3.07 7.67 E 9.50 .28 7.83 44 
D 9.50. 2.66 6.15 1.23 9.58 3.05 6.00 .91 
CL 
IM 10.08 4.63 8.00 .00 11.83 4.77 7.42 12 
D 11.33 5.26 7.33 1.05 10.08 3.87 7,67 AL 
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F (1, 44) —27.04, p<.001. Inspection of 
the data shows IL not to be poorer than CL 
at the immediate-recall point so the results 
indicate chiefly much PI of the availability 
score occurring over time. 

The next analysis compared the association 
scores for OL and CL at both IM and D 
recall points. Again a significant, F (1, 44) 
= 35.98, p<.001, OL-CL difference was 
found. There was no significant change over 
the 24-hr. interval, but this lack of signif- 
icance is accounted for by a significant, F 
(1, 44) =4.50, 5 «.05, interaction between 
lists and recall points. The analysis, there- 
fore, showed RI for the association score, 
coupled with spontaneous recovery of the 
association score relative to the control 
group, 

A comparable analysis for IL and CL re- 
sulted in a significant F ratio only for the 
IM-D difference, F (1, 44) =9.90, p<.01. 
Neither the IL-CL difference nor the Lists X 
Recall Point interaction proved significant. 
An F test performed on the simple effect of 
IL vs. CL at the D recall point also failed to 
reach significance. These results indicate 
that insofar as the association measure is 
concerned there is little evidence for PI. 

The results can be summarized as follows: 
The first list shows RI of both availability 
and association Scores, but spontaneous re- 
covery is evident only for the association 
Score. The second list shows PI at the D. 
recall point only for the availability score. 

Discussion.—The data confirm the conclu- 
sion reached in the Ceraso and Henderson 
(1965) study that associative loss and avail- 
ability loss are two different kinds of loss. 
But more important, these data give some 
clue as to the processes producing these kinds 
ofloss In the present study the matching 
score is depressed maximally for the OL at 
the immediate-recall point, and over time 
undergoes spontaneous recovery. 

Loss for the second list is mainly due to 
loss in availability (the loss in matching is 
not large enough to be significant) which 
Occurs over the time interval. Thus recall 
interference. (which should be maximal for 
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the IL after some time interval) seems to be 
related to loss in availability. To generalize, 
the data suggest a relationship between recall 
interference and unavailability, and a rela- 
tionship between unlearning and associative 
loss. 

Of interest also is the relation of the pres- 
ent results to a theory of proactive inhibition. 
The primary fact of PI is that it increases 
over time. In an attempt to explain this fact 
Underwood (1948) has postulated that the 
spontaneous recovery of OL from unlearning 
is the source of increasing recall interference 
of the second list. Ceraso, Schiffman, and 
Becker (1965) on the other hand, have 
argued that PI is independent of spontaneous 
recovery. Rather, they argued that PI is the 
result of a more general process whereby OL 
and IL mutually interact over time causing 
loss for both lists. This temporal loss is 
overt for the second list, but masked for the 
first list by spontaneous recovery of that list. 
Data from the present experiment may be 
interpreted as being congruent with that hy- 
pothesis. If one assumes that the loss factor 
which affects both lists has to do with loss of 
availability, then the failure of the first list 
to show a temporal rise in availability may 
be due toa counteracting of spontaneous re- 
covery by the overall availability loss. Sim- 
ilarly, the overt rise in OL matching scores 
may have been possible only because there 
is no overall matching loss over time. The 
hypothesis presented here Teverses the causal 
relation between spontaneous recovery and 
PI proposed by Underwood (1948). Instead 
of spontaneous recovery causing PI, PI (or 
the general temporal loss best measured by 
PI) is assumed to cause a loss in OL which 
prevents or masks spontaneous recovery. 
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According to signal-detection theory the response is a function of both 


sensitivity and the criterion ; thi: 
in short-term memory (STM). 


s experiment tested for criterion shifts 
Each list consisted of 5 A-B paired 


associates followed by a probe (A or B) for 1 of the pairs. 4 Ss were 


tested intensively, and reca 


11 was supplemented with confidence ratings 


and latency measures. Analysis of the results showed that the strength 
of the evoked response (as measured by d’) was invariant over serial 
position but the criterion (as measured by 8) became stricter as reten- 


tion interval increased. 


Thus the predicted changes did occur; these 


findings raise doubts about conventional error analyses in studies of 


human verbal learning an 


d suggest caution in extrapolating from 


probability measures to strength measures. 


One of the long-standing problems 
in the field of human verbal learning 
and retention is the treatment of errors. 
With anticipation or recall S makes 
both correct and incorrect responses ; 
the latter, commonly called “intru- 
sions,” are usually scored and dis- 
cussed separately from correct re- 


iThe data for this experiment were col- 
lected at the University of Vermont, and the 
writer would like to thank Robert Lavallee 
for his assistance. Support was provided 
by Public Health Service Research Grants 
MH 03,330 and 10,882 from the National 
Institute of Mental Health, Public Health 
Service. The paper was prepared during 
attendance at a summer conference on Math 


` Models of Memory, supported by the Center 


for Advanced Study in the Behavioral Sci- 
ences; I would like to thank the other par- 
ticipants for many helpful suggestions and 
criticisms. 


sponses. Considerable theoretical sig- 
nificance is attached to error analyses 
(Conrad, 1965; Postman, 1961) ; in- 
deed, it was on the basis of intrusion 
analyses that a two-factor interference 
theory was first formulated (Melton & 
Irwin, 1940). However, the theoreti- 
cal justification for separation of cor- 
rect responses and errors is not always 
clear. 

Perhaps the nature of the problem 
can be illustrated by reference to classi- 
cal psychophysical studies. Tradition- 
ally, one instructs Ss to report a stimu- 
lus only when it is present, a correction 
for guessing is used, catch trials are 
introduced, wo many false posi- 
tives lead to discarding of trial blocks. 
According to this view errors are 
guesses, essentially determined by non- 
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TABLE 1 
Conrusion MATRIX For SP 1 AND 6 ONLY 


Output Position 


SP Total 
sp | SP6 
Input 1 182 34 216 
Position 6 1 636 637 
"Total 183 670 853 


Note,—Data from Exp. I, Murdock, 1962. 


sensory factors. An alternative view is 
that of signal-detection theory wherein 
the response given by S is presumed 
to depend both upon sensitivity and cri- 
terion (Swets, Tanner, & Birdsall, 
1961). That is, there is a detection 
process and a decision process, and the 
model attempts to separate these two 
experimentally. And, of course, the 
simple dichotomy into correct responses 
and errors is not sufficient to separate 
detection and decision. 

The same analysis may be made for 
recognition and recall studies in human 
verbal learning and retention. Re- 
sponses will be given only when they 
exceed a criterion, and § determines 
the criterion himself. Changes in cri- 
terion will result in different number 
and proportion of correct and incorrect 
responses, and one can draw unambigu- 
ous conclusions about "strength" or 
"learning" only if one can be sure that, 
over time or trials, there is no change 
in criterion level as set by S. Not only 
is the criterion set by S, but also it is 
not measured by E ; thus, there is gen- 
erally no way of knowing whether or 
not it is constant. In this vein Ed- 
wards (1961) has stressed the role of 
costs and. payoffs in determining the 
criterion, but his message seems to 
have been widely disregarded. 

In addition to these general consid- 
erations, we have a more specific reason 
for suspecting the existence of criterion 
changes. Basically, in our studies of 
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short-term memory for paired associ- 
ates, examination of intrusion matrices 
show gradients which are explicable in 
terms of criterion shift. In Table 1 is 
shown a subset from such a matrix; the 
data is from Exp. I (Murdock, 1962) 
where six-pair lists were used, but only 
Serial Positions (SP) 1 and 6 are 
given here. Rows and columns repre- 
sent input and output positions, respec- 
tively, and cell entries the absolute fre- 
quencies, The usual way of reporting 
data is in terms of a priori probability ; 
that is, given a probe from SP 1, 2 
. n, what is the probability of a 
response from SP 1,2...n. For the 
data in Table 1 the a priori probabili- 
ties were .84 and .998 for SP 1 and 6, 
respectively. However, the a posteri- 
ori probabilities (i.e. the probability 
that the probe was from SP 1 given 
that the response was from SP 1) go 
in the reverse direction, .995 and .95 
for l and 6, respectively. For the full 
6 X 6 matrix, the a priori probabilities 
show the recency effect so characteristic 
of STM while the a posteriori prob- 
abilities are essentially flat over all SP. 
Thus, by the former measure, there is 
marked forgetting of single pairs within. 
a matter of seconds ; by the latter meas- 
ure there is no "forgetting" whatsoever. 
The data chosen for illustration are 
atypical only in the magnitude of the 
effects, That is, intrusion analyses of 
many of our studies show a posteriori. 
gradients which, while not completely 
flat, are certainly different from the a 
priori gradients usually reported (e.g. 
Murdock, 1963a, p. 320, Fig. 1). In 
general they show a primacy effect 
Which is completely absent from the 
a priori curves; also (in six-pair lists) 
the probability values at SP 4 and 5 
are clearly the lowest. A very tempt- 
ing explanation for these effects is 
that there is a change in criterion level 
such that, as the retention interval in- 
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creases, the criterion becomes more 
strict. Asa result, fewer responses are 
given from the earlier SP but those 
that are given are more often correct 
(hence higher a posteriori probabili- 
ties). Moreover, it is certainly true 
that omissions (where recall is op- 
tional) increase with retention interval, 
another finding consistent with cri- 
terion changes. 

The present experiment, then, at- 
tempted to find out whether in fact 
the predicted changes do occur. Tt was 
designed deliberately to be similar to 
our previous studies of STM except 
that testing was intensive rather than 
extensive. Four highly practiced Ss 
were used, and the recall data were 
supplemented with confidence judg- 
ments so that, hopefully, one could 
trace out ROC curves for each S at 
each SP. In general the procedure 
can be considered the mnemonic ana- 
logue of studies of intelligibility in 
word recognition (eg., Pollack & 
Decker, 1958). 


METHOD 


Subjects —There were four Ss, two re- 
search assistants of college age (LB and 
DO) and two advanced undergraduates (WS 
and AC) who were working on their own 
research projects under the writers super- 
vision. None of the Ss had previously 
participated in any verbal-learning experi- 
ment, 

Procedure —Each S was tested on ap- 
proximately 26 sessions, with each session 
consisting of 30 lists. Each list consisted of 
five A-B pairs composed of common English 
words paired at random. After each list 
one of the five pairs was tested with one 
word of the pair (either A or B) as the 
probe (ie, cue for recall). The SP of 
the critical pair was randomized across lists 
but occurred six times at each of the five 
possible positions within each session. Pres- 
entation rate was initially 30 pairs/min for 
all Ss; however, after the sixth session it 
was increased to 60 pairs/min for one S 
(WS) whose performance was too good at 
the slower rate. Pairs were photographed 
and presented visually; the exact composition 
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TABLE 2 


RECALL PROBABILITY FROM PREVIOUS 
(Munpock, 1963a) AND PRESENT 


EXPERIMENT 
SP Previous Present 
1 1359 29: 
2 .297 .29 
3 .365 44 
4 596 59 
5 .888 .85 


of the lists varied haphazardly from S to S. 
In general, the procedure followed was not 
different from previous studies, and addi- 
tional details may be found elsewhere (e.g., 
Murdock, 1962). 

After each list Ss were required to re- 
spond; no omissions were permitted. In 
addition, a confidence rating was required ; 
the 6-point scale of Pollack and Decker 
(1958) was used which went from +++ 
(“positive the word is correct") to ——— 
(“positive the word is wrong”). Also, 
latencies were measured by E using a tele- 
graph key and Standard Electric timer; the 
timing started at the end of the probe (which 
was spoken by E) and stopped when $ gave 
his response. All Ss were tested individually 
at approximately the same time each day; 
at no point was knowledge of results given. 


RESULTS 


To test for practice effect the entire 
experiment was divided into three 
blocks of approximately 270 lists each 
and a three-way (Blocks x SP x Ss) 
analysis of variance was done on total 
number correct. Blocks were signifi- 
cant at the .001 level but did not inter- 
act with SP. Mean recall probabilities 
were .492, .659, and .676 for Blocks 
1-3, respectively ; apparently the prac- 
tice effect did not carry over beyond 
the first block. 

To test for comparability with prior 
data, performance on Block 1 was com- 
pared with results from a previous 
study (Murdock, 1963a, Exp. I). 
The comparison is shown in Table 2 
and appears reasonably satisfactory. 
Apparently, adding confidence judg- 
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PROBABILITY ( CORRECT | Rj) 


=== - - * ++ — 


CONFIDENCE JUDGEMENT j 


Fic. 1. Conditional probability of a cor- 
rect response given confidence judgment R; 


as a function of j. (Separate curves for 
each S.) 


ments and latency measures did not 
greatly affect recall probability. 

All subsequent analyses will be based 
only on Blocks 2 and 3, simply to 
eliminate the problem of the practice 
effect. Thus, the conclusions apply to 
well-practiced (and, it might be added, 
highly motivated) Ss, In Fig. 1 are 
shown conditional probability scores as 
a function of confidence judgment, with 
separate curves for each $. It is very 
clear that all four Ss make accurate 
judgments ; when they give the highest 
tating they are nearly always correct, 
when they give the lowest judgment 
they are nearly always wrong, and in- 
termediate judgments are ordered 
accordingly. 

Similar curves are shown in Fig. 2 
except that the parameter is SP rather 
than Ss. (Obviously, the curves in 
Fig. 1 were averaged over SP while 
those in Fig. 2 were averaged over Ss.) 
This curve bears a striking resemblance 
to the comparable curves reported for 
speech communication studies (see Fig. 
5 and 6 in Pollack & Decker, 1958). 
Thus, it would seem that Ss can assess 
their performance quite accurately re- 


gardless of whether they are reporting 
a word spoken at unfavorable signal-to- 
noise ratios or reporting on a word 
presented some time back. Further, 
Pollack and Decker (1958) conclude 
that, “over a range of S/N ratios, Ss 
apparently employ the strategy of 
assigning their confidence ratings to 
achieve fixed average levels of message 
reception [p. 290].” The analogous 
conclusion applies to the SP data of 
this experiment if one considers only 
the extreme judgments; the middle 
judgments are clearly not identical yet 
they do not fall into any simple pattern. 
(The last SP was given such high 
confidence judgments that only three 
points could be plotted in Fig. 2.) 

An attempt was made to construct 
separate ROC curves for each S. Un- 
fortunately, this attempt was not too 
successful ; the confidence ratings were 
often so extreme that there were not 
enough points to determine a curve. 
As a consequence, it was necessary to 
collapse the 2 x 6 matrix (i.e., correct- 
incorrect vs. confidence judgment) 
into a 2 X 2 matrix, with the cutting 
point (selected so as to give the most 
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PROBABILITY (CORRECT | Rj ) 
$ 


CONFIDENCE JUDGEMENT j 


Fic. 2. Conditional probability of a cor- 
rect response given confidence judgment Ry 


as a function of j. 


(Separate curves for 
each SP.) 
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TABLE 3 


MEAN VALUES OF d’ AND B AS A 
Function or SP 


SP d' 

5 2.01 0.43 
4 2.31 1.39 
3 1.85 1.02 
2 2.03 2.68 
1 2.33 3.07 


even split over all conditions) on the 
confidence judgment between +++ 
and ++. (For a more complete de- 
scription of the method see Pollack & 
Decker, 1958, in particular Table Il, 
p. 288.) A2x2 matrix was obtained 
for each S at each SP and from each 
matrix was obtained a value of d’ and 
B using the strong assumptions of 
normality of distribution and equal 
variances. These values were then 
averaged across Ss, and the mean 
values for each serial position are 
shown in Table 3. 

As can be seen, 8 increased (and by 
nearly an order of magnitude) over SP 
whereas d' did not. Analyses of vari- 
ance confirmed these statements; for 
the latter F (4, 12) = 1.50, p > .05, 
while for the former an analysis of 
variance of log (B + 1) gave F (4, 12) 
= 3.27, p < 05. Furthermore, inspec- 
tion of the data of the four Ss sepa- 
rately did little to suggest that these 
results might be artifacts of averaging.” 

As a further test, the “false-alarm” 
rate (ie; P (+++ | Incorrect) ) was 
determined for each S (a) for SP 45, 
and (b) for SP 1-2. For each of the 


2 To maximize expected value the criterion 
should be numerically equal to the odds 
against a correct response. Both the cri- 
terion and the odds varied over about an 
order of magnitude; this statement is the 
strongest one can make since, “. . . any at- 
tempt to investigate, in more than a correla- 
tional sense, the correspondence between 
obtained and optimal criteria appears ex- 
tremely difficult [Green, 1960, p. 1203].” 
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four Ss the former was greater than 
the latter, and for all four 5s combined 
the false-alarm rates were .159 and 
.076, respectively. (By a z test, p 
<.01.) Thus, Ss are more cautious 
in giving a high confidence rating at 
the longer retention interval. As has 
been pointed out, Ss seem to apply 
their confidence ratings consistently 
across retention intervals, at least for 
the +++ judgment (see Fig. 2). 
Therefore, this analysis provides addi- 
tional evidence for a criterion shift 
without the necessity for making the 
strong assumptions of signal-detection 
theory. 

An analysis of the latency data 
showed that, in general, latencies in- 
creased with retention interval. This 
effect is shown in Fig. 3, where median 
latency of all responses for each S is 
plotted as a function of SP. In an 
attempt to partial out accuracy differ- 
ences, the same analysis was performed 
on a subset of responses at each SP; 
namely, those which were correct and 
given the highest confidence rating 
(+++). Median latencies of these 
responses are shown in Fig. 4; clearly 
the last two SPs are indistinguishable, 
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Fic. 3. Median latencies of all responses as 
a function of SP. 
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Fic. 4. Median latencies of all correct re- 
sponses judged +++ as a function of SP. 


but the earlier ones slower. These re- 
sults are very similar to other latency 
data reported for individual Ss (Mur- 
dock, 1963b), and the differences 
among SP are highly significant. 
(Median tests with SP 1-3 vs, SP 4-5 
showed p < .001 for each S$ individ- 
ually.) The echo-box seems to consti- 
tute a ready-access store, but retrieval 
from Secondary Memory takes longer 
(see Waugh & Norman, 1965). 
Within any given retention interval 
latencies and confidence judgments are 
highly correlated. Consistently median 
latencies increase as confidence ratings 
decrease, and the difference across con- 
fidence levels is not inconsiderable. 
Thus, it may be that $ uses a variable- 
length observation interval and assigns 
confidence ratings on the basis of the 
duration of this interval. That confi- 
dence judgments and latencies may be 
interchangeable is suggested by the fact 
that symmetric ROC curves are ob- 
tained when one partitions on the basis 
of latency (making say five equispaced 
cuts in the pooled distribution) rather 
than partitioning on the basis of con- 


fidence judgments. This interpretation 
is by no means certain, but it clearly 
suggests caution in inferring strength 
relationships from latency data even 
with recall probability held constant. 


Discussion 


The above analyses represent an at- 
tempt to apply a detection-decision model 
to STM. Similar applications have been 
suggested for reaction-time experiments 
(Fitts, Peterson, & Wolpe, 1963) and 
for division of attention (Broadbent & 
Gregory, 1963). Also, there have been 
previous applications to STM (e.g., Mur- 
dock, 1965; Norman & Wickelgren, 
1965). Such an application necessitates 
assumptions which may be questioned. 
However, these assumptions seemed not 
too unreasonable in the studies cited 
above, and have also been examined 
critically by Wickelgren and Norman (in 
press). Also, it should be mentioned that 
here in some cases (i.e., for some of the 
four Ss at some SP) there were enough 
points at least to suggest the shape of the 
ROC function. These appeared linear 
when plotted on double-probability graph 
paper, and the slope seldom departed 
greatly from unity. 

As d' is the signal-detection measure 
of sensitivity, and given that d' did not 
vary with SP, should one conclude that 
in fact there was no forgetting? Such a 
conclusion seems unlikely, for it must be 
remembered that this experiment used a 
Type 2, or response-conditional, paradigm 
(Clarke, Birdsall, & Tanner, 1959). 
That is, S first had to select a response 
and then evaluate this response. It is im- 
portant to distinguish between the correct 
response (ie. the A or B member of the 
critical pair) and the response actually 
given by S. In recognition tests the 
former does decrease with SP (Murdock, 
1965), and in the present experiment the 
probability that the correct response was 
the dominant response certainly decreased 
as well. However, the latter did not 
weaken with SP. It was as though a 
response had to reach a certain strength 
before it could be evoked, and this 
strength was invariant with SP. 


THE CRITERION IN SHORT-TERM MEMORY 


Of course, it is quite true that, in an 
application such as this, one is far re- 
moved from the basic situation for which 
the model was formulated, and it is still 
an open question just how close an anal- 
ogy it is. Also, Type 2 studies are not 
completely unambiguous: “First, the ob- 
server has to select (from several possi- 
bilities) the most likely word; second, he 
must evaluate this decision with respect 
to all other possibilities. Such a process 
produces mathematical expressions vir- 
tually impossible to evaluate except under 
the most doubtful set of simplifying as- 
sumptions [Green, 1960, p. 1197].” 
Nevertheless, at the very least the results 
of the present study do seem to show 
rather clearly that there are criterion 
changes in STM, and the problems posed 
by these changes should not be under- 
estimated. 

Presumably we are interested in the 
strength of memory traces and how they 
change as a result of experimental manip- 
ulations. If one tries to assess these 
changes by the conventional recall pro- 
cedure one is in fact getting only those 
responses which exceed S’s criterion, so 
clearly recall probability measures ob- 
tained under these conditions are of 
questionable significance. (The criterion 
adopted by S certainly need not corre- 
spond to the +++ judgment used as a 
cutting point in the present study; the 
point is that some cutting point must be 
used by each S.) Further, to require S. 
to respond (as was done here) gets at his 
strongest response, but it may not tell 
you much about the strength of the 
memory trace in question. A further 
possibility is to require not only first 
choice but also second choice, third choice 

. nth choice, but soon both S and E 
may run out of patience. 

A more appropriate solution might be 
to go back to recognition-type measures 
for STM. One could then use Type 1 
(stimulus-conditional ) procedures, though 
it would be nice to confirm ROC curves 
obtained by confidence judgments in 
some other way. Of course, with recog- 
nition one is simply eliminating the 
search process presumably characteristic 
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of recall; the recompense is that one has 
an easier situation to conceptualize. But 
there is the further problem that recog- 
nition memory depends critically upon the 
distractors (or alternatives) used (e.g. 
Dale & Baddeley, 1962). Perhaps one 
could adopt the procedure used in word- 
recognition tests wherein Ss first mem- 
orize the set of words to be used, and 
fhen tests are constructed on the basis 
of known message vocabulary. This 
procedure would follow the information- 
theory precept of specifying completely 
the ensemble size, and one could work 
with subsets at will. To carry the anal- 
ogy one step further, it may turn out 
that retention interval in STM is an- 
alogous to S/N ratio in intelligibility 
tests, and the recall-recognition distinc- 
tion in the former parallels ensemble size 
in the latter. In any case, the important 
point is that in STM one must be able 
to separate changes in strength from 
changes in criterion, and the criterion 
problem will not disappear by ignoring it. 
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EFFECT OF CHANGE IN 


SEQUENTIAL VISUAL 


STIMULI ON GSR ADAPTATION 
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tained for 20 Ss while they were presented 


GSR records were ob 
s alternating in apparently random 


with a series of 80 stimuli (4 light: 
sequence). Records were obtained for 20 additional Ss using the same 
procedure, except that a novel stimulus (a light not previously 
presented) was substituted in place of 1 of the original 4 lights. 
Typical adaptation phenomena were observed for both groups. 
While the novel stimulus was perceived by all Ss in the 2nd group, 
it had no effect on the adaptation curve for that group. It is 
concluded that novelty is a dimension which needs considerable 


reexamination. 


In recent studies, evidence has been 
presented in support of the orienting 
response. (OR) interpretation of the 
GSR. Allen, Hill and Wickens 
(1963) report that novelty of stimuli 
is a sufficient condition for the evoca- 
tion of the GSR. Similar results have 
also been reported by Berlyne, Craw, 
Salapatek, and Lewis (1963). The 
term novelty is, however, equivocal. 
Stimulus novelty is identified by Ss’ 
report of differentness, or, in GSR 
work, as a change in response ampli- 
tude to the stimulus after adaptation to 
other stimuli has been demonstrated. 

The difficulty with the GSR cri- 
terion, for stimulus novelty, despite its 
apparent objectivity, is that procedures 
for adapting- the GSR may affect the 
critical measure. A change in response 
amplitude may reflect an OR, but the 
magnitude of the OR will be a function 
of the base level achieved in adapting 
Ss.. When a reliable adaptation pro- 
cedure is-not utilized, it may simply 
reflect. incomplete preadaptation. . 

One interesting qualification of the 
OR position is also suggested by a 
study of GSR adaptation in which the 


1 Also The Payne Whitney Clinic, New 
York Hospital, 3 * 
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most significant response-evocation ef- 
fect was attributed to recency of the 
stimulus in the presentation series 
(Hayes, Bronsaft, Bloch, & Welch, 
1964). These investigators conclude 
that change in GSR amplitude is great- 
est for stimuli presented early in the 
sequence. 

Korn and Welch (1962) have re- 
ported a technique of stimulus presen- 
tation that yields fairly rapid and per- 
sistent adaptation of the GSR. The 
method of serial stimuli exposes S to a 
series of similar stimuli (four colored 
lights) that are presented in an ap- 
parently random order, eight. stimuli 
per minute, with no interstimulus inter- 
val. With this procedure, attention is 
maintained, and GSR’ responsitivity to 
the adaptation stimuli is reduced. - 

The above procedure may provide 
precisely the background for the study 
of response to novel stimuli: To what 
extent will GSR adaptation patterns 
be disrupted by the intrusion of a novel 
stimulus? That is, a colored light to 
which S was not previously exposed. 


METHOD 


Subjects —Twenty naive female college 
students: were assigned to the experimental 
group, and 20 to the control group. 
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Fic. 1. GSR adaptation: mean amplitude (in K-ohms) on presentation of 
the amber light stimulus. 


Apparatus.—The GSR records were ob- 
tained with a custom built pathometer 
coupled with a Texas Instruments rectiriter. 
Visual stimuli were presented by means of 
six 60-w. lamps with cellulose acetate filters 
(Edmund Scientific Company). These 
lamps were mounted in a housing with a 
common milk-glass aperture 14 in. in diam- 
eter. They appeared to S as a changing 
circular patch of light. 

A multipole 360° switch programmed the 
stimulus sequence. While not random, each 
of the four hues: amber, green, blue, and 
white, appeared with a mean probability of 
i. The S could not learn the sequence. 

Hunter timers controlled the stimulus 
duration (7.5 sec.) and interstimulus time 
(.01 sec.). 

Procedure.—Each S was seated in a small 
room and finger electrodes were applied. 
The stimulus was presented through an 
aperture in the wall, approximately 6 ft. in 
front of him. The room was darkened, and 
he was instructed to look at the changing 
lights and refrain from unnecessary body 
movement, 

Five minutes were allowed to elapse prior 
to stimulus presentation. 

In the control group, the four lights, blue, 
green, white, and amber, were presented un- 
til they all had appeared 80 times. In the 
experimental group, on the stimulus presen- 
tation, when the amber light was to have 
appeared for the eleventh time, a red light 
appeared instead. This red light was the 


novel stimulus, and never appeared in any 
other position, or at any other time. 


RESULTS 


The data consists of the resistance, 
in ohms, during each presentation of 
the amber light. For each S, there 
were 20 such readings. 

Figure 1 presents the adaptation 
curves for the control and experimental 
groups. The amber light was arbi- 
trarily designated as the critical stimu- 
lus on which GSR amplitude is indi- 
cated. It should be noted that the OR 
interpretation of the GSR phenome- 
non depends upon the configuration of 
the ohms-amplitude curves usually ob- 
tained. These curves generally indi- 
cate a negatively accelerated slope 
which appears to become asymptotic 
at a given resistance level assumed to 
Tepresent adaptation base line. 

In the above context, GSR adapta- 
tion was obtained. There is, however, 
virtually no difference between these 
two curves. Analysis of covariance in- 
dicates homogeneity of regression lines, 
as well. F tests for treatment effects 


GSR ADAPTATION WITH CHANGE IN STIMULUS SEQUENCE 


and within-class regression were not 
significant at criterion (P < .05). 


Discussion 


Every S in the experimental group re- 
ported the presence of the red light. 
Eight Ss made this report spontaneously. 
In view of this fact, and in the context 
of the OR position on the GSR, it would 
be expected that a significant increase in 
the GSR amplitude be observed following 
the presentation of the novel stimulus. 
No evidence, in this study, supports this 
contention. On the contrary, the course 
of GSR adaptation was conspicuously un- 
changed as a result of the introduction of 
the red light. 

In view of these findings, it can be 
stipulated that, at best, the GSR is an 
unreliable index of signal detection. 

It is entirely likely that the disparity 
between the above findings and those in 
previous studies, indicates that an opera- 
tional definition of novelty does not hinge 
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precisely upon criteria of differentness. 
Particularly where the latter are con- 
sidered one member of a dichotomous 
variable. It must be assumed that novelty 
is not a fact, but a dimension which is 
not presently understood. 
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DISCRIMINABILITY OF ASSOCIATION VALUE 
IN RECOGNITION MEMORY 


EUGENE WINOGRAD ax» WALTER VOM SAAL 


Columbia University 


An adaptation of the method of recognition memory is reported 
which allows for the investigation of the discriminative properties 
of sets of verbal material differing in selected properties. 2 related 
studies were performed with CVCs selected from the association value 
(AV) norms of Archer (1960) and Noble (1961) so that different 
outcomes would be predicted by the 2 sets of norms. Ss were pre- 
sented with a list of 60 trigrams correlated on the 2 norms and then, 
in a recognition test, with the original items plus 60 new items and 
asked to rate each trigram as old or new using a confidence scale, 
i of the new items were correlated on the 2 sets of norms while 
the remaining new items were discrepant, Predictions of differences 
in the discriminability of new items based on both norms were con- 


firmed. Analyses of the items on se 
low Noble AV tri; 
letter pairs than 
pronounceability, nor intertype simi 
the results. 


Since the contribution of association 
value (AV) by Glaze (1928) to the 
study of meaningfulness, the most com- 
monly used behavioral measures in 
studies employing AV as an inde- 
pendent variable have been learning 
scores. Recently, interest has devel- 
oped in the discriminative properties 
of AV. In this direction, Portnoy, 
Portnoy, and Salzinger (1964) and 
Schulz and Lovelace (1964) have 
found that Ss take longer to locate low 
AV than high AV trigrams in a search 
task. The present study investigated 
the discriminability of classes of CVC 
trigrams on the basis of differences in 
AV in a recognition memory task. 

In drawing trigrams from tables of 
AV, investigators have available two 
recent sets of revised norms published 
independently by Archer (1960) and 
Noble (1961). Although the correla- 
tion between the two sets of norms is 
high, enough discrepancies exist for 
Archer (1961) and Noble (1962) to 
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veral frequency scales found that 


grams were composed of lower frequency letters and 
were low Archer AV trigrams. 


Neither frequency, 
larity taken singly accounts for 


have made an issue of the relative 
validity of their work and to allow Es 
to draw subsets of items differing in 
AV on the two sets of norms. The 
major point of agreement in the dis- 
cussion regarding the relative merits 
of the two sets of norms is the need 
for empirical comparisons of them in 
performance. The traditional recogni- 
tion memory task was adapted in the 
Present study to investigate the dis- 
criminability of classes of trigrams dif- 
fering in AV and to test differential 
predictions over a restricted range of 
AV for the Noble and Archer norms. 

Consider the case where S is serially 
shown 60 trigrams of high AV on both 
norms (here termed Type A items) 
and then, in a recognition test, shown 
120 mixed trigrams partitioned into 
the 60 original or "old" ones and 60 
"new" ones. If one further partitions 
the 60 new CVCs into 30 low AV and 
30 high AV items, it is reasonable 
to expect that S will incorrectly iden- 


CVC NORMS AND RECOGNITION MEMORY 


tify as old items more new trigrams 
of high AV than new low AV trigrams 
due to the greater similarity of the new 
high AV items to the old AV items, 
or, alternatively, due to the greater 
discriminability of the new items of 
low AV. Furthermore, by selecting 
new items of discrepant AV on the 
Archer and Noble norms, one is able 
to generate differential predictions for 
the two scales in this kind of task. 
Thus, in the first experiment reported 
here, 30 of the new trigrams on the 
recognition test were of high AV on 
both scales (Type B items) while the 
other 30 new trigrams differed across 
the scales. These discrepant items 
were of high Archer AV but relatively 
low Noble AV (Type C). On the 
basis of the Noble norms, these dis- 
crepant items should be more discrimi- 
nable from the original Type A items 
than the Type B items are. On the 
other hand, assuming the validity of 
the Archer norms leads to no differen- 
tial prediction in performance on the 
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and Type C new items are of identical 
high mean Archer AV. The second 
experiment involved new items of low 
Archer and high Noble AV (Type D), 
allowing for a prediction of a difference 
in false recognitions between Type B 
and D trigrams based on the Archer 
norms and a prediction of no difference 
based on the Noble norms. 


METHOD 


Materials —While Archer's norms are 
presented in terms of a single measure, the 
proportion of Ss who answered “yes” for 
each item, Noble’s norms provide three dif- 
ferent scales, a, a’, and m’. Noble had his 
Ss rate the trigrams on a 5-point scale (from 
None to Very Many) according to the num- 
ber of associations they estimated to each 
CVC. The symbol a represents the propor- 
tion of Ss who rated a given item in any 
category above None and is closest to the 
traditional AV of Glaze (1928). The sym- 
bol a' represents the mean of the ratings 
assigned to each item by Ss, and m' is a 
monotonic nomographic transformation of the 
a' scale derived by the method of successive 
intervals. Since the trigrams used in the 
present study were chosen on the basis of 


recognition test since both the Type B percentile values within their respective 
TABLE 1 
VALUES ON DIFFERENT SCALES FOR THE Four Tyres or CVCs Usep 
Old New 
B j 
ca Ao | arek ir her, | areh Archer, | (row Archer, 
High Noble) | High Noble) | Low Noble) | High Noble) 
Mean Archer percentile 69.8 69.2 68.9 53.2 
Range (62-76) | (62-76) (62-76) | (47-60) 
Mean Noble a’ percentile 69.8 69.2 50.6 70.5 
Range (58-80) (62-76) (36-62) (63-77) 
Mean Archer AV (%) 83.6 83.0. 82.7 64.1 
Range (75-90) | (75-90) | (75-90) | (55-73) 
Mean Noble a’ value 3.03 3.00 2.35 3.05 
Range (2.63-3.42) (2.74-3.27) | (1.91-2.76) (2.78-3.29) 
Mean Glaze AV (96) 824 82.5 51.7 86.3 
Mean Krueger AV (%) 93.7 91.6 87.1 89.7 
Median trigram freq. 4,0 19.0 0.0 16.5 
Proportion in trigram frequency table 38/60 28/30 13/30 25/30 
Median responses to two-letter stimuli 6.5 11.5 5:9 10.5 
Mean letter-pair freq. 860 924 524 931 
Ratio of high freq. to low freq. 53/32 22/11 15/18 32/11 
consonants 
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TABLE 2 


ALL THE CVCs or Eacu Tyre Listep 


Old New 
Type B Type C Type D 

Type A i i v Archer, 

(High Archer, High Noble) fies n EARE Gor Archer 
BOL HED LEX RAK BUF JUL BAW KUR BAP NUG 
BUK HIK LIC ROP CIG KAL CIP KUT cov QIL 
BUL HIR LOK ROS CIS LEM FOP MAH CUL SAB 
CAF HIZ LON ROV DEF LEY FUK MUK DET SAF 
CIN HUL LOR SIK DIZ MOC HAC NAH FAS SEV 
CUF HUR LUC suc Duc NOC HAK NEY FIS SUF 
DAT JAX LUK SUD FAK POK HOK SEZ FOL SUL 
DAV JEF MIK SUK FEB RIF HOY TIS FON TAS 
DER JEM MUC TIK FER ROM jie TOC HAB TUR 
DEX JER MUF TUF GIR SEL KAW WOT JIT VEN 
DOP JN PAV VAS GIV TOL KIC wuz JUN VIL 
DOV KAD PEL WAK GON VAG KOG YAC LEC VIR 
FEM KAM PIS WAM HOC VAR KOY YEP NAM WoL 
GAV KER POC YEH HUP WER KUM ZAG NAS WOM 
GUL KOW RAF YOR Joc WIP KUP Zzow NEG YAL 


scales, there is no difference between the a' 
and m’ rankings. Noble has pointed out 
that the a' scale is more sensitive to dif- 
ferences between items of high meaningful- 
ness than is the a scale. The a' scale was, 
therefore, used in selecting items in the 
present study. Percentile values were used 
to facilitate comparisons across different 
measuring units. All items containing “y” in 
the vowel position were discarded from the 
Archer list before percentile ranks were 
assigned since Noble did not include such 
items in his norms. There are 2,100 CVCs 
which occur in both sets of norms. To 
make the desired comparisons possible, it 
was necessary to choose a region of AV in 
which a low enough correlation obtains be- 
tween the two sets of norms so that suffi- 
cient discrepant items can be found. From 
Archer's (1961) calculations of correlations 
between his scale and Noble's a scale, the 
60-80% AV region seemed promising. First, 
90 trigrams of nearly identical high per- 
centile values on the Archer AV scale and 
the Noble a' scale (60 Type A plus 30 Type 
B) were drawn. Then, 30 trigrams of rela- 
tively high Archer rank but low Noble rank 
were located and, similarly, 30 items of high 
Noble but low Archer rank were selected. 
These constituted the Type C and Type D 
items used in Exp. I and Il, respectively. 
The upper half of Table 1 presents the mean 
percentile values and ranges for each of the 
types of CVCs used as well as their Archer 


and Noble scale values. Table 2 presents all 
the CVCs used. 

Subjects—Four groups of male Ss were 
drawn from introductory laboratory sections 
in Columbia College. Two groups (» — 22, 
n=17) served in Exp. I, and two groups 
(n—22, n —16) in Exp. II. 

Procedure —Experiment I: Each of the 
two groups served separately, but as a group, 
in their usual laboratory room. After being 
instructed that they were serving in an ex- 
Deriment on memory, they were shown the 
60 Type A CVCs at a 5-sec. rate, by means 
of a manually operated slide projector on a 
screen at the front of the room. The order 
of the 60 CVCs was changed for the second 
group. After one presentation, sheets of 3- 
digit numbers were passed out and 4 min. 
were devoted to work on an interpolated 
arithmetic task (crossing out numbers divis- 
ible by 7). Following this, instructions were 
read for the recognition memory test. The 
Ss were asked to rate each of 120 CVCs on 
a 6-point confidence scale. The 120 CVCs 
were printed on sheets, 30 to a page, in a 
random order, and the order of the sheets 
was permuted. The 120 CVCs rated in- 
cluded the 60 old Type A CVCs, 30 new 
Type B CVCs, and 30 new Type C CVCs. 
Next to each trigram was a space for S’s 
rating. The rating scale which follows was 
Printed on the blackboard in front of the 
room and Ss were instructed to refer to it 
as necessary during the self-paced recog- 
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TABLE 3 


PROPORTION OF TIMES A GIVEN CATEGORY 
Was Usep Tuat Ir Was ASSIGNED 
TO AN OLD WorD 


Rating Category 


1 2 3 4 5 6 
Exp. I. | .91 |.62 | 47 +33 | 2311] 14 
Exp. II | .91 | .62 | .43 29 | .27 | 21 


Positive it is an old syllable (i35. 
but not positive, it is an old 
Think it is an old syllable, 
but not too sure (3); Think it is a new 
syllable, but not too sure (4); Fairly sure, 
but not positive, it is a new syllable (5) ; 
Positive it is a new syllable (6). The 
utility of the rating scale in conjunction with 
the method of recognition memory has been 
demonstrated by Egan (1958) and applied 
to the investigation of verbal dimensions by 
Winograd (in press). 

Experiment II: The difference between 
experiments lies in the substitution of 30 
Type D new syllables for the Type C syl- 
lables. Again, two groups of Ss were used, 
the only difference in treatment being the 
order in which the old trigrams were shown 
to Ss. 


nition test. 
Fairly sure, 
syllable (2); 


RESULTS 


As a check on at least the ordinal 
status of the confidence scale, the 
proportion of the total number of ob- 
servations placed in each rating cate- 
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gory that were, in fact, old items was 
computed for each experiment. This 
proportion should decrease with de- 
creasing confidence in the oldness of 
a trigram. The data of Table 3 are 
consistent with this expectation. 

The comparisons of greatest interest 
are between the ratings of the Type B 
and Type C items in Exp. I, and be- 
tween Type B and Type D items in 
Exp. II. In each case, if there is a 
difference in the discriminability of the 
two types of new items, the Type B 
items should have a lower mean rating, 
that is, be rated as older, since they 
are more like the old items in AV. 
Table 4 presents the average ratings 
for the different classes of items for 
both experiments. First, it is clear 
that the Type A or old items are given 
the lowest ratings. Second, in both 
experiments predictions based on dif- 
ferences in AV are confirmed. In 
Exp. I, for combined groups, 28 out 
of 37 Ss gave a lower mean rating 
for the Type B CVCs than for the 
Type C CVCs, with two cases of no 
difference, In Exp. II, 31 out of 38 
Ss gave a lower mean rating for the 
Type B items than for the Type D 
items. Both outcomes are significant 
by a sign test at the .01 level. 


TABLE 4 
Mean RATING ASSIGNED TO EACH CVC Crass 
Old New 
aie OP ED) 
Type A Type B Type C Type D -C or B-! 
i i . | (High Archer, | (Low Archer, 
High Noble) £ Ho Noble) n Noble) | High Noble) 
b denda Ere ij High Noble go e TIT 
Exp. I 
PE San 1 (n=22 2.42 3.86 22 cr 4.41** 
Group 2 (n= 1) 2.06 4.03 4.49 —— 4,45** 
ig Comite’ (n=39) 2.27 3.94 4,34 = — 
"Group 1 (n222) 2.38 4.10 2 4.37 2.81% 
Group2 (n=16) 2.23 4.4 Ue 428 2.90 
Combined (n=38) 2.31 442 = 33 oe 
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The data from the groups run on 
separate days in each experiment were 
analyzed separately. In each case, a 
paired ¢ test of the Type B-Type C 
(Exp. I) or Type B-Type D (Exp. 
II) mean rating difference scores 
yielded a significant outcome (see 
Table 4). 

The data are presented graphically 
as ROC or isosensitivity curves in Fig. 
1 for Exp. I and in Fig. 2 for Exp. II. 
These functions plot the proportion of 
correct acceptances of old (Type A) 
items as old against the proportion of 
incorrect acceptances of new items as 
old for each experiment. There is a 
pair of curves for each group because 
of the partitioning of the false-alarm 
tate (incorrect acceptances of new 
items) between the two sets of new 
items. Chance performance is repre- 
sented by the diagonal. The figures 
portray the greater discriminability of 
the Type C and D trigrams (the 
dashed lines) relative to the Type B 
trigrams. A more complete account of 
the method used in plotting these func- 
tions may be found in Egan, Schulman, 
and Greenberg (1959), 
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Fic. l. Proportion of correct acceptances 
of old trigrams as old plotted against propor- 
tion of incorrect acceptances of new trigrams 
as old for Exp. I, 
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Fic. 2. Proportion of correct acceptances 
of old trigrams as old plotted against propor- 
tion of incorrect acceptances of new trigrams 
as old for Exp. II. 


Discussion 


It is clear from the results of both ex- 
periments that while predictions of dif- 
ferences in trigram discriminability based 
on both norms are confirmed, there is 
no evidence for the greater validity of 
either the Archer or Noble norms within 
the range of AV sampled. The results 
indicate that they are best viewed as 
convergent measures which jointly pro- 
vide a better estimate of AV than either 
does alone. 

An interesting question remains as to 
whether the subsets of discrepant items 
differ systematically in any of the char- 
acteristics which have been found to in- 
fluence the rate of trigram learning. 
Consequently, we have compared the dif- 
ferent classes of trigrams on several 
existing scales as shown in the lower 
portion of Table 1. The average AVs 
taken from the Glaze tables show that 
for the range of AV sampled in the 
Present study, the Glaze and Noble scales 
are more highly correlated than are the 
Glaze and Archer scales, That is, on 
the basis of the Glaze values in Table 1, 
one would predict a difference between 
Type B and Type C items only. The 
differences in AV on the Krueger scale 
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(Appendix A, Underwood & Schulz, 
1960) among the four categories of tri- 
grams do not appear to be large enough 
to provide for clear-cut predictions. The 
median trigram frequencies shown in 
Table 1 are taken from the total fre- 
quency count in prose samples of Ap- 
pendix D of Underwood and Schulz. 
Medians instead of means are given be- 
cause of large variability. The next row 
in Table 1 shows the proportion of tri- 
grams of each type represented in the 
trigram frequency table with a frequency 
of 1 or greater. Both rows indicate that 
the Type C trigrams (high Archer, low 
Noble values) are of lowest frequency in 
prose samples. The next row, taken 
from Appendix F of Underwood and 
Schulz, shows the median number of 
completions to the first two-letter com- 
binations which yield the three-letter 
trigrams used. Again, the Type C tri- 
grams have a lower frequency than the 
Type B or Type D trigrams. Each tri- 
gram is composed of two bigrams and 
the frequency of each bigram of each 
type was taken from Appendix D of 
Underwood and Schulz. The mean of 
these values is shown in Table 1 and it 
is apparent that the Type C trigrams are 
considerably lower than the other types 
in bigram frequency. Finally, a letter 
count was made of the consonants ap- 
pearing in the different types of trigrams. 
Separate counts were made for the seven 
most frequent letters (D, H, L, N, R, 
S, T) and the seven least frequent letters 
(B, J, K, Q, V, X, Z) according to Table 
6 in Underwood and Schulz. It can be 
seen from the last two rows of Table 1 
that the Type C trigrams show the lowest 
ratio of high to low frequency letters. 
The different frequency measures agree 
then, that the Type C trigrams are of 
lowest frequency as units, as completions, 
and in terms of letter composition. While 
these data would suggest an explanation 
for the results of Exp. I, in which the 
Ss’ ratings of Type B and C trigrams 
were compared, they do not account for 
the results of Exp. II in which differ- 
ences in rating Type B and D trigrams 
were found. 
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Since Underwood and Schulz found 
evidence for the importance of pro- 
nounceability as a determiner of trigram 
learning, it is reasonable to inquire into 
the pronounceability of different types of 
trigrams used in the present experiment. 
Unfortunately, not enough of the tri- 
grams used in the present experiment 
are to be found in the pronounceability 
ratings of Underwood and Schulz to 
allow for reliable comparisons. How- 
ever, Underwood and Schulz report that, 
while a substantial correlation exists be- 
tween ease of pronouncing a unit and its 
Glaze AV, the covariation of AV and 
pronounceability is almost exclusively 
found below AVs of 50%. To quote 
(Underwood & Schulz, 1960) “ease of 
pronouncing the syllables increases from 
0 per cent M to approximately 50 per 
cent M, with no appreciable change 
thereafter [p. 282].” Since all four cate- 
gories of CVCs used in the present ex- 
periment were of very high AV, it is 
unlikely that pronounceability can ac- 
count for the results. 

An alternative explanation for the 
greater discriminability of the C and D 
trigrams over the B items may be sought 
on the basis of differential overlap with 
A items in letter composition. Although 
an analysis of single letters revealed no 
noteworthy differences, a comparison of 
the number of trigrams of each new class 
(B, C, and D) sharing letter pairs with 
the Type A, or old, trigrams shows sig- 
nificant differences. The proportion of 
Type B trigrams containing at least one 
letter pair occurring in a Type A. tri- 
gram is 27/30. This proportion is sig- 
nificantly greater than the same propor- 
tion for both Type C (16/30; z— 3.19, 
p<.01) and Type D (18/30; z= 2.68, 
p<.01) trigrams. It is possible that Ss 
are generalizing between new and old 
trigrams to the degree that they contain 
identical letter pairs, especially in the 
light of Postman’s (1951) generalization 
gradient obtained in recognition memory 
by varying letter similarity between old 
and new material. In order to evaluate 
this possibility, a separate analysis was 
carried out for only those new (B, C, 
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and D) trigrams containing letter pairs 
in common with the old trigrams. A 
significant difference between the ratings 
of these selected items was found for the 
B-C comparison (t= 3.24, p < .01) but 
not for the B-D comparison (t= 1.25). 
In the former case, at least, Ss were 
evidently discriminating on some other 
basis than letter-pair identity with Type 
A items. It is not at issue here whether, 
in the last analysis, AV can be frac- 
tionated into some basic formal attributes. 
The point of the present study is that, 
whatever AV consists of, it is discrimina- 
ble with the present technique. The ad- 
yantage of this technique is that it 
provides a method suitable for the inves- 
tigation of the discriminative properties 
of sets of stimuli differing in selected 
properties. 
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PERCEIVED DEPTH AS A FUNCTION OF RELATIVE HEIGHT 
UNDER THREE BACKGROUND CONDITIONS * 


WILLIAM EPSTEIN 


University of Kansas 


The hypothesis was proposed, that the perceived depth, which results 


from the relative height cue, depen 


ds on “optical adjacency.” A 3x3 


factorial experiment was conducted to examine this hypothesis. The 2 


factors were vertical separation 


(3.5, 5.5, 7.5 in.) and background 


conditions (0 background, outline background without surface texture, 


textured background). Verbal est 


imates of the depth between pairs of 


frontal parallel points were obtained under the 9 conditions. In one 
experiment, the backgrounds simulated a floor surface, and in another, 


the backgrounds simulated a ceiling surface. 
Both main effects, separation and 


experiments were comparable. 
background, and the interaction eff 


The results in both 


ect were significant. All the effects 


were in the direction predicted by the optical adjacency hypothesis. 


Relative height is often included in 
general discussions of the cues for 
perceived depth. The higher of two 
objects will generally appear more dis- 
tant. A search of the literature has 
failed to uncover any systematic ex- 
perimentation dealing with this cue. 
Apparently the assertion that there 
exists a lawful relationship between 
relative height and relative perceived 
distance is based on informal observa- 
tions and demonstrations. Two inter- 
esting demonstrations are provided by 
Gibson (1950, Fig. 72) and Kilpatrick 
(1952, p. 5). Although these demon- 
strations, and others like them, may 
be impressive, an experimental dem- 
onstration of this cue under controlled 
conditions is still desirable. 

However, let us assume for the 
moment that the vertical position cue 
is valid. What accounts for the effect 
of this variable? On this question, the 
experimental literature is also silent. 
Nevertheless, there are several possi- 
ble answers, and one of these has been 


1 This work was supported by Grant MH 
4153 from the National Institute of Mental 
Health of the United States Public Health 


Service. 


examined in the present experiment, 
The explanation is derived from Gib- 
son's (1950, 1959) analysis of the stim- 
ulus conditions for perceived space. 
In Gibson's view, the first step in 
accounting for the perceived distance 
of objects is to determine the optical 
stimulation responsible for the percep- 
tion of an extended continuous ground 
surface. Gibson proposes that the opti- 
cal gradient of textural density is one 
of the important stimulus correlates of 
perceived surface. The perceived dis- 
tance of an object will be determined 
by its optical relationship to the ground 
surface, and can be specified by noting 
at what point, in the textured optic 
array, the contour of the object inter- 
rupts the texture of the ground. 

The foregoing analysis provides an 
explanation of the height-in-the-field 
cue, Objects which are higher up are 
normally optically adjacent with por- 
tions of the optical gradient of texture 
correlated with more distant (physi- 
cally and perceptually) portions of the 
ground surface, than is the case for 
relatively lower objects. Therefore, 
the higher of two objects will appear 
farther away. Experiments have been 
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designed to examine this interpreta- 
tion. 


EXPERIMENT Í 
Method 


General plan—Reports of the perceived 
depth between a pair of frontal parallel 
points were obtained. The points were 
vertically aligned and were separated by one 
of three vertical interspaces. The predic- 
tion was that the depth perceived between 
the points would increase as the vertical 
separation increased. The points were ob- 
served under three background conditions: 
zero background, outline background with- 
out surface texture, textured background. 
The expectation was that perceived depth 
would increase as the specificity of the 
background (from zero to textured) in- 
creased. Furthermore, a significant Sepa- 
ration X Background interaction was ex- 
pected, The effects of varying separation 
were expected to be greater for the highly 
specific background. 

Subjects—The Ss were 30 undergrad- 
uates. Ten Ss were assigned to each of the 
three background conditions. 

Materials.—The standards were fluorescent 
points, 8 in. in diameter, located at a dis- 
ance of 57 in. from S’s eye. In the com- 
pletely dark laboratory room they were 
visible by the light of an ultraviolet tube. 
The S viewed the points monocularly 
through a red filter which eliminated any 
perceptible illumination of the surroundings. 
Under the condition of zero background, 
the points were viewed against a black back- 
ground. For the outline background, the 
points were fastened onto a frontoparallel 
black cardboard sheet, on which the trapezoid 
illustrated in Fig. la was drawn in fluo- 


TABLE 1 


MEAN PERCEIVED DEPTH FOR THREE VERTICAL SEPARATIONS UNDER 


THREE BACKGR 
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rescent paint. The separation between the 
two parallel sides was 8 in.; the longer of 
the parallel sides was 26 in.; the shorter, 
8% in. The textured background is shown 
in Fig. 1b. Its outer dimensions were the - 
same as the outline background. Both 
backgrounds yielded the impression of a 
receding ground. Figure lb had the ap- 
pearance of a floor. In addition to these 
stimulus materials a fluorescent footrule was 
prepared. 

Procedure.—Each S made two verbal 
estimates of the depth between the points 
for each of three degrees of vertical sepa- 
ration: 3.5, 5.5, 7.5 in. The six trials were 
run in random order. Each S made judg- 
ments under one background condition only. 
For all background conditions, the absolute 
heights of the points were the same. The 
points were located approximately at eye- 
level. 

The instructions called for a verbal esti- 
mate of the depth between each pair of 
points based on an immediate impression of 
their depth separation (phenomenal instruc- 
tions). To facilitate performance, S was 
shown the fluorescent frontoparallel footrule 
immediately prior to the depth estimation 
task. The S was asked to make his judg- 
ments in multiples or fractions of a foot. The 
ruler was not present during the depth 
estimation task. The instructions also in- 
formed S that a report of zero depth was 
entirely acceptable. 


Results 


Table 1 shows the mean perceived 
depth in inches for each of the nine 
experimental conditions. In Fig. 2, 
the means are plotted as a function 


OUND CONDITIONS 


Background Condition 


M 
Meier " Zero Outline Textured Overall 
M SD M SD M SD M SD 
3.5 2.11 1.88 2.12 1.27 3.08 0.90 
5.5 194 | 117 3.99 1.54 5.87 | 2.06 393 130 
7.5 2.49 2.44 5.38 2.79 12.82 3.74 6.90 4.19 
Overall 2.85 2.05 3.83 2.37 7.26 4.79 ze Et 


PERCEIVED DEPTH AND BACKGROUND CONDITIONS 


Fic, 1, Outline and textured backgrounds. 
(The actual dimensions are provided in the 
text.) 


of separation with background as a 
parameter. An analysis of variance 
showed that both main effects were 
significant—separations, F (2, 54) = 
5012, p < 001; backgrounds, F (2, 
27) = 81.80, p<.001. In addition, 
the interaction between background 
and separation was significant, F (4, 
54) = 7.21, p < 001. 

The nature of the interaction is 
easily recognized in Fig. 2. In the 
case of zero background, variations in 
vertical separation did not produce sig- 
nificant differences in perceived depth. 
And a t test showed that the overall 
mean perceived depth for the zero 
background (see Table 1) was not 
significantly greater than zero per- 
ceived depth. On the other hand, with 
a textured background, each increase 
of vertical separation produced a sig- 
nificant increase in mean perceived 
depth. The effects for the outline 
background were in the appropriate 
order, but the difference between the 
5.5.in. and 7.5-in. separation was not 
significant. 

"As another index of the interaction 
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effect, the data were examined to de- 
termine the frequency with which Ss 
in each background condition gave 
depth estimates whose rank order of 
magnitude agreed with the rank order 
of magnitude of vertical separation. 
The number of Ss in each background 
condition was 1, 4, and 10 for the 
zero, outline, and textured background, 
respectively. 


SuPPLEMENTARY EXPERIMENT 


Experiment I was repeated with 
only one difference. The backgrounds 
shown in Fig. la and 1b were inverted 
so that a ceiling effect was produced 
instead of a floor effect. According to 
the optical adjacency hypothesis this 
modification should yield only one im- 
portant difference between the results 
of Exp. I and the supplementary ex- 
periment. The depth effects should 
be the same except that with the ceil- 
ing patterns the lower point should 
appear more distant. 


PERCEIVED DEPTH IN INCHES 


$5 55 75 


VERTICAL SEPARATION 
Fic. 2. Perceived depth between the two 
standards as a function of frontal plane ver- 
tical separations and three background con- 
ditions. 
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Thirty new Ss were tested with the 
ceiling pattern. The main effects and 
interaction were highly significant in 
the same direction as Exp. I. The 
sole difference between the results of 
the two experiments was the one pre- 
dicted above. In Exp. I the upper 
point was judged to be more distant 
in 79 of the 82 depth reports. (There 
were 8 reports of zero depth.) In the 
supplementary experiment it was the 
lower point that was judged to be 
more distant in 68 of the 78 depth 
reports. 


Discussion 


The results support the hypothesis of 
optical adjacency as an account of the 
effects of relative height on perceived 
depth. Also, in a general sense, the ex- 
periments support Gibson’s (1950, 1959) 
insistence that formulations of the cues 
for depth should not disregard the sur- 
faces surrounding the object. The cue 
under discussion is best thought of as 
relative height in the field rather than 
relative height or vertical position con- 
sidered in isolation in empty space. It 
is also more accurate to omit reference 
to the direction of the perceived depth, 
ie, which appears nearer, unless the 
background is specified. 

Having asserted the importance of 
background, what can be said about the 
cases in which, contrary to the results of 
the present experiment, differences in 
vertical position produced depth effects 
in the absence of any visible background. 
For instance, in Kilpatrick's (1952, p. 5) 
example, two light points exposed near 
the ceiling of a dark room are said to 
appear separated in depth. The same 
points, also presented in darkness, this 
time near the floor, are also perceived 
as separated in depth, but with their 
positions reversed. To account for these 
apparent exceptions to the hypothesis of 
optical adjacency, one further assump- 
tion needs to be introduced: In the ab- 
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sence of a visible background, relative 
height will affect perceived depth, if $ 
assumes the existence of an unscen back- 
ground. Gibson (1950, p. 180) offers 
a similar conjecture in his discussion of 
this cue, There are two ways in which 
an assumed background can arise. (a) 
Immediately prior exposure to a back- 
ground can lead S to imagine or infer 
its presence when it is no longer visible. 
The recent study of negative afterimages 
in "imagined space" (Gruber, King, & 
Link, 1963) illustrates an effect of this 
general nature, (b) Extreme eye or/and 
head position, normally correlated with 
distinctively differently oriented surfaces, 
may induce an assumption of the asso- 
ciated surface when it is not present. 
Thus, tilting the head back and raising 
the eyes may create the assumption that 
the objects are viewed against the ceiling. 
Since the standard points in the present 
experiment were located straight ahead, 
eye position could only signal a fronto- 
parallel wall as the invisible background. 
According to the optical adjacency hy- 
pothesis, variations in vertical separa- 
tion, which occur in the context of an 
optic array correlated with a frontal 
parallel surface, should not lead to per- 
ceived depth between the separated points. 
This would account for the results under 
the condition of zero background in the 
present experiment, 
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The sensitivity and criterion ind 


exes, d’ and B, of signal-detection 


theory were calculated and used as measures descriptive of vigilance 


performance, and trends were studied in terms of these 


indexes in 


addition to the usual ways involving detections and false alarms. Ss 


were run under 2 conditions: 


(a) à the Ss employed a multiple 


criterion, ie. indicated their degree ‘of confidence at 1 of 3 levels when 
they thought they detected a signal; (b) 4 the Ss merely indicated the 


occurrence of a signal (single criterion). 


All Ss performed the same 


task—detection of a 1.8-db. increment to periodically occurring 60-db. 


noise pulses—for 9 80-min. sessions. 


creased slightly during sessions a 
(b) criterion indexes, B, increase 
(c) Ss employing a single criterion, 


It was found that (a) d' de- 
increased slightly over sessions; 


d both within and over sessions; 


ie, single standard of judgment, 


showed more pronounced trends than did Ss employing multiple 
criterion. The usual changes in Hits and False Alarms, i.e., decreases 


in detections within sessions 


(especially the early sessions) and 


decreases in false alarms within and over sessions were observed. 


There recently have been several at- 
tempts to apply the theory of signal 
detection (TSD) to the vigilance situa- 
tion. It was suggested by Egan, 
Greenberg, and Schulman (1961) that 
the TSD model (Peterson, Birdsall, & 
Fox, 1954; Tanner, Swets, & Birdsall, 
1961) might be applied to the vigilance 
situation, and a method—termed the 
method of free response—was de- 
scribed and applied to the case in which 
Os were to detect signals occurring 
without warning in a brief observation 
interval. An even earlier attempt to 
describe the vigilance situation in TSD 


1 The research reported was made possible 
by a research and development contract be- 
tween the Office of the Surgeon General, 
Department of the Army, and the University 
of Louisville. We are indebted to Valerie 
Wasson and Lois Hohmann for technical 
assistance. 


terms was made by Howland (1958), 
and Broadbent and Gregory (1963a, 
1963b) also have applied TSD meth- 
ods in the analysis of their data. In 
one of these (Broadbent & Gregory, 
1963a) they demonstrated changes in 
Os’ criteria for responding over time 
on an auditory vigilance task, although 
no general decrements in detection 
were observed. In a recent study, 
Loeb and Binford (1964) observed 
changes in O hit (detection and false- 
alarm rates within sessions, which in a 
detection-theory model could be inter- 
preted as a changing criterion. There 
was some reason to believe, however, 
that sensitivity was changing—increas- 
ing in successive sessions—but because 
of the small number of sessions no 
definite conclusion could be reached. 
If there were a change in sensitivity 


339 


340 


3 5 5 Ei Ei 
V 
a 
2 
[- 
2 
Jm : 
2 3 : 
= 
s 25 | 25!] 25 
SEC | SEC 1 SEC 


TIME 


JOHN R. BINFORD AND MICHEL LOEB 


Fic. 1. Signal parameters and sequence. 


(d') as well as an increase in the 
criterion (@), then the static detection- 
theory model would appear insuffi- 
cient; rather, a dynamic model like 
that of Luce (1963) or Atkinson 
(1963) might be required. It is with 
the requirement for dynamics in a 
detection theory that the present re- 
search is concerned. 


MetHop 


Schedule——Each S was employed for 10 
sessions—the first (a training session) being 
labeled 0 and the subsequent nine vigilance 
sessions labeled 1-9. Each of the vigilance 
sessions (1-9) had a duration of 90 min. 
During the training session (a) crude 
threshold measurements for white noise 
pulses were made (method of limits), (b) 
each S was familiarized with the signal and 
nonsignal events, and (c) Ss were instructed 
in using the response apparatus and in em- 
ploying the three criteria. 

Signal parameters—During Sessions 1 
through 9 all Ss were exposed to a periodic 
Sequence of pulses of Gaussian noise as 
shown in Fig. 1. The pulses had a duration 
of .5 sec. and occurred at 2.5-sec, intervals. 
Amplitude of the nonsignal pulses was 60 
db. above threshold (SL), as determined 
from threshold measures mentioned above, 
and. amplitude of signal pulses was 61.8 db. 
Intervals between signal pulses were ran- 
dom, the distribution being uniform and 
having a mean of 2 min. The range was 
further restricted so that the interval between 
signals was never less than 10 sec. nor 
greater than 240 sec. _ j 

Experimental conditions—Two groups of 


Ss were employed. The Ss of one group 
operated with only one criterion, ie. were 
required to press a single button whenever 
they detected a signal. The Ss in the second 
group were instructed to press one of three 
buttons, each corresponding to a degree of 
confidence—“very sure,” (VS) ; “moderately 
sure,” (MS) ; “possibly,” (P)—that a signal 
was present. 

Apparatus—The Ss were run individually 
in an insulated air-conditioned chamber 
Model 402A sound-shielded room. ^ Noise 
was generated by means of a Grason-Stadler 
Model 455C noise generator and fed 
through an attenuator network to a pair of 
Grason-Stadler Model HD30 earphones. A 
General radio 1450HB decade attenuator was 
used to set the level of nonsignal pulses at 
60 db. (SL) and a relay circuit shorted a 
18 db. fixed attenuator to provide signals. 
Signals were programmed by means of a 
Gerbrands filmstrip programmer, ^ holes 
punched in the film as specified by the dis- 
tribution described above. The S made his 
response by pressing buttons on a panel on 
a table in front of him. 

Subjects. —Twenty-four college students, 
paid at the rate of $1.50 per hr. served as 
Ss. An incentive of $20.00 was offered for 
the “best” performance. 


RESULTS 


An overall view of the outcome of 
the experiment is afforded by: Fig.-2, 
showing trends in detection, or hits 
(H), and false alarms (FA) over ses- 
sions, and by Fig. 3, showing trends 
over blocks (successive 10-min. pe- 
tiods). For Ss performing under MC 
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than there were when Ss employed a 


conditions, an H was credit for a detec- 
tion at any of the three criterion levels. 

Tt can be seen from Fig. 2 that there 
were more H, on the average, for Ss 
employing a multiple criterion (MC) 


single criterion (SC), but the overall 
difference is quite small and neither 
nonparametric Wilcoxon T test nor 
analysis of variance (Winer, 1962, p. 
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202) indicates the difference to be a 
significant one. However, an analysis 
of the sums of squares (Anderson & 
Bancroft, 1952, p. 238) based on the 
points themselves results in a partition 
of sums of squares in which the mean 
squares for both criteria and sessions 
exceed a remainder (interaction) mean 
square by a factor of 15 (p < .01). 
For Ss using an MC, the H mean 
frequencies appear to remain constant 
or to increase slightly over sessions 
(Fig. 2). The increase is quite small 
and the end points do not differ sig- 
nificantly according to a Wilcoxon T 
test, but trend analysis of the sums 
of squares (Anderson & Bancroft, 
1952, p. 207) results in a linear com- 
ponent which is approximately 10 times 
the value of the mean square based on 
the remaining components after the 
linear and quadratic components have 
been extracted, F (1, 6) =881, p 
< .025. The curve showing H for Ss 
using an SC increases over sessions, 
the end point of the curve differing 
from the starting point at the 5% level 
by a Wilcoxon T test. The linear 
Component of the sum of squares is 
approximately 13 times a mean square 
based on the residual sum of squares 
after the linear and quadratic compo- 
nents have been extracted (p < 01). 
The FA curve for the MC group 
(Fig. 2, right) falls uniformly below 
that for the SC group. For both 
groups the number of FA decreases 
markedly over Sessions, ends points 
differing significantly at the 1% level 
as indicated by a Wilcoxon T test. 
The linear component of the sum of 
Squares is significant at better than the 
1% level for both the SC and MC 
groups, as is the quadratic component 
for the MC group. An analysis of 
variance (Winer, 1962, p. 303) indi- 
cates a significant sessions effect (p 
<.05), but an insignificant effect of 
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differences in number of criteria, even 
though the curves have no crossings. 

In summary, Ss performing under 
an MC appear to make more H and 
fewer FA than do Ss performing under 
an SC, and while both groups show 
trends—an increase in H and a de- 
crease in FA over sessions—the per- 
formance of the MC group undergoes 
less of a change; i.e., the percentage 
increase in H and decrease in FA ap- 
pears to be less. 

Figure 3 shows trends in H (left) 
and FA (right) for successive 10-min. 
blocks, averages being taken over Ss 
and sessions. As was observed in the 
preceding figure, Ss performing under 
an MC make more H and fewer FA, 
and the trends appear quite similar. 
The decline in H apparent in the figure 
is relatively small. A test comparing 
end points does not indicate the end 
points of the SC or MC groups to be 
significantly different, but an analysis 
by means of orthogonal polynomials 
(Anderson & Bancroft, 1952, pp. 207- 
216) reveals linear trends significant 
at the 5% level. In view of the ap- 
parent similarity of the trends and lack 
of interaction, a simple two-factor an- 
alysis of variance (Anderson & Ban- 
croft, 1952, p. 258) was computed and 
using the interaction (remainder) we 
find both the block and criterion vari- 
ables to be significant ( p < .01) 
sources of variation. 

Inspection of the curve for FA indi- 
cates a marked separation of the curves 
for performance under MC and SC 
conditions, the latter making fewer 
FA. A simple two-factor analysis of 


. Variance using the interaction as error 


suggests that the number of criteria 
variable is a significant source of vari- 
ation (p< .01). Analysis of trends in 
FA by means of orthogonal poly- 
nomials indicates a linear trend signifi- 
cant at the 1% level for the MC group 
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TABLE 1 


MEANS AND 95% CONFIDENCE INTERVALS FOR SENSITIVITY INDEXES, d, 
AND MEAN CRITERION INDEXES, B 


2 B 
Coco Session 20-Min. Period 20-Min. Period 
First Last First Last 
MC 1&2 3.09 + .52 2.96 + 46 13.72 26.88 
8&9 3.58 + .48 3.43 + 43 43.41 42.28 
sc 1&2 2.54 + .52 2.48 + .55 17.74 129.68 
8&9 3.93 + .79 3.18 + .74 54.80 87.92 


and linear and quadratic components 
in the SC group significant at the 5% 
level. 

The preceding data suggest differ- 
ences in trends during early sessions 
and during late sessions. A more de- 
tailed analysis, reported elsewhere 
(Binford & Loeb, 1965) makes it clear 
that (a) there is an increase in the 
number of detections from early to late 
sessions, (b) there is a large decrease 
over blocks in both H and FA in early 
sessions and no decrease in late ses- 
sions, (c) there is an overall decline 
over blocks which can be accounted for 
largely by a linear trend when an SC 
is employed but which involves linear 
and quadratic components when MC 
are employed, and (d) there is a de- 
creasing frequency of use of the more 
lax criterion categories (possibly, mod- 
erately sure) for H and of the strict 
category (very sure) for FA over 
sessions. 

Shown in Table 1 are mean sensi- 
tivity indexes, d', and mean criterion 
indexes, 8. d' and B were calculated 
by determining the standard normal 
variable, Z, and ordinates of normal 
probability density function (pdf), 
TZI corresponding to the percent- 
ages of H and FA and from the 
definitions, d'—Zm + Zra and B 
= fn(Z)u/fu(Zea), the Zs being 
standard normal deviates correspond- 


ing to, respectively, the proportion of 
H and the proportion of FA. The 
method employed is that of Broadbent 
and Gregory (1963b). To obtain 
reasonably stable indexes, percentages 
were determined for performance dur- 
ing the 20-min. periods for early ses- 
sions (1 & 2) and late sessions (8 
& 9). 

Inspection of the left-hand portion 
of the table showing d' indexes indi- 
cates that, for corresponding entries, 
those in the MC portion of the table 
exceed those in the SC portion of the 
table in three cases out of four. An 
analysis of variance (Winer, 1962, p. 
319) indicates that the effect of the 
difference in number of criteria is small 
relative to the error sum of squares 
(Ss within groups). The analysis of 
variance indicates, further, that the 
sessions variable makes a significant 
difference (p < .05). The overall de- 
crease in d' from the first 20 min. of a 
session to the last 20 min. of a session 
is significant ($ < .01). Of the inter- 
actions, only that for Sessions X Blocks 
approaches significance. 

Shown on the right in Table 1 are 
mean criterion indexes, 8. The under- 
lying data are quite variable and con- 
tain a few extreme scores in each con- 
dition, Normal-curve statistics, there- 
fore, were not calculated and confidence 
intervals not presented. Pairwise com- 
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parisons of means, employing the Wil- 
coxon T test, can be summarized as 
follows : 

For the MC group, the increases 
from Sessions 1 and 2 to Sessions 8 
and 9 are significant (p < .01) for 
performance both at the beginning of 
a session (first 20 min.) and at the 
end of the session. There is a signifi- 
cant increase (p< .02) in 8 during 
Sessions 1 and 2 from the first 20 min. 
to the last 20 min. For Ss performing 
under SC conditions only the increase 
from the first to the last 20 min. during 
early sessions (1 & 2) was significant 


(p < .01). 


Discussion 


Our data may be summarized in three 
ways: (a) Curves for H rate and FA 
rate have been shown to change over 
sessions; i.e, FA decline and H increase 
from the second to the tenth session. The 
increase in H is consistent with results 
obtained by Broadbent (1954), Bakan 
(1955), Buckner, Harabedian, and Mc- 
Grath (1960). (b) H, although increas- 
ing slightly over sessions, decline within 
sessions, while FA decline markedly dur- 
ing early sessions but negligibly during 
the last sessions, and possibly even in- 
crease in late sessions. There is the 
appearance that the FA rate has arrived 
at a relatively stable level by the last 
pair of sessions. (c) Finally, an analy- 
sis in terms of TSD of sensitivity in- 
dexes, d', and criteria indexes, 9, indi- 
cates shifts in d’; ie, an increase over 
sessions and a small decrease within ses- 
sions, and shifts in B, ie, a slight 
increase from the first to the last 20 
min. of a session, Finally, 8. increases 
within early Sessions, particularly for the 
MC group. 

Signal-detection fheory postulates two 
probability density distributions on the 
Sensory dimensions (s), one (the noise 
distribution) Corresponding to the sen- 
sory event when no signal is present, and 
the other corresponding to the Sensory 
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event when a signal is present. From 
the further assumption that a utility 
matrix is applicable in the description of 
S's performance, and the above assump- 
tions, there is derived a Bayes' solution, 
a unique solution characterized in terms 
of two quantities: d', the sensitivity in- 
dex, measures the separation of the noise 
and signal plus noise distributions in 
terms of the variance of the noise distri- 
bution, and B measures the likelihood 
ratio criterion. 

Sensitivity and criterion are fixed for 
a given set of initial conditions and 
nothing is said in signal-detection theory 
about how S might modify estimates of 
the initial conditions, although it is per- 
fectly clear that, were S to change his 
estimates, there would be a change in 
the 8. This change would have rela- 
tively simple consequences: If p in- 
creases, then there would be a decrease 
in H and FA, a fact which is summarized 
in the operating characteristic, or iso- 
sensitivity curve, Such a change could 
account for trends within sessions but 
could not account for trends over ses- 
sions, Such a change would result were 
S to revise downward his estimate of 
a priori signal probability, which he un- 
doubtedly does during early sessions. 

We can account for the differential 
trends over sessions in the FA and H if 
we assume an increase in d' but what 
this means in terms of the theory is not 
too clear: Either there is an increase 
in d', the separation of the means of the 
two distributions; or there is a decrease 
in variance of one or both of the distri- 
butions. If S were able to establish these 
criteria, as half the Ss were asked to do, 
and were able to maintain these criteria 
fixed over sessions, a shrinking variance 
and a constant d’ would account for most 
of the results which we have obtained: 
Ss would make fewer and fewer FA and 
finally would assign FA only to the lax 
criterion, and would make more H, more 
and more at the strictest criterion level. 

It is only possible to speculate as to 
what might account for a change in the 
relationship of the variance of the signal 
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and signal plus noise distributions to the 
separation of these distributions—i.e., the 
change in d’, However, it might be 
assumed that Os gradually learn enough 
about the stimulus characteristics of the 
signal and nonsignal pulses so that they 
become less frequently confused. This 
sort of change might be viewed either as 
a decrease in variance or an increase in 
separation of distributions. In any event, 
the effect of such a change would be an 
increase in H and a decrease in FA. 
Since the effect of a change in Os’ 
expectations regarding signal schedule 
would probably be a decrease in H and 
FA, the learning and expectation effects 
acting together might produce primarily 
a decrease in FA (as observed). 
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CONSERVATISM IN A SIMPLE PROBABILITY 
INFERENCE TASK ! 


LAWRENCE D. PHILLIPS AND WARD EDWARDS 
University of Michigan 


3 experiments investigated the effects on posterior probability estimates 
of (a) prior probabilities, amount of data, and diagnostic impact of the 


data; (b) payoffs; and (c) response modes. 


In all the experiments Ss 


usually behaved conservatively, i.e., the difference between their prior 
and posterior probability estimates was less than that prescribed by 
Bayes’ theorem. Conservatism was unaffected by prior probabilities, 
remained constant as the amount of data increased, and decreased as 
the diagnostic value of each datum decreased, More learning occurred 
under payoff than under nonpayoff conditions and between-S variance 


was less under payoff conditions. 


Estimates were most nearly Bayesian 


under the (formally inappropriate) linear payoff, but considerable over- 
estimation resulted; the log payoff condition yielded less conservatism 
than the quadratic payoff. Estimates were most nearly Bayesian when 
Ss estimated odds on a logarithmic scale. 


In several recent studies of the 
ability of human Ss to make infer- 
ences in probabilities, Ss behaved 
conservatively. In one experiment 
(Phillips, Edwards, & Hays, 1965) 
Ss were presented with four hypoth- 
eses, only one of which could be true. 
A sequence of data was shown and Ss 
were required, after each datum, to 
estimate how probable they thought 
each hypothesis was. Other than the 
data itself, only two kinds of infor- 
mation were given to Ss, the proba- 
bilities assigned to the hypotheses 
before observing the data, and the 
probability that each single datum 
would occur given that a particular 
hypothesis were true. These two 


! The research reported in this Paper was 
sponsored by the Air Force Systems Com- 
mand, Electronic Systems Division, Decision 
Sciences Laboratory, under Contract No. 
AF 19(628)-2823 with the University of 
Michigan. The paper is identified as ESD- 
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States Government. 
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quantities, prior probabilities, P(H), 
and likelihoods, P(D| H), when com- 
bined using Bayes' theorem, yield a 
set of posterior probabilities that de- 
scribe the opinions, in the light of the 
data, of an ideal person (Edwards, 
Lindman, & Savage, 1963). The 
amount of probability revision is in- 
dicated by the difference between 
posterior and prior probabilities. 
Revision was consistently smaller for 
Ss in this experiment than the amount 
prescribed by Bayes' theorem. This 
reluctance of Ss to extract from the 
data as much certainty as is theo- 
retically implied by Bayes' theorem 
has been called the conservatism 
effect (Edwards & Phillips, 1964). 
Several factors influence conserv- 
atism. Peterson, Schneider, and 
Miller (1965) found greater conserv- 
atism when Ss were presented with 
large samples than when they were 
given successive small samples. In 
another experiment (Peterson & 
Miller, 1965), the amount of con- 
Servatism was found to depend on 
the prior probabilities of the hypoth- 
eses and on the theoretical diagnostic 
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impact of a single datum, i.e., on the 
likelihood ratio. In general, con- 
servatism was greater the further the 
likelihood ratio was from 1, and when 
prior probabilities were equal rather 
than extreme. Finally, Phillips et 
al., found Ss to be more conservative 
for ambiguous sequences of data than 
for sequences whose Bayesian poste- 
rior probabilities quickly approached 
1.0 and 0. 

The following experiments explored 
several other factors that affect con- 
servatism. Experiment I examined 
the effects of prior probabilities, 
amount of data, and the diagnostic 
impact of the data. In Exp. II pay- 
offs were imposed on the task, and in 
Exp. III the effects of different re- 
sponse modes were examined. 


EXPERIMENT I 


The purpose of this experiment 
was to determine the effect of prior 
probabilities, amount of data, and 
diagnostic impact of the data on Ss 
posterior probability estimates. 


Method 


Subjects.—Five volunteers, University of 
Michigan undergraduate men, served as Ss. 
They were paid $1.25 per hour. 

Procedure and design.—The Ss were told to 
imagine 10 bags, each of them containing 100 
poker chips, with red chips predominating in 
7 of the 10 bags, and blue chips predominating 
in the remaining 10 — r bags. They were 
shown a bag and told that E had chosen it 
from the 10 bags, where each of the 10 bags 
was equally likely to have been chosen. The 
Ss were asked to make estimates of the prob- 
abilities that a predominantly red or a pre- 
dominantly blue bag was the chosen bag. If 
these estimates differed from r/10 and 
(10 — r)/10 E explained that since no other in- 
formation was available, r/10and (10 — r)/10 
were the best estimates of the prior prob- 
abilities, and their estimates should be 
changed to these values. This procedure 
ensured that all Ss started with the same 
prior probabilities. 

The Ss were told that the predominantly 
red bags contained the percentage p of red 
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chips and percentage q of blue chips, while 
the predominantly blue bags contained the 
inverse percentages, p blue chips and q red 
chips. The values of p and q were either 
10%-30% or 6095-4046. Twenty chips were 
then shown one at a time; Ss were told that 
the sequence of chips was the result of random 
draws, with replacement, from the chosen 
bag. After each new chip was shown, Ss 
revised their previous intuitive estimates of 
the probabilities that each type of bag had 
been chosen. Estimates were made by dis- 
tributing 100 metal washers over two pegs, 
the height of each stack representing the 
probability of the corresponding hypothesis. 
This process of selecting one bag at random 
from 10 and then drawing 20 chips from the 
bag was repeated 24 times; thus, every S 
made 20 pairs of estimates for each of 24 
sequences. The sequences of 20 draws were 
actually planned prior to the experiment so 
that the same sequences could be shown 
to all Ss. 

Half of the 24 sequences came from 10%- 
30% bookbags, the other half from 60%-40% 
bookbags. Of the 12 sequences in each half, 
2 started with prior probabilities of 0.3 and 
0.7, 2 started with 0.4 and 0.6, 4 started with 
0.5 and 0.5, 2 with 0.6 and 0.4, and 2 with 
0.7 and 0.3. The predominant color of the 
chosen bag was counterbalanced within each 
set of 12 sequences. 

Data analysis.—Theoretical probabilities 
for each sequence can be calculated from 
Bayes’ theorem: 

P(Hp|D) = kP(D|He)P(He) — [13 
P(Hs|D) = kP(D|Hz)P(Hz) [2] 


The hypotheses being considered, that a pre- 
dominantly red or predominantly blue bag 
was chosen, are represented by Hr and Hp. 
The prior probabilities of these hypotheses 
are represented by P(Hz) and P(Hz), while 
P(Hp|D) and P(Hz|D) represent the pos- 
terior probabilities, the probabilities of the 
hypotheses after observing the data D. 
P(D|Hg) and P(D|Hx) represent the likeli- 
hoods of the data, or the conditional prob- 
abilities of the data given the truth of the 
particular hypothesis. The normalizing con- 
stant k ensures that 
P(Hz|D) + P(He|D) = 1. 

A form of Bayes’ theorem that is more 
convenient for data analysis can be obtained 
by dividing Equation 1 by Equation 2 when- 
ever Hg is the correct hypothesis, and Equa- 
tion 2 by Equation 1 whenever Hz is correct. 
This gives, 

Qi = LQ%. E31 
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ACCURACY RATIO 
rj 


E 0 2 4 6 8 10 12 


Fic. 1. Effect of bag composition on 
accuracy ratio. 


The posterior odds in favor of the correct 
hypothesis is given by Q, while Q represents 
the prior odds in favor of the correct hy- 
pothesis. L represents the likelihood ratio 
of the data. 

The chips drawn from the chosen bag are 
best described by a binomial process, in which 
a success will be defined as the drawing of a 
chip with the same color as the predominant 
chips in the chosen bag, and a failure as the 
drawing of a chip of the other color. If the 
probability of a success is represented by p, 
and the probability of a failure is q-1—hp, 
then the likelihood ratio of the data is 


given by 
ET 


where s represents the number of successes 
in the sample, and f the number of failures. 
Thus, the diagnostic value of the data, as 
reflected by the size of the likelihood ratio, 
is a function of the proportion of red and 
blue chips in the bags, and of the numerical 
difference between the red and blue chips in 
the sample. (Note: Equation 4 is appro- 
priate only if the hypotheses are equidistant 
from 0.5, as is the case in this experiment.) 
Likelihood ratios inferred from Ss’ esti- 
mates were also computed by first converting 
their estimates to posterior odds and then 
dividing these posterior odds by the corre- 
sponding prior odds given at the start of the 
Sequence. Since log L varies linearly with 
5 — f, for given values of b and q, the log- 
arithms of the inferred likelihood ratios were 
examined for this linear relationship. Fur- 
ther, Ss’ performance was compared to 
Bayesian performance by computing the ratio 
of the inferred log likelihood ratio to log L. 
This ratio will be termed the "accuracy ratio” 
(after Peterson et al., 1965). An accuracy 
ratio of less than 1 indicates that an S’s revi- 
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sion of his subjective probabilities is less than 
the amount calculated from Bayes' theorem. 


Results 


Plots of inferred log likelihood’ 


ratios (averaged across Ss and se- 
quences) as a function of s—f 
showed no systematic effects from 
prior probabilities. . Accuracy ratios 
based on the mean inferred log likeli- 
hood ratios are shown in Fig. 1. 
(The accuracy ratio is not defined for 
s—f=0. Consequently, the plots 
from s —f =— 1 to s—f — 1 are 
connected with dotted lines.) For 
all values of s — f and for both bag 
compositions, accuracy ratios are less 
than 1. Further, performance is less 
Bayesian for sequences using 70-30 
bags than fór sequences using 60-40 
bags. If inferred log likelihood ratios 
varied linearly with s — f, the curves 
in Fig. 1 would be horizontal. Since 
they are not, this simple model of Ss' 
behavior, suggested by Bayes' the- 
orem, is not adequate. The positive 
slopes of the curves indicate that 
probability revisions become more 
nearly like Bayesian revisions as the 
data become more diagnostic. How- 
ever, the extent of this nonconstancy 
of the accuracy ratio is fairly small; 
a model implying that accuracy ratios 
are constant would be a tolerable 
first approximation. 


Discussion 


Despite the simplicity of this task, 5s' 
probability revisions still show consider- 
able conservatism. Apparently, the con- 
servatism effect in the Phillips et al. 
experiment is not entirely caused by the 
complexity of that task. The finding 
that conservatism is largely unaffected 
by prior probabilities would seem to con- 
flict with the results obtained by Peter- 
Son et al. However, within our experi- 
ment's restricted range of prior prob- 
abilities and Bayesian likelihood ratios, 
our results agree with theirs. 
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EXPERIMENT II 


A possible reason for conservatism 
is that Ss were not motivated ade- 
quately; there ‘was nothing to pre- 
vent. them from waiting for more 
information before making probabil- 
ity estimates more nearly like Baye- 
sian probabilities. In other words, 
Ss’ subjective probabilities may have 
been. more extreme than their esti- 
mated probabilities. Experiment II 
imposed a payoff scheme on Ss’ prob- 
ability estimates. Two of the payoff 
schemes examined have thé property 
that the subjectively expected value 
of a probability estimate is a mono- 
tonically decreasing function of the 
difference between S’s estimated 
probability and his subjective pro- 
bability. For these payoff schemes, 
S can maximize his subjectively ex- 
pected winnings by reporting his 
true subjective probability. 


Method. 


Subjects.—Forty-eight volunteers, Univer- 
sity of Michigan undergraduate and graduate 
men, served as paid Ss. 

Procedure-—The basic experimental para- 
digm was the bookbag and poker-chips situa- 
tion described in Exp. I. However, only bag 
“compositions of 70%-30% were used, all se- 
quences started with prior probabilities of 
0.5-0.5, and each S was presented with 20 
sequences. After each sequence of 20 draws, 
S was told which hypothesis was correct. He 
then spun a spinner which randomly selected 
one of his 20 pairs of estimates; the estimate 
for the correct hypothesis determined the 
amount of his payoff in points. Payoffs were 
accumulated over the 20 sequences, and the 
total points were converted linearly to money. 
The resulting bonus was added to the basic 
pay rate, $1.00 per hour. The maximum 
bonus possible was $2.00; 5s earned between 
$1.29 and $1.92 in bonuses. 

Design.—The Ss were assigned randomly 
to four groups, 12 Ss per group. The control 
group received no payoff but they were told 
which hypothesis was correct after each se- 
quence. The first experimental group re- 
ceived payoffs that were logarithmically re- 


lated to the probability estimates and a second 
` 
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group received payoffs that had a quadratic 
relationship to the estimated probabilities. 
Letting p equal the probability estimated for 
the correct hypothesis, and o(p) equal the 
payoff for the probability p, then, for the log 
payoff group, 
o(p) = 10,000 + 5,000 logio p, 
and for the quadratic payoff group, 
o(p) = 10,000 — 10,000(1 — p)?. 


(The —% payoff encountered in the log 
scheme when p = 0 was handled by telling 
Ss they would lose all their earnings in the 
experiment up to that point. As a result, 
most Ss never estimated probabilities more 
extreme than .01 and .99, and no S in the 
log payoff condition ever estimated a prob- 
ability of 0 on the correct hypothesis.) For 
these payoffs, the optimal strategy is for Ss 
to estimate their subjective probabilities 
rather than any other probabilities. This 
strategy is optimal in the sense that it maxi- 
mizes subjective expected value (SEV). Spe- 
cifically, letting y equal S's subjective prob- 
ability for the correct hypothesis, the SEV 
function, as given by 


SEV = yo(p) + (1 — (1 — b), 


has its maximum at the point where p = V. 
Further discussion of this class of payoffs can 
be found in Toda (1963), and van Naerssen 
(1962). 

A third group received linear payoffs where 
v(p) = 10,0005. For this scheme the optimal 
strategy is to estimate 1.0 for the more prob- 
able hypothesis. 

The sequences were selected prior to the 
experiment so that the same sequences could 
be presented to all Ss. Actually, the 20 se- 
quences were 10 different sequences repli- 
cated once; the order in which the sequences 
were presented in the replication was the 
same as in the original presentation. These 
10 sequences were divided into two blocks of 
5 and were presented randomly within each 
block. A different random order was shown 
to each S. The order of presenting the blocks 
was counterbalanced within each group of Ss. 

The 10 sequences were generated by a re- 
peated Bernoulli process but were con- 
strained to have the proper error character- 
istics. This means that if a perfect Bayesian 
S had to select the correct hypothesis after 
n draws, he would be wrong the expected 
number of times for each block of 5 sequences. 
This procedure prevents wildly unrepresenta- 
tive samples from occurring. 

Apparatus.—The Ss were shown a vertical 
row of 20 pushbuttons mounted on an up- 
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Fic. 2. Effects of practité in the task on 
` the mean absolute deviations of Ss’ estimates 
from Bayes’ theorem. " 


right panel. A green light bulb was located 
just to the left of each button, and a red light 
bulb was located on the right. Pressing a 
button would cause either the green or the 
red bulb to light. Thus, pressing each suc- 
cessive button revealed to Ss the result of 
each draw in the sequence. The pattern of 
red and green lights was changed for each 
sequence by changing prewired plugs inserted 
into the machine. 

Each S estimated probabilities by distri- 
buting 100 white discs, each } in. thick, in 
two vertical troughs, the height of the discs 
in each trough indicating the probabilities of 
the two hypotheses. Probabilities could be 
read directly from scales located along the 
outer edges of the troughs. The payoff 
schedule was displayed as a column of 101 
numbers in the space between the troughs. 
Thus, S could look to the outer scale to read 
the probability corresponding to the highest 
chip in the stack, and to the inner scale for 
the payoff he would receive if that hypothesis 
were correct. Bag compositions were shown 
on a separate display. 


Results 


The Ss in the experimental groups 
sometimes estimated the probability 
of the correct hypothesis to be 1.0. 
Since this estimate leads to an accu- 


racy ratio of infinity, a different de- 
pendent variable was used in this 
experiment: Absolute deviations, in 
probability, of Ss’ estimates from 
Bayes’ theorem. This variable will 
be referred to subsequently simply as 
“Ss’ deviations." The difference be- 
tween each S’s estimate and the cor- 


‘responding Bayesian probability was 


computed and, averaged across Ss 
within each group, and within each 
block of five sequences. Figure 2 
shows the’ mean absolute deviations, 
times 100,.of the control group plotted 
as a'function of the number of draws. 
Nearly.all of these deviations were 
conservative. A small amount of 
learning can be seen to have occurred 
from the first to the fourth block; Ss’ 
estimates tended to become more 
Bayesian later in the experiment. 
The percentage decrease, from Block 
1 to Block 4, of Ss’ deviations was 
computed for each group and plotted, 
as a function of the number of draws, 
in Fig. 3. After the eighth draw, all 
experimental groups showed more 
learning than the control group. 
There was no evidence that learning 
had reached an asymptote by the 
fourth block. The effects of payoffs 
on Ss’ deviations were compared for 
the three experimental groups 
by computing percent-improvement 
scores on the mean absolute devia- 
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Fic. 3. Percent decrease, from Block 1 
to Block 4, of Ss’ absolute deviations. 
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tions. Letting M. be the mean. ab- 


solute deviation of the Control group 
for a given block and for a given value 
of s — f, and M, the same quantity 
for one of the experimental groups, 
the improvement score is given by 


*s* 


Percent-improvement 
M.— Me 
x Me 


Improvement scores are shown in 
Fig. 4 for the fourth block of se- 
quences. Similar resülts were -ob- 
tained for the other three ,blócks. 
These curves show that payoffs have 
different effects, the linear payoff re- 
sulting in the most nearly Bayesian 
performance. The Ss in the quad- 
ratic group made estimates that were 
generally more conservative than 
those of the control group up to the 
thirteenth draw. Less than one- 
third of the estimates made by .Ss in 
the control, quadratic, and loga- 
rithmic groups were more extreme 
than the Bayesian probabilities, 
while over two-thirds of the estimates 
made by the linear group were more 
extreme. 

The between-S variance of Ss’ 
deviations was greater for the control 
group than for any of the experi- 
mental groups. The between-S vari- 
ance averaged over the 20 draws was 
0122 for the control group, -0071 
for the quadratic group, .0046 for the 
linear group, and .0039 for the log 
group. The greatest differences be- 
tween the control group and experi- 
mental group variances occurred 
after the sixth draw. 


Discussion 


Payoffs help to decrease the amount 
of conservatism shown in Ss' probability 
estimates, but they still do not eliminate 
conservatism. The extreme estimates 
frequently made by Ss in the linear 
payoff group reflect the extent to which 
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“uto the control group. 


Ss are influenced in the direction of th 
optimal strategy,of estimating 1.0 for 
the more probable, Hypothesis. 

Although the optimal strategy for 
both quadratic and logarithmic payoff 
groups was to estimate true subjective 
probabilities, Ss’ estimates were differ- 
ent, those of the quadratic group differ- 
ing little from those of the control group, 
while the logarithmic group showed less 
conservatism than the control group. 
One possible explanation for this theo- 
retically unexpected discrepancy is to be 
found in the slope of the SEV functions. 
The SEV function for the quadratic 
payoff is more peaked at the value of V 
than is the SEV function for the log- 
arithmic payoff. Thus, in the logarith- 
mic case Ss can report a probability 
value slightly higher than their subjec- 
tive probability at only a slight cost in 
SEV. This strategy applied in the case 
of quadratic payoffs would result in a 
greater loss in SEV than in the log- 
arithmic case, In other words, it ap- 
pears that Ss are not maximizing SEV, 
but are trying to effect some reasonable 
trade-off between the amount they would 
win if Bag G were correct and the 
amount they would win if Bag R were 
correct. It is not reasonable to attribute 
the difference between these two groups 
to nonlinear utility functions, for most 
Ss in the logarithmic group won only a 
few pennies more, on the average, than 
Ss in the quadratic group. 
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The smaller between-S variances of 
the experimental groups were probably 
due not only to the motivating property 
of the payoff schemes, but also to their 
instructional value (Edwards, 1961). If 
Ss are initially vague about the value of 
their subjective probability at a given 
draw, they are aided in discovering these 
values by observing the payoffs they ac- 
tually were given as compared to those 
they could have obtained had they made 
a different estimate. Instructional value 
of payoffs is supported by the greater 
amount of learning shown by the payoff 
groups than by the control group. 

These results suggest that payoffs 
should be used in probability estimation 
studies when small between-S variances 
are desirable. Further, logarithmic pay- 
offs are more effective than no pay- 
offs, quadratic payoffs, or linear payoffs 
in producing estimates more nearly 
like probabilities calculated from Bayes’ 
theorem, without incurring a consider- 
able degree of overestimation. 


EXPERIMENT III 


All Ss in the previous experiments 
responded by distributing 100 washers 
or chips between the two hypoth- 
eses. If this mode of response was 
incompatible with Ss’ population 
stereotypes, the difficulty in making 
a response may have caused some 
amount of conservatism. The pur- 
pose of this experiment was to ex- 
amine the effects of other response 
modes on conservatism. 


Method 


Subjects —Forty-eight volunteers, Univer- 
sity of Michigan undergraduate men, served 
as Ss. They were paid $1.25 per hour. 

Procedure.—The experiment followed the 
bookbag and poker-chip paradigm. All se- 
quences started with equal prior probabilities, 
and 15 sequences of 20 draws each were 
shown. To minimize learning effects, S was 
not told which hypothesis was correct at any 
time. No payoffs were used. 

Apparatus.—The same data-display device 
described in the previous experiment was 
used. Three response devices, one discrete 
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and two continuous, were used. One group 
of Ss estimated probabilities on the two- 
trough device described in Exp. II. Another 
group of Ss estimated odds by setting a slid- 
ing pointer on a scale of odds spaced log- 
arithmically. Probabilities were estimated 
by another group of Ss by setting a sliding 
pointer on a scale of probabilities, where the 
spacing of the probabilities was determined 
by converting the probabilities to odds and 
scaling the odds logarithmically. The con- 
tinuous probability scale extended from .50 
to .9999, while the odds scale ran from 1:1 
to 10,000:1. Both scales were 271 in. long, 
and the pointer moved horizontally. Bag 
compositions were shown on separate displays. 

Design.—The “Ss were assigned randomly 
to four groups, 12 Ss per group. In the first 
group Ss estimated probabilities on the dis- 
crete device. In the second group, Ss gave 
verbal statements of the posterior odds after 
each draw. The Ss in the third group esti- 
mated posterior odds on the continuous de- 
vice, while Ss in the fourth group estimated 
probabilities on the continuous scale of prob- 
abilities. These four groups will be referred 
to as the probability, verbal odds, log odds, 
and log probability groups. 

Odds were always in the direction of x:1, 
so Ss in the third group had to state which 
bag they thought was the chosen bag, before 
they estimated how much it was more likely 
to be that bag than the other. The Ss using 
the log probability scale had to state which 
bag was most probable before estimating how 
probable it was. The Ss were told that the 
length of the scale was arbitrary; they were 
free to use only a small portion of it, or they 
could, if they wanted to, make verbal esti- 
mates to extend the scale. Most of the re- 
sponses of Ss fell within the limits of the 
scale. 

Five of the 15 sequences were generated 
from 85%-15% bags, five from 70%-30% 
bags, and five from 5574507 bags; thus, the 
composition of each bag identifies a block of 
Sequences. These blocks were shown to 5S 
in completely counterbalanced order, and se- 
quences within a block were shown in @ 
different random order for each S. 


Results 


Estimates made by Ss in the two 
odds groups were converted to prob- 
abilities and the mean absolute devia- 
tions of all Ss’ estimates from Bayes’ 
theorem were computed within each 
group and block. Figure 5 shows 
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Fic. 5. Effect of bag composition on the 
mean absolute deviations of Ss’ estimates 
from Bayes’ theorem. 


these deviations as a function of 
s—f. Conservatism is again evi- 
dent, but it is least for the verbal 
odds and log odds groups. The 
method of responding makes very 
little difference when a single chip 
has little diagnostic value, i.e., when 
the bag composition is close to 
50% 50%. 

The decrease of Ss’ deviations as 
s—f increases suggests that Ss’ 
estimates are becoming more Baye- 
sian as evidence piles up. This, 
however, is an artifact caused by the 
boundedness of probabilities. Odds 
data do not have this property, so 
accuracy ratios better reflect the de- 
gree to which Ss are Bayesian. Ac- 
curacy ratios are shown in Fig. 6 for 


the log odds group. For the more 
diagnostic data, the degree of Baye- 
sianness shown by Ss is a constant. 
The same results were obtained from 
the verbal odds groups, but for the 
probability groups the accuracy ratios 
increase as a function of s —f. 
However, accuracy ratios computed 
from median inferred log likelihood 
ratios are considerably less affected 
by s — f for all groups than are accu- 
racy ratios based on mean data. Ac- 
curacy ratios based on median data 
are approximately constant even for 
data of low diagnostic value. 

The method of responding had no 
affect on between-S variances within 
each group; these variances were 
nearly identical from one group to the 
next. i 


Discussion 


Conservatism in the processing of 
probabilistic information appears to be 
a pervasive effect. Although the effect 
is nonexistent when the diagnostic value 
of a single chip is small, the amount 
of conservatism increases as the diag- 
nostic value of a single chip increases. 
When the diagnostic value of a single 
chip is not too low, conservatism re- 
mains approximately constant as the 
diagnostic value of the sample increases. 

Much of the data reported in these 
experiments could be described with a 
very simple model requiring only one 
fitted parameter. The horizontal plots 
in Fig. 6 suggest the following model : 
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Fic. 6. Effect of bag composition on 
accuracy ratio. 
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where the posterior odds estimated by 
S is given by €, the likelihood ratio is 
defined in Equation 4, and Qo represents 
the prior odds stated at the beginning 
of the experiment. The fitted param- 
eter c is a function of p, as can be seen 
in Fig. 6. This model holds only for 
values of p that are greater than about 
0.6 (see Fig. 1 and 6); when p is less 
than 0.6, c becomes a function of both 
bands —f. However, this model with 
c a function of p only describes median 
performance for each group very well. 

These experiments show that payoffs 
and response modes affect the amount 
of conservatism found to some extent, 
But the amount of conservatism found 
is large relative to the effects of these 
procedural variables. The failure of Ss 
to extract from the data all the certainty 
that is theoretically available is con- 
sistent and orderly, and may reflect a 
general limitation on human ability to 
process information. 
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RESPONSE UNITS IN THE PREDICTION OF 
SIMPLE EVENT PATTERNS * 


H. WAYNE LUDVIGSON ? 
University of Texas Southwestern Medical School 


2 response unit hypotheses were tested in a task similar to probability 
learning except that the stimulus events occurred in a simple recurring 
pattern prior to an extinction series of all one event. Ss were 144 


male and female college students. 


The hypothesis, that persistence 


of erroneous responding in extinction is constant when measured 
in units of the length of the acquisition pattern of events, received 


support. 


However, the hypothesis more closely predicted the data of 


Ss who learned and could describe the pattern than the data of other 
Ss. A 2nd hypothesis, that equally resistant units of specific elemental 


responses are acquired and extinguished, was not supported. 
assumption of a sequence of mediating 


necessity for an additional 


The 


responses representing the same-different relations among successive 


events is discussed. 


Attempting to explain the greater 
resistance to extinction resulting from 
partial reinforcement as compared with 
continuous reinforcement, Mowrer and 
Jones (1945) proposed a “response 
unit hypothesis” which defines as the 
unit of response the set of elemental 
reactions comprising a behavioral se- 
quence eventuating in a reinforcement. 
This hypothesis predicts, ceteris pari- 
bus, a single extinction function for 
such response units regardless of the 
number of elemental responses in- 
volved, 

Perhaps justifiably, little direct at- 
tention has been given the hypothesis 
in recent years since it does not easily 
fit certain data from animal studies 
(Kimble, 1961). However, since the 
hypothesis can be conceived to stem 
from the search for parsimonious ex- 
planation, some form of it may warrant 
reexamination, at least with regard to 
certain behavioral situations. 

The present study undertook such a 
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reexamination for a type of elementary 
decision making in which human Ss 
predict which of two events will occur 
on a series of occasions. The problem 
investigated was the persistence of er- 
roneous prediction during an "extinc- 
tion" series of recurrences of only one 
event, as a function of the length of the 
cycle of a simple repeated pattern of 
events in acquisition. Two "response- 
unit" hypotheses, generating different 
predictions, were tested. The first 
(Hypothesis 1), a purely empirical ex- 
tension of the Mowrer and Jones no- 
tion to the present situation, was that 
the number of trials to reach extinction 
is constant when the dependent vari- 
able is defined in units of the acquisi- 
tion cycle of events. The second 
(Hypothesis 2) assumed that once S 
has acquired a sequence of responses, 
corresponding to and resulting from a 
cyclic pattern of acquisition events, 
the sequence tends to function as a 
larger response unit, and that during 
extinction, it is the tendency to react 
with the whole unit of responses that 
is replaced by a tendency to predict 
only the event occurring in extinction. 
"The core of the hypothesis is that such 
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response units are equally resistant to 
extinction regardless of the number of 
responses comprising the unit. Unlike 
the first hypothesis, the second takes 
into consideration the relationship be- 
tween the extinction events and the 
specific responses occurring in the ac- 
quisition pattern. In addition the rela- 
tionship of S's extinction behavior to 
his verbalized awareness of the pattern- 
ing in the acquisition series as well as 
his learning of the pattern was inves- 
tigated. 


METHOD 


Apparatus and task.—The S's task was to 
predict, following a warning buzzer, which 
of two events (A or B) would occur. The 
events were 1-in. diameter spots of red light 
appearing in viewing windows in front of S 
slightly to his right and left. In front of 
each viewing window was a switch with 
which S indicated his prediction of the 
location of the next spot of light. Presenta- 
tion of the buzzer and lights was precisely 
controlled such that the following sequence 
of events, each 2 sec. in duration, recurred: 
(a) warning buzzer, (b) response interval, 
(c) an event light, (d) intertrial interval. 
The rhythmicality of the sequence was ac- 
centuated by clearly audible clicks from the 
control equipment occurring at the beginning 
of and 0.5 sec. before each 2-sec. interval. 

Design and subjects—The independent 
variables of primary interest were “length 
of pattern” and “type of shift.” Length of 
pattern refers to the number of elemental 
events in one cycle of the recurring pattern 
of events in acquisition. The patterns em- 
ployed were one, three, and seven events of 
Class A followed by a single B event yielding 
lengths of two, four, and eight. During 
extinction, when the events were of the A 
class, S was said to have undergone a “major 
shift” from acquisition to extinction, because 
the extinction events were all of the class 
from which occurred the majority of events 
within an acquisition cycle, Similarly when 
the extinction events were of the B class the 
phrase “minor shift" was applied. For a 
pattern of Length 2 the terms simply dis- 
tinguished a shift to A from a shift to B 
since there was no “major event.” 

A third variable was the number of times 
a pattern was repeated prior to extinction. 
The data herein reported came from two ex- 
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periments, identical except for this variable. 
In the initial study three and seven repeti- 
tions were used, yielding, for the extinction 
data, a 3X 2X2 design. Four male and 
four female volunteers from introductory 
psychology classes were randomly assigned 
to each treatment combination. For two Ss 
of each sex within each treatment combina- 
tion, the A events were lights in the window 
to S’s left, while for the other two Ss they 
were to their right. Following preliminary 
analysis of the initial study, it was decided to 
replicate the work but use an intermediate 
number of repetitions, viz., five, since repe- 
titions had no detectable influence on the 
extinction data of the first study. There- 
fore, four males and four females were 
again assigned to each treatment combina- 
tion making a total of 48 Ss in addition to 
the 96 of the first study. The data from 
the two experiments are treated as a single 
study. 

Procedure.—T wo practice trials were given 
on which both lights appeared simultane- 
ously, with S forewarned of this irregularity. 
Acquisition was followed, without break, by 
extinction which continued for a number of 
trials equal to seven repetitions of the acqui- 
sition pattern, ie, 14, 28, and 56 trials for 
Lengths 2, 4, and 8, respectively. Number 
of pattern cycles instead of number of trials 
was equated in extinction in order to provide 
a substantial but equal number of opportuni- 
ties for response patterns of different lengths 
to appear, and to prevent a spurious inflation 
of the error data among Ss with short ac- 
quisition patterns but very long extinction 
Series. In fact, the data strongly indicate 
that the results did not depend on this extinc- 
tion procedure. The number of errors in 
the first 14 extinction trials yields a picture 
almost identical to the error data presented 
below. As would be expected the points for 
Lengths 4 and 8 of the latter data are 
slightly higher, but only 0.58 error on the 
average. 

Since all patterns began with an A event 
and ended with a B event the transition to 
extinction was necessarily different for major 
and minor shifts. For a minor shift the first 
trial following the final repetition of the 
acquisition pattern was a B and thus the 
first disconfirmation or extinguishing event. 
For a major shift the first disconfirmation, 
an A event, could not occur until the end 
of what would have been another repetition 
of the acquisition pattern. The extinction 
series was defined as beginning on the first 
trial after the first disconfirmation. The 
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response measures taken from this series 
were total number of errors and number of 
trials, excluding criterial trials, to reach a 
criterion of zero errors in two successive pat- 
tern lengths of trials, i.e., 4, 8, and 16 trials 
for Lengths 2, 4, and 8, respectively. Only 
eight Ss made any errors after achieving 
this criterion. Nine Ss, distributed among 
the treatments, failed to reach the criterion 
and were assigned, as their trials score, the 
number of trials in their extinction series. 

Following the experiment, E questioned S 
regarding the patterning in the event se- 
quence. The questions were, in order of 
use, (a) "Can you tell me how the lights 
flashed?" (b) "Did the lights flash in any 
regular pattern or sequence over and over?" 
(c) *Suppose I told you there was a regular 
pattern or sequence. What would you say 
that it was? Make a guess.” Questioning 
ceased with the last of these, or as soon as 
S clearly verbalized a specific pattern of 
events. 


RESULTS 


Extinction.—For both statistical and 
theoretical purposes number of trials 
to extinction was divided by the acqui- 
sition pattern length. This measure, 
called cycles to extinction, expressed 
number of trials in a unit correspond- 
ing to the length of the acquisition pat- 
tern. The statistical advantage was a 
marked reduction in the heterogeneity 
of error variance. 

Preliminary analyses of variance of 
both cycles and errors were made, with 
Pattern Length (2, 4, and 8), Type of 
Shift (major and minor), Sex of S, 
and Number of Acquisition Repeti- 
tions of the Pattern (3; 5, and 7) as 
the factors. Consonant with Hypoth- 
esis 1 Pattern Length produced marked 
differences in resistance to extinction 
when expressed in number of trials, 
but only negligible differences when the 
unit was number of cycles, F (2, 108) 
<1. Number of errors was affected, 
however, F (2, 108) — 4.25, p < .025, 
with means of 2.8, 44, and 3.0 for 
Lengths 2, 4, and 8, respectively. Type 
of shift was a significant factor in both 
analyses, A minor shift took more 
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cycles to extinguish, F (1, 108) 
= 10.79, p < .005, and resulted in 
more errors, F (1, 108) =823, f 
< Ol, than a major shift. 

The extinction data were related to 
two other factors. The Ss who cor- 
rectly described their acquisition pat- 
tern were called "aware," while the 
designation "learners" was applied to 
Ss who displayed no errors in the cycle 
immediately preceding the cycle in 
which the first disconfirmation oc- 
curred. Employing a stringent cri- 
terion of verbalization, agreement be- 
tween two judges on the awareness 
classification was perfect. Eighty-four 
Ss were categorized as aware, 85 as 
learners, and 75 Ss, "aware learners” 
(ALs), met both criteria. The ALs 
were selected to test the view that the 
response-unit hypotheses would most 
closely predict the extinction behavior 
of Ss who had learned the acquisition 


NALs 


TRIALS 


CYCLES 


ERRORS 


4 
PATTERN LENGTH 


Fic. 1. Mean trials and cycles to criterion, 
and errors in extinction for major and minor 
Ss in both ALand NAL categories. (Heavy 
lines are least-squares best-fitting lines. 
Light lines in the AL trials plot are best- 
fitting lines through the origin.) 
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F:6.2. Percentage of Ss responding erroneously on the 20 extinction trials 
immediately following the first disconfirmation (D). 


pattern. Allother Ss were grouped to- 
gether into the category nonaware 
and/or nonlearners (NALs). 

The frequency with which Ss became 
ALs decreased with pattern length 
and increased with repetitions. Fig- 
ure 1 presents the mean number of 
trials, cycles, and errors for the two 
categories of Ss pooled across repeti- 
tions. Of the 24 Ss in a length-shift 
combination the number of ALs was 
18 for major and 17 for minor shifts 
of Length 2; 14 and Jis respectively, 
for Length 4; and 9 and 6, respectively, 
for Length 8, Least-squares best-fit- 
ting lines for the several conditions are 
shown in Fig. 1, along with lines of 
best fit made to pass through the origin 
for the ALs' trial data. Special inter- 


? Details may be obtained by writing the 
author. 


est resided with these latter lines be- 
cause they should describe the data if 
trials to extinction is simply the result 
Of some constant number of nonre- 
inforced repetitions of the patterns 
(Hypothesis 1). In terms of cycles 
this relationship should be displayed 
in zero-slope trends with the mean 
lines approximating the slope constants 
for trials. 

Tests of these relationships were 
conducted on the regression coefficients 
of the lines for cycles and errors. 
Neither of the slopes for the ALs 
cycle data was significantly different 
from zero, #(39) = 0.498, and t(32) 
= 0.979 for major and minor Ss, Te- 
spectively. For the errors measure the 
major ALs’ regression coefficient was 
again not significant, 1(39) = 1.180, 
although the coefficient for the minor 
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Ss was significantly greater than zero, 
t(32) = 2.814, p « 01. Comparing 
the coefficients of major and minor 
ALs, analyses indicated that they did 
not significantly differ from each other 
in either the cycles data, t(71) = 0.798, 
or the error data, #(71) = 1293. 
Neither did the major and minor 
NALs differ significantly in their re- 
gression coefficients. Because of lesser 
interest in the NALs' slopes, only the 
mean of the major and minor coeffi- 
cients was tested for deviation from 
zero. There was a significant negative 
slope to the cycles coefficient, £(65) 


= 2.013, p< 05. The coefficient for 
errors was not significant, +(65) 
= 1.822, 


To more directly assess the reliabil- 
ity of the gross differences in trend be- 
tween ALs and NALs, major and 
minor coefficients were averaged within 
each category. These means were 
significantly different for both cycles 
and errors, #(136) = 3.032, p < .01, 
and 1(136) = 3.476, p < 001. 

Evidence regarding the presence and 
nature of response patterning may be 
obtained by examining the percentage 
of errors on each of the extinction 
trials immediately following the first 
disconfirmation (D) as shown in Fig. 
2. Clear response patterning may be 
observed in Lengths 4 and 8 in the 
increased percentage of erroneous Te- 
sponses just anticipatory to the trial 
on which a B event would have oc- 
curred had acquisition continued. This 
“negative recency” effect (Jarvik, 
1951) is sharper and more pronounced 
among the ALs than the NALs. 

Acquisition—The NALs, though 
failing to meet the pattern learning and 
awareness criteria, displayed some 
probability learning. In the later 
cycles Ss having the longer patterns 
made a lower percentage of errors by 
learning to predict the major event 
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more frequently. Similarly the ALs 


displayed probability learning prior to 
their complete elimination of errors.? 


Discussion 


As an approximation Hypothesis 1 re- 
ceived some support from the nonsig- 
nificant effect of pattern length on the 
cycles data of all Ss. Dividing the Ss 
into ALs and NALs, however, suggests 
that the hypothesis may be tentatively 
accepted for ALs but not for NALs, 
The untenability of Hypothesis 2 may 
be determined from the pattern of errors 
in extinction (Fig. 2). Both major and 
minor ALs displayed highly similar func- 
tions despite the fact that for minor Ss 
the pattern of specific responses was quite 
opposite to that executed in acquisition. 

Attempting to account for these results 
within an S-R analysis raises problems 
regarding the functional roles of the se- 
quence of predicted events and S’s reac- 
tions, A typical approach in the irregu- 
lar-event probability-learning task has 
been to consider the events solely as 
reinforcers, with neither the events nor 
the responses providing stimulus traces 
on subsequent trials. Such definitions 
cannot, of course, account for certain 
sequential dependences (Anderson, 1964; 
Jarvik, 1951) not to mention the learn- 
ing of regular event patterns. Faced with 
such data one approach (eg. Restle, 
1961) has been to assume that traces 
from the series of events can singly or in 
combination provide differential cues for 
patterned responding. As an alternative 
to this "stimulus unit" view, it could be 
assumed that responses on successive 
trials become associated through feedback 
traces which provide the differential cues 
for patterned responding, with the events 
functioning primarily as reinforcers. Re- 
curring sequences of such feedback- . 
chained responses might, then, be con- 
ceived to underlie the response units of 
Hypothesis 2. 

While this latter, response-unit concep- 
tion is not supported by the present data, 
neither is the stimulus-unit view. Again 
the difficulty stems from the rapid trans- 
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fer of the acquisition pattern of re- 
sponses to the exact inverse pattern in 
extinction by minor shift ALs. To ac- 
count for this inverted pattern appeal 
presumably would have to be made both 
to stimulus and response generalization, 
since a “new” pattern of stimulus traces 
from a series of B events in extinction 
elicits not the response associated with 
the “similar” acquisition pattern, but the 
alternative one. Unfortunately the con- 
ditions under which such generalization 
is to be invoked are not clear. Moreover 
such generalized responding might be ex- 
pected to be weaker than the original 
mode of response, but Fig. 2 indicates 
that, if anything, it was stronger. 
Similar difficulties would seem inherent 
in any stimulus sampling theory or other 
“single-stage S-R” model (Kendler & 
Kendler, 1962) in which overt responses 
become associated directly with some sort 
of stimulus trace. The rapid transfer of 
the minor ALs suggests that not only is 
an appropriate sequence of overt re- 
sponses acquired but S also learns cov- 
ertly the sequence of same-different rela- 
tions among successive events. It is this 
latter sequence which, similar to media- 
tional activity in concept identification, 
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appears to persist following the shift and 
to guide the negative recency behavior. 
Moreover, the fact that the evidence for 
relational responding is stronger among 
the ALs appears consonant with the ap- 
parently close relationship between me- 
diation and verbalization (Kendler & 
Kendler, 1962). 
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NATION AND RANDOM PARTIAL-REINFORCEMENT CON- 
DITIONS WITH A 24-HOUR INTERTRIAL INTERVAL* 
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Rats were run 1 trial a day for 192 days on single alternations of 
reward and nonreward and on a random 50% pattern of reward and 
nonreward. No evidence for patterned running was found in the single 
alternation group or for the aftereffects of reward and nonreward in 
the random group. Large differences were found in extinction 
between these groups and a continuous control, both partial groups 
showing almost no extinction over 44 days relative to the continuous 
control group. The slight extinction differences between the single 
alternation and random groups were attributable to asymptotic acquisi- 
tion performance differences. The results were discussed in relation 
to hypotheses that appeal to carried-over traces of the aftereffects of 


reward and nonreward from one trial to the next. 


Lawrence (1958) pointed out that 
explanations of the partial-reinforce- 
ment extinction effect (PRE) were of 
two kinds: intertrial and intratrial. 
Intratrial explanations ascribe the 
PRE to learned mechanisms that de- 
velop on each trial in acquisition and 
have their effects on later acquisition 
trials and in extinction. For example, 
Amsel's (1962) explanation of the 
PRE "attributes the PRE to the evo- 
cation of frustration by nonreward in 
partial reinforcement, to the condition- 
ing of anticipatory frustration (rr-sr) 
to cues in the instrumental sequence, 
and to the association of sp with ap- 
proach responses in acquisition [p- 
311]."  Intertrial approaches, on the 
other hand, attribute the PRE in one 
way or another to immediately carried- 
over traces of stimulation (stimulus 
aftereffects of reinforcement) from one 
trial to the next in partial-reinforce- 
ment (PRF) acquisition. Such inter- 
pretations assume that these traces of 
reward and nonreward make acquisi- 
tion more clearly discriminable from 
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extinction in continuously than in par- 
tially reinforced Ss. 

The Hull-Sheffield hypothesis (Shef- 
field, 1949), an S-R intertrial position 
which appealed to stimulus aftereffects, 
has stimulated a considerable amount 
of experimental research (e.g., Jenkins, 
1962; Theios, 1962; Weinstock, 1954; 
Wilson, Weiss, & Amsel, 1955). 
These experiments have shown that 
the intertrial mechanisms, no matter 
how plausible, cannot by themselves 
account for some of the partial-rein- 
forcement phenomena. 

Some recent experiments by Capaldi 
and others (eg. Capaldi & Senko, 
1962; Capaldi & Spivey, 1964b; Ca- 
paldi & Stanley, 1963 ; Capaldi, Turner, 
& Wynn, 1962) based on a modifica- 
tion of the Hull-Sheffield position, have 
revived interest in intertrial positions 
by claiming to demonstrate carried- 
over aftereffects or "memory traces" of 
reward and nonreward from one trial 
to the next, even at 24-hr. intertrial 
intervals (ITIs). According to the 
intertrial explanation of the PRE pro- 
posed by Capaldi (1962), the greater 
resistance to extinction exhibited under 
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PRF as compared with continuous re- 
inforcement (CRF) is due to the con- 
ditioning of the stimulus aftereffects of 
nonreward (NR) to locomotion on 
succeeding reward (R) trials. That 
is, a reinforcement serves to condition 
the aftereffects of NR from the preced- 
ing trial (or trials) to the approach 
responses, and thus in extinction the 
PRF Ss will persist in running longer 
than CRF Ss which have not had the 
opportunity for conditioning of the 
aftereffects of NR to R trials, 

It seems quite reasonable to enter- 
tain the possibility of stimulus after- 
effects of reward and nonreward in rats 
under conditions of highly massed 
trials. However, the appeal to such 
traces in the explanation of partial- 
reinforcement phenomena, when very 
long ITIs are employed (e.g., Wagner, 
1961; Weinstock, 1954), requires an 
unequivocal demonstration that rats 
can use the cue provided by reward or 
nonreward on Trial n as a basis for 
performance on Trial n + 1 when these 
trials are separated by, say, 24 hr. 

The experiment we report involves 
three reinforcement conditions run at 
a 24-hr. ITI: single alternation of R 
and NR trials, random presentation of 
R and NR trials on a 50% schedule, 
and continuous reinforcement. The 
experiment is designed to investigate 
the stimulus aftereffects hypothesis by 
running these schedules under con- 
ditions of rigorous control of extrane- 
ous sources of stimulation, so that any 
effects on Trial n + 1 must be the re- 
sult of the Trial n reward or nonreward 
rather than some source of extraneous 
stimulation on Trial n + 1, 


METHOD 
Subjects 


The Ss were 33 male Wistar albino rats, 
approximately 90 days old, supplied by 
Woodlyn Farms, Guelph, Ontario. They 
were housed in individual cages and main- 
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tained on ad libitum water. The daily ra- 
tion of food was 10 gm. of Purina lab chow 
checkers, which included the total weight of 
food pellets given as reward in the experi- 
ment. 


Apparatus 


A straight wooden alley contained, in 
series, an 11 X 3 in. entry box, a 10} X 25 in. 
start box, a 27 X 2% in. runway, and a 12 
X 2% in. goal box. Height was 3 in. through- 
out. The entry box and start box were 
painted flat gray, while the runway and 
goal box were painted flat black. All sec- 
tions were covered with hinged clear Plexi- 
glas lids. Guillotine doors separated the 
adjacent sections: the entry-box to start-box 
door was of sheet metal, painted flat gray; 
the start-box to alley door of clear Plexiglas; 
and the alley to goal-box door of sheet metal, 
painted flat black. A black, 2%-in. wide food 
cup was mounted approximately 2 in. from 
the floor on the end wall of the goal box. 

Three time measures of locomotor per- 
formance were provided. Starting time was 
measured from the opening of the start-box 
door to the interruption of a photobeam 1o- 
cated 12 in. into the alley. Running time 
was measured from the interruption of the 
first beam to the interruption of a second 
beam located 12 in, beyond the first. Goal 
time was measured from the interruption of 
the second beam to the interruption of a 
third beam located 12 in. from the second and 
9 in. into the goal box, ie. 3 in. before the 
end wall of the goal box. A microswitch 
on the start door and the three photobeams 
controlled three timers which recorded to 
-01 sec. 


Procedure 


Experimental design—The Ss were ran- 
domly assigned to one of three groups, a 
continuously rewarded group (Group C), a 
random (50%) PRF group (Group R), and 
a group given a single alternating pattern of 
partial „reinforcement ( Group SA). 

Habituation—The Ss were placed on a 
10-gm. Per day deprivation schedule 43 days 
prior to the start of the experiment, during 
which time they were handled and tamed. 
There was no pretraining or preliminary 
exploration of the alley. 

Experimental training—A training trial 
Was initiated with the introduction of S into 
the entry box. The entry-box door was then 
raised, and lowered as soon as S entered the 
Start box. The start-box door was raised 
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approximately 2 sec. later. When S had 
entered the goal box the retrace door was 
lowered, and S was removed 30-35 sec. later 
on NR trials and immediately after eating 
on R trials, the two intervals being approxi- 
mately the same. 

The Ss were tested in three squads of 11 
and the three experimental groups were rep- 
resented within each squad as equally as 
possible. The position of each S within a 
squad (ie. in the carrying cage) was ran- 
domly assigned at the beginning of the 
experiment and remained the same through- 
out, but the order of running Ss from day 
to day was randomized. Moreover, every 
attempt was made not to provide S with 
systematic cues for R and NR trials. For 
example, E “false-baited” the food cup 
prior to NR trials and brushed out the goal 
box after every trial, rewarded or not. The 
possibility of olfactory discrimination was 
reduced by having the floor under the goal 
box (not visible to S) covered with food 
dust. 

Following the testing of all 11 Ss in a 
squad, they were returned to their home 
cages, and were given their daily feeding 30— 
40 min. later. 

Acquisition—The Ss received a total of 
192 trials, 1 trial per day. Reward was one 
500-mg. Noyes pellet. The sequence of 
reward and nonreward for Group R was ar- 
ranged as follows: R NR R NR NR R NR 
RNRRRNRRNRNRR. The se- 
quence was repeated every 16 days, there 
being 12 such sequences. Group SA was 
given a single alternating pattern of PRF: 
Trial 1 was rewarded for all Ss, Trial 2 
nonrewarded. 

Extinction.—The Ss were run 44 extinc- 
tion trials, 1 trial per day. Extinction Trial 
1 was administered to each group on Day 
193, and the other extinction trials were run 
on consecutive days. The procedure was 
essentially the same as that for acquisition 
with the exception that there was no food in 
the goal box at any time. 


RESULTS 


Two Ss died during the course of 
the experiment, one from Group C dur- 
ing acquisition and one from Group R 
during extinction. The number of Ss 
per group was therefore reduced to 10 
by randomly removing the data from 
one S from Group SA, 


Acquisition 

Figure 1 presents for Group SA 
mean median times for R and NR 
trials, and also for trials following re- 
ward (TFR) and trials following non- 
reward (TFNR) in this case, for 
starting, running, and goal measures. 
Figure 2 presents the data for Group 
R showing, in the right-hand panels, 
mean median times for the TFR vs. 
TFNR comparison and, in the left- 
hand panels, the R vs. NR comparison 
for the same data. The results for 
Group C are included in each panel, 
for comparison purposes,” 

There is no evidence of differential 
performance on R and NR trials or 
on TFR and TFNR trials for either 
Group SA or Group R over the 192- 
day period. The starting and running 
curves in each case are virtually in- 
separable, as are the goal curves for 
Group R. The goal-time curves for 
Group SA show somewhat greater 
variability from one block to the next 
but are completely interlaced. There 
is some difference between these (par- 
tial) goal-time curves and the perform- 
ance level of the continuous group, a 
difference often found in the goal 
measure of partial-continuous com- 
parisons. 

Goal times for the last 44 acquisition 
trials of the SA group were con- 
verted to reciprocals and an analysis 
of variance was performed, Perform- 
ance was not significantly different for 
type of trial (R and NR), F (1, 210) 
= 2.83, but the Day x Type interaction 
attained significance, F (21, 210) 
= 2.01, p<.05. This latter finding 
would seem to indicate that perform- 


2 The data of Fig. 2 were replotted in the 
more usual fashion comparing CRF with 
PRF acquisition. The typical PRF acquisi- 
tion effect (Wagner, 1961) was found, ie., 
PRF Ss were faster than CRF at asymptote 
in the start and run measures, but ran more 
slowly in the goal measure. 
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Fic. 1. Mean median times for starting, 
rewarded and four nonrewarded trials in each 
Group C are plotted in eight-trial blocks.) 


ance on R and NR trials was changing 
differentially over days. However, a 
t test on mean goal speeds for R vs. 
NR trials of the SA condition did not 
show statistical significance, ¢ (21) 
= 267, p > .05. The .05 level of sig- 
nificance of the Day x Type interac- 
tion is probably attributable to the 
large number of degrees of freedom 


running, and goal measures for the four 
eight-trial block of Group SA. (Data for 


involved and the day-to-day fluctua- 
tions of the data. 


Extinction 


Figure 3 presents, for all three 
groups, mean median times in blocks 
of four trials for the starting, running, 
and goal measures during the 44 ex- 
tinction days. It may be observed 
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that CRF led to extremely rapid ex- 
tinction relative to PRF. This was to 
be expected. There appear, however, 
to be extinction differences between 
Groups R and SA, extremely small 
relative to the differences between the 
CRF and PRF extinction curves. 
Analyses of variance between Groups 
R and SA performed on reciprocals of 
the raw scores showed (a) that the 
group difference was significant in all 
three performance measures, F (1, 18) 
= 5.82, p « .05; F (1, 18) = 8.65, p 
< .01; and F (1, 18) = 10.91, p < .01 
for starting, running, and goal meas- 
ures, respectively; (b) that the effect 
over days was significant, F (43, 774) 
= 5.36, 7.42, and 7.88, respectively, 
for starting, running and goal meas- 
ures, p < .01 in all cases; and (c) that 
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the Group X Day interaction did not 
attain significance in any measure. 
The fact that the Group main effect 
was significant and the lack of a reliable 
Group X Day interaction indicated 
that, although the group levels were 
different, the two groups were not ex- 
tinguishing at different rates, and that 
the difference might have been due to 
different acquisition asymptotes. Ac- 
cordingly, analyses of variance were 
performed on reciprocals of raw scores 
for the last 44 days of acquisition. The 
analyses revealed (a) that performance 
of the two groups in the starting and 
running measures was not significantly 
different, F (1, 18) — 2.08 and 2.09, 
respectively, p > .05 in both cases, but 
that the two groups differed reliably in 
the goal measure, F (1, 18) — 5.30, 
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p «.05; and (b) that the Day effect 
was significant for the starting, run- 
ning, and goal measures, F (43, 774) 
— 940, 4.86, and 4.00, respectively, 
p < 01 in all cases; and (c) that the 
Group X Day interaction did not attain 
significance in any measure. 

The group differences at the end of 
acquisition account for the early extinc- 
tion differences. This, coupled with 
the lack of a significant Group x Day 
interaction over all of extinction, makes 
it hazardous to take the differences in 
level of the SA and R groups in ex- 
tinction as an indication that resistance 
to extinction was greater for Group R 
than for Group SA. 


Discussion 


We ran rats one trial a day for 192 
days on single alternations of reward and 
nonreward and on a random 50% pattern 
of rewards and nonrewards and found no 


evidence of patterned performance in the 
SA group or of TFR and TFNR effects 
in the random group. There were enor- 
mous differences in extinction between 
these groups and a continuous control, 
both partial groups showing almost no 
extinction over 44 days relative to the 
continuous reward group. There was 
some slight suggestion of a difference 
in extinction between the SA and random 
groups, but we could have little confi- 
dence that this difference could not be 
accounted for by a corresponding differ- 
ence in acquisition performance level be- 
tween these groups. We are, therefore, 
forced to conclude that carried-over 
traces or memories of goal events on the 
previous day is not a factor in the be- 
havior of the rat run one trial a day on 
a partial-reinforcement schedule, and that 
such traces or memories should not enter 
into explanations of the PRE in rat ex- 
periments conducted at intervals of 24 hr. 
or more. Whether stimulus traces of this 
sort operate to affect performance at 


PRE WITH 24-HOUR INTERTRIAL INTERVALS 


shorter intervals is still not entirely clear, 
and the answer will await a program of 
careful experimentation testing for the 
presence of such effects at a variety of 
intertrial intervals. As we indicated 
earlier, there is no reason in principle— 
certainly so far as S-R theorists are con- 
cerned—why stimulus aftereffects from 
reward and nonreward on Trial n should 
not form part of the effective stimulation 
on Trial 5 4- 1 when the ITIs are very 
short. 

Two recently published experiments on 
aftereffects must be discussed in the light 
of the present finding: The first is by 
Capaldi and Spivey (19642), who have 
claimed to show the presence of after- 
effects at 24-hr, ITIs, using the so-called 
intertrial reinforcement (ITR) method. 
They assume that a given aftereffect will 
remain functional, even over 24 hr., until 
replaced by another aftereffect in the 
same or in a similar stimulus situation. 

In the ITR method, S is returned to 
the goal box 30 sec. after removal follow- 
ing completion of a trial and is rewarded 
there, The assumption is that the ITR 
given following NR replaces the after- 
effects of NR with the aftereffects of R, 
and should thus reduce or, indeed, elimi- 
nate the PRE.  Capaldi and Spivey's 
results were as predicted: Ss given ITRs 
following those NR trials which preceded 
R trials were less resistant to extinction 
than Ss given ITRs following all R trials. 
Capaldi and Spivey interpret their results 
to mean that aftereffects remain func- 
tional for 24 hr. 

An alternative interpretation is that 
ITRs given very shortly after NR ac- 
tually convert the NR trial to an R trial, 
and that ITRs following R trials simply 
add to the magnitude of reward on those 
trials. If this is correct, the Capaldi and 
Spivey experiment involves a total of 25 
R vs. 6 NR trials for the group showing 
less resistance to extinction, against a 
total of 19 R vs. 12 NR for the group 
showing more resistance to extinction. 
In the latter case, more NR trials would 
allow, in our terms, for greater condition- 
ing of rp-sp to approach and thus lead to 
greater persistence in extinction. Also, 
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double reinforcement given on 6 of the 
R trials in the latter group would lead 
to more 7p-Sp, which in turn would lead 
to the greater strength of rp-sp during 
acquisition. In any event, the evidence 
from our present experiment would not 
support their conclusion that the effects of 
an ITR carry over a 24-hr, interval. 

The second experiment, also by Capaldi 
and Spivey (1964b), appeared when our 
experiment was almost completed, and is 
in essence a duplicate of our SA condi- 
tion. Rats were run on an SA schedule 
for 126 consecutive days, 1 trial a day, 
and clear evidence of pattern running 
emerged in their goal measure by the 
fifth block of trials (46 trials). The 
other noteworthy finding of their experi- 
ment was that, in the start and running 
measures, performance improved both on 
R and NR trials until the third block of 
trials and then deteriorated (vigor de- 
creased) on both kinds of trial to the end 
of the experiment; so that the acquisition 
curves in the start and running measures 
look like extinction curves after the first 
few blocks of trials. It is not clear to us 
why Ss operating on the basis of after- 
effects from the previous day's trial 
should slow down on both NR and R 
trials in the early segments of a response 
chain. 

There is some reason to suspect that 
the pattern running reported by Capaldi 
and Spivey occurred on some basis other 
than "memories" of the previous day. 
They used different but, as they say, 
"identical" food cups, one holding wet 
mash on R trials, the other empty on NR 
trials. There is no suggestion in the 
article that the possibility of olfactory 
discrimination was eliminated, and the 
strong discrimination in the goal measure 
would suggest this possibility. As a 
matter of fact, it is difficult to understand 
why, if S “remembers” the previous day's 
trial, he should show this memory very 
clearly close to the goal and not at the 
start, particularly since S is put into a 
"wait cage" for 15 sec. after being taken 
from his home cage and before being 
placed in the start box. Surely this inter- 
val of 15 sec. would provide a greater 
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basis for recollection of the previous day's 
trial than the second or two of time S 
spends in the alley. The very clear dis- 
crimination near the goal and the use 
of wet mash and separate goal cups for 
R and NR trials implicate olfactory dis- 
crimination as a possible basis for the 
Capaldi and Spivey result. 

The other possible difference between 
their procedure and ours is in the possi- 
ble presence of extraneous sources of 
stimulation which are different before a 
reward trial and a nonreward trial. We 
took great pains to provide S with no 
systematic cues for an R and an NR 
trial. Capaldi and Spivey are not specific 
on this, but their procedure of placing 
S in a wait cage before each trial might 
have promoted possible exposure to dif- 
ferential cues associated with R and NR 
trials, such as putting wet mash into the 
food cup, and other movements and 
noises made by E that may be different 
on R and NR trials unless great care is 
taken to prevent such differential stimu- 
lation. 
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ACQUISITION AND RETENTION IN SHORT-TERM MEMORY * 
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Retention in short-term memory was studied by manipulating rates of 
presentation (from 1 to 10 digits per second), the type of digit presen- 
tation (spoken, computer spoken, and visually presented), the type of 
item (single digits, paired digits, and nonsense sounds), and the type 
of test (recall and recognition). Performance in short-term memory 
experiments is attributed to interactions among 3 different processes: 
acquisition, retention, and decision. Rate of presentation, length of 
list, type of item, and modality seemed mainly to affect the initial 
acquisition of items in memory. The rate of forgetting depended 
mainly upon the number of items presented between the critical item 


and its test. 


Some things can be remembered im- 
mediately after their presentation but 
forgotten soon thereafter. The rate 
at which this forgetting takes place 
may depend upon many factors but, on 
the whole, it appears to be remarkably 
similar under a number of different 
experimental conditions. 

Waugh and Norman (1965) sug- 
gested that the recall of an item (the 
critical item) depends upon two things: 
the likelihood that it is still in a tem- 
porary memory system (primary mem- 
ory) and the likelihood that it got into 
a longer term memory (secondary 
memory). Many factors affect the 
likelihood that the critical item got into 
secondary memory—e.g., its context 
and the amount of attention it received 
—but these factors are presumed to 
remain constant during most short- 
term memory experiments. Immediate 
forgetting is a result of the transient 
nature of primary memory. 

To test these hypotheses, Waugh and 
Norman devised the “probe-digit” ex- 


1 This investigation was supported, in part, 
by a United States Public Health Service 
Research Grant MH-08083-02 from the Na- 
tional Institutes of Health to the Center 
for Cognitive Studies, Harvard University. 

2 Now at the University of California, San 
Diego. 
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periment to minimize the effects of 
long-term (secondary) memory and 
response interference. The results of 
the probe-digit experiment were com- 
pared with the recall measures obtained 
by several Es in a variety of experi- 
ments. | When allowances had been 
made for response interference and 
long-term memory in these other ex- 
periments, there was reasonably good 
agreement among the experimental re- 
sults. The observed rate of forgetting 
seemed to depend mainly upon the 
number of interfering items and was 
the same for all experiments, regardless 
of the type of item involved or the rate 
of presentation. 

In a study of the serial-position ef- 
fect in recognition memory, Wickel- 
gren and Norman (in press) found a 
similar consistency in the rate of for- 
getting. The forgetting of three-digit 
numbers in a short-term memory ex- 
periment could be described by the 
same exponential decay of memory- 
trace strengths with the number of 
interfering items, regardless of the 
length of the list. Memory strength, in 
this case, was measured from the op- 
erating characteristics. 

In this study the author continues 
the analysis of short-term memory us- 


370 


ing the probe-digit experiment and 
manipulating rates of presentation 
(from 1 to 10 digits per second), the 
type of digit presentation (spoken, 
computer spoken, and visually pre- 
sented), the type of item (single digits, 
paired digits, and nonsense sounds), 
and the type of test (recall and recogni- 
tion). 

Performance in a memory experi- 
ment can be attributed to the inter- 
actions among three different proc- 
esses: the initial acquisition of the 
critical item, the retention of that item 
in memory, and the retrieval of the 
item from memory. Although the 
primary purpose of this study is to 
examine the retention of an item over 
time, it is not possible to do this 
without also considering the effects of 
attention and perceptual factors on 
the initial acquisition of that item and 
of response biases and guessing on its 
retrieval. 

One method of dealing with these 
various factors is to postulate an un- 
derlying process from which probability 
scores can be derived. This approach 
offers some ways of coping with re- 
sponse biases and resolving the differ- 
ences among the various tests and 
measures of memory. In the probe- 
digit experiment, for example; it is 
possible to do both recall and recogni- 
tion experiments that differ only in the 
instructions to S. It seems unlikely 
that the representation of the items in 
memory (the memory trace) should be 
affected by the instructions, although 
the probability of a correct response is 
not the same in the two experiments. 
The differences in these two experi- 
ments ought to lie in the strategies used 
by Ss to select a response. 


‘THEORY 


In general, let us assume that the 
effect of the stimulus presentation and 
experimental variables on the memory 
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trace can be summarized along a 
unidimensional scale designated as 
strength. The location of a particular 
memory item along this scale reflects, 
to a large measure, the type of pres- 
entation it received and interference it 
has encountered. The S chooses his 
response on the basis of the strengths 
of the memory items in question. Be- 
cause he makes mistakes, we assume 
the strength values are probabilistic, 
rather than deterministic: The nature 
of his mistakes allows us to specify the 
nature of the distributions of memory 
strengths. Strength models of this 
form are not new to psychology. They 
were extensively studied by Thurstone 
(1927) and later in psychophysics as 
theories of signal detection (Swets, 
Tanner, & Birdsall, 1961). They are 
relatively new in memory, although 
they bear some resemblance to the 
model by Hull, Hovland, Ross, Hall, 
Perkins, and Fitch (1940). A detailed 
description of strength models can be 
found in Norman and Wickelgren 
(1965) and Wickelgren and Norman 
(in press). 

There are two strength distributions 
of interest: The distribution for the 
critical item and the distribution of 
items that were not presented in the 
experiment (new items), Strength is 
measured by the separation of these 
two distributions expressed in units of 
their standard deviation. This distance 
is equivalent to the d’ measure of sig- 
nal-detection theory. The analysis re- 
quires that the two distributions both 
be normal with equal variances, but 
these assumptions can be tested. If 
the assumptions that the same memory 
traces are used in recall as in recogni- 
tion are true, the results of one experi- 
ment can be used to predict the results 
of the other. 

In the recognition experiment it is 
assumed that S chooses his responses 
by selecting a particular strength value 
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as a criterion. He responds yes if and 
only if the memory-trace strength of 
the test digit exceeds the criterion. 
He selects his confidence ratings in 
much the same way. Strengths that 
are far greater or less than the cri- 
terion receive high confidence ratings 
(Egan, 1958; Norman & Wickelgren, 
1965). 

A different strategy must be used in 
recall. Recall is a complex task in the 
probe-digit experiment because S must 
both recognize the probe as having oc- 
curred in the list and then recall the 
digit with the highest forward asso- 
ciation to the probe. The analysis of 
the situation is made somewhat easier 
because S knows that the only possible 
responses are the digits used in the ex- 
periment, excluding the probe itself. In 
recall, assume that S assesses the rela- 
tive strength of each response and 
chooses as his answer the one with the 
highest strength. If there are k items 
in the response set, the probability that 
the ith will have a strength greater than 
all of the k — i others is 


p(recall) 
-f AOL TL FjG)Ms, [13 


isi 


where F; (x) = f_.” fj(y) dy and fi() 
is the probability density function of 
strengths for response i. This is the 
same result used by Green and Birdsall 
(1964), to study the effect of vocabu- 
lary size on articulation score. It is 
most convenient to compare recall with 
recognition by reversing the equation, 
using the probability of correct recall 
to estimate the underlying strengths, 
and comparing these estimates with the 
recognition data. To do this we make 
two assumptions for mathematical con- 
venience: first, that the means of dis- 
tributions of all the incorrect items are 
zero and the mean of the distribution 
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of the correct item is d (7) ; second, that 
all the distributions are normal with 
unit variance. 

Recently, Wickelgren and Norman 
(in press) have found that the mean 
of the strengths for the short-term 
memory trace in a recognition experi- 
ment can be described by the exponen- 
tial function 


d(i) = adi, (21 


where i is the number of items pre- 
sented after the critical item. The 
parameters o and ¢ can be interperted 
as describing the acquisition phase (or 
initial strength) and the retention 
phase (or rate of forgetting), respec- 
tively. Equation 2 gives a way of 
estimating the relative sizes of the two 
processes in different recognition ex- 
periments, and Equation 1 lets us 
transform recall scores into strength 
for analysis by Equation 2 and com- 
parison with recognition measures. 
This linkage of recall and recognition 
ignores many difficulties. For one, 
the strengths in recall relate to the as- 
sociation between the critical item and 
its successor. In recognition the 
strength refers to the association be- 
tween the critical item and contextual 
cues. These are not the same things, 
although they are probably related. At 
this stage in the development of the 
theory, however, it seems best to test 
what we have got, and return to these 
other problems when we understand 
more about the nature of the associa- 
tions. 


=a; 


0=¢31, 


GENERAL METHOD 


The experiments studied here are all 
similar to the probe-digit experiment. In 
each experiment a number of lists are 
presented, followed by the probes and S's 
responses. The lists all consist of 15 items 
(although the items can be composed of one 
or two digits) followed by the probe. The 
time interval between the last item and the 
probe is the same as between any two items 
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on the list. In a recall experiment S’s task 
is to respond with the digit that followed 
the probe when it was presented in the list, 
guessing if he does not remember. The 
probe was presented only once in the list and 
once afterwards. In a recognition experi- 
ment $'s task is to decide whether or not 
the probe had occurred in the list, and re- 
spond yes (y) or no (n). This is followed 
by a judgment of his confidence in the 
response, ranging from 5 for very confident 
to 1 for no confidence. 

In each experiment a large number of ob- 
servations was obtained for a small number 
of individual Ss. The same Ss were used 
for as many different conditions as possible. 
There were frequent rests during the experi- 
ment, always followed by a number of prac- 
tice trials that were not scored. The first 
few sessions of each new condition were 
considered to be practice and also were not 
scored, All Ss received instructions to 
control their rehearsal by “thinking only of 
the last item you have heard, and not of any 
earlier ones." The rehearsal instructions 
were those used by Waugh and Norman 
(1965) and were repeated frequently during 
the first few weeks of each S's participation 
in the experiments. 


RECALL EXPERIMENTS 
Method 


In this group of experiments the digits 
and probes were presented at the rate of 1, 
4, 7, and 10 digits per second. Each list 
consisted of 15 digits and one probe. The 
probe occurred exactly once in the list, and 
once afterwards. The critical digit was fol- 
lowed by 1, 2, 3, 4, 5, 6, 8, 10, or 12 digits 
(counting the probe). The other items in 
the list consisted of random permutations of 
the remaining digits with the restrictions 
that no digit could be used more than twice 
and no digit ever followed immediately upon 
itself. 

Auditory presentation.—The lists of audi- 
tory digits were prepared, "spoken," and re- 
corded on magnetic tape by the Lincoln 
Laboratory TX-2 digital computer3 Each 
digit was derived from the voice of Douwe 
Yntema, carefully compressed in time by the 
computer analog of the "chopsplice" tech- 
nique to last exactly 100 msec. The rate of 


3I would like to thank Douwe Yntema 
for letting me use his computer-spoken digits 
and for preparing the special sequences and 
magnetic tapes necessary for the probe-digit 
experiments. 


presentation could be manipulated by varying 
the time interval between digits without 
altering the individual digits in any way, up 
to a minimum time separation of zero and 
the corresponding maximum presentation 
rate of 10 per second. The digits were all 
highly intelligible after a little practice. Re- 
call measures for lists of three digits at all 
the rates were greater than 99.5% (Yntema, 
Wozencraft, & Klem, 1964). Unfortunately, 
the digits 8 and 9 could not be compressed 
without losing intelligibility, so only the 
digits 0 (pronounced “zero”) through 7 were 
used in this study. A high-frequency tone 
followed the presentation of the probe to 
aid S in discriminating the end of the list 
and, hence, the probe. The Ss listened to 
the digit lists recorded on magnetic tape and 
played to them individually over high-quality 
earphones. 

Visual presentation.—The visual experi- 
ments were designed to be as similar as 
possible to the auditory ones except for the 
obvious differences required by the change 
in modality. Each digit was presented for 
100 msec. in a window 14 in. wide and 11 in. 
high. A masking background of alternating 
light and dark dots filled the window be- 
tween digit presentation; because the digits 
were selected mechanically by an electro- 
magnetic decoding device, there was no ap- 
preciable turn on or off time. The list 
digits were all displayed in the same win- 
dow in the same location; the probe was 
distinguished by being presented in a dif- 
ferent window, 33 in. to the left of the list 
digit’s display. The Ss sat at a teletype- 
writer in an acoustically isolated chamber. 
The display was above the typewriter, ap- 
aproximately 13 ft. from their heads. 

The display was controlled by a digital 

computer which prepared the lists and pre- 
sented them at the proper rate, accepted S’s 
responses from a typewriter keyboard, and 
scored them. In addition the computer forced 
S to take a break in the l/sec rate condi- 
tion (by displaying the words, “take a break” 
and then locking the typewriter keyboard). 
The experiment was self-paced; after S had 
made his response, the new list was not 
presented until he signaled he was ready by 
pressing the space bar. There was a 3-sec. 
wait between S's ready signal and.the list 
presentation. 
i The response time between the presenta- 
tion of the probe and S’s responses was 
measured. The measurements were made 
with millisecond precision, but mechanical 
delays introduced by the typewriter reduced 
this to about 10 msec. 
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Subjects.—Three Harvard undergraduates 
(two female and one male) served as Ss. 
Each S worked no longer than 2 hr. on any 
day (two different conditions) and was asked 
to take frequent breaks. All three Ss per- 
formed both the visual and auditory experi- 
ments, doing each of the eight conditions 
approximately five times, not including prac- 
tice sessions, and taking about 40 hr. Be- 
cause of scheduling difficulties, one S was 
unable to finish five sessions of visual pres- 
entations at the 7/sec rate. Some of the Ss 
continued the 7 and 10/sec rates with both 
auditory and visual presentations for as 
many as 10 sessions. 

Practice.—Before starting the auditory ex- 
periment, each S listened to a practice tape 
which described the nature of the digits, 
listed them in numerical order several times 
at several presentation rates, and gave a 
general introduction to the sounds of the 
digits. The first few sessions for each S 
were considered to be practice and were not 
scored. The same procedure was followed 
with the visual digits, except that no training 
was ‘necessary to recognize the digits. In 
addition, the first five trials of each experi- 
ment and the first three after each break 
were also considered practice trials. The 
Ss did not know that they were being given 
any practice trials. 


Results 


The relative frequencies of correctly 
recalling the test digits as a function of 
position from the end of the list and 
averaged across the three Ss are shown 
in Fig. 1A (auditory data) and Fig. 
1B (visual data). (Note that the 
abscissa in these figures is the inverse 
of serial position.) Some Ss did extra 
sessions at the fast rates to test 
whether they would improve with fur- 
ther practice. There was no improve- 


4 Tables of response probabilities and re- 
action time for individual Ss in the recall 
experiment and a figure of the error analy- 
sis have been deposited with the American 
Documentation Institute. Order Document 
No. 8898 from ADI Auxiliary Publication 
Project, Photo Duplication Service, Library 
of Congress, Washington, D.C. 20540. Remit 
in advance $1.25 for microfilm or $1.25 for 
photocopies and make checks payable to: 
Chief, Photoduplication Service, Library of 
Congress. 
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across three Ss. (Each point comes from a 
minimum of 150 responses. The abscissa is 
the number of digits presented between the 
critical digit and its test—counting the 
probe.) 


ment, so their extra sessions were aver- 
aged with the others. The number of 
observations varies with the conditions. 
The auditory conditions of 1, 4, 7, and 
10 digits/sec have 150, 150, 260, and 
260 observations at each test position, 
respectively, and the corresponding vis- 
ual conditions have 150, 150, 190, and 
220 observations at each test position. 

Mean response times to the visual 
digits are plotted in Fig. 2 (averaged 
across Ss) as a function of position 
from the end of the list, with correct 
responses separated from incorrect 
ones for the different presentation 
tates. The sample size for each reac- 
tion-time figure varies, of course, with 
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Fic. 2. Mean response time for correct 
(A) and incorrect (B) responses in the 
visually presented probe-digit experiment. 


the variation in the frequency of correct 
recall. As a result, the data in Fig. 2 
represent from as few as 9 responses 
(1/sec, Position 1, wrong responses) 
to as many as 152 responses (10/sec, 
Position 1, correct responses). Unfor- 
tunately, no record of the variability of 
these reaction times is available: The 
computer program kept a running total 
of the response times (grouped accord- 
ing to the categories of Fig. 2) and 
discarded the individual scores. 

The auditory and visual data at the 
slow rates ought to agree with the 
primary memory function of Waugh 
and Norman (1965), for they are ob- 
tained in an identical way. The 1 and 
4/sec data show excellent agreement 
with the function and 99% confidence 
limits of the earlier study (see Fig. 1 
and 3 in Waugh and Norman). 

Errors—Errors made by Ss follow 
a definite pattern. When S made an 
error, he was most likely to do so 
by either recalling the digit preceding 


the probe or recalling the digit that 
followed the correct response. The 
relative likelihood of these two errors 
decreased with the position of the probe 
from the end of this list and, to a slight 
extent, with rate of presentation. The 
error analysis could only be done for 
the auditory digits, because a written 
permanent record of the responses was 
kept; in the visual experiment the 
computer program scored each re- 
sponse as correct or incorrect, and then 
discarded the responses without further 
analysis, 


Discussion 


Several observations are immediately 
obvious from the response-time data of 
Fig. 2: (a) Incorrect responses are gen- 
erally slower than correct ones; (b) In- 
correct response times do not vary much 
with test position; (c) Correct response 
times appear to be nearly linear increas- 
ing functions of test-item position; (d) 
All response times are decreasing func- 
tion of presentation rate. 

The approximate linear increase in 
search time is highly suggestive of a 
serial search process (Sternberg, 1963) 
in which Ss scan their memories starting 
with the last item presented and working 
towards the beginning of the list. The 
slopes of the curves indicate that under 
this hypothesis the scanning must be 
taking place at a rate of 5.8 and 15.2 
digits/sec at the 1- and 4-sec. presenta- 
tion rates, respectively. The continual 
decrease in the slope of the response- 
time functions with increasing presenta- 
tion rates indicates that, under the linear 
search hypothesis, the search process is 
speeding up. If the item is not recalled 
the entire list must be searched, so that 
response time is both independent of the 
actual position of the digit and approxi- 
mately the same as that for digits at the 
beginning of the list (Position 12). 

These data can do no more than sug- 
gest a serial processing. Without other 
information on the distributions and, per- 
haps, different experimental methods, it 
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is not possible to distinguish between a 
linear search process and other alterna- 
tives. For example, an alternative ex- 
planation is that response time increases 
as the decisions become more difficult. 
If we assume that weak memory traces 
lead to difficult decisions, the response 
time ought to increase as the number 
of digits following the critical digit in- 
creases, Although it is hard to see why 
this process would produce a linear in- 
crease in response time, such a result is 
certainly possible. 

Neither hypothesis explains why the 
slope of the response-time functions 
should decrease with increasing presenta- 
tion rates, There are at least two possi- 
ble explanations. One, Ss paced their 
response rate to be proportional to the 
presentation rate. Two, less items were 
remembered at fast presentation rates and 
so the search or decision process was 
faster. The first hypothesis predicts a 
decrement in performance at high pres- 
entation rates as a result of the decrease 
in decision or search time. The second 
hypothesis predicts a decrease in response 
time as a result of a decrement in per- 
formance. Both hypotheses are consist- 
ent with the data and so cannot be dis- 
tinguished from each other, 

The differences observed in perform- 
ances in the various conditions could 
result from differences between the acqui- 
sition of an item into memory and the 
retention of that item in the memory. It 
seems likely that presentation rate might 
very well affect acquisition—whether for 
perceptual or other reasons—without 
changing retention (or, in this case, the 
rate of forgetting). We can examine 
this analytically by using Equations 1 
and 2 to get estimates of the two param- 
eters for acquisition and retention. To 
do this, recall probabilities have been 
transformed into strength values, d(i), 
by means of the published tables of solu- 
tions to Equation 1 (Elliot, 1964). The 
size of the response set, b, was taken to 
be 8 because that is the closest number 
to the actual values of 7 and 9 readily 
available in the tables. Estimates for 
the parameters of Equation 2 were then 
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TABLE 
ACQUISITION AND RETENTION PARAMETERS 
IN RECALL 
7 95% 
Modality | Digits ver| aA 4 | Confidence 
on ġ 
Auditory 1 4.3 41 | .72-.81 
4 2.5 .86 19-91 
7 2.0 .81 15:81. 
10 1 .78 .63-.98 
Visual 1 5.0 .69 | .56-.85 
4 4.4 69 59-81 
7 2.2 18 | .65-.95 
10 DT 45 | .62-.93 


found by determining the best linear re- 
gression line to the logarithms of strength 
values. The acquisition and retention 
parameters are the antilogarithms of the 
intercept and slope coefficients for the 
regression lines. These estimates are 
presented in Table 1, along with the 95% 
confidence limits on the retention pa- 
rameter (the antilogarithm of the 95% 
confidence limits of the regression-slope 
coefficients). Note that all the confidence 
limits overlap for retention and the acqui- 
sition parameter decreases monotonically 
with presentation rate, An F test of the 
differences in the regression-line slopes 
indicates that the slopes do not differ 
significantly, F (7, 56) =1.21, p> .05, 

The analytical comparison of acquisi- 
tion and retention depends upon the many 
assumptions involved in going through 
Equations 1 and 2 and, therefore, upon 
how well this particular theory of recall 
applies to these data. There is a graphi- 
cal analysis of the same factors that does 
not require so many assumptions, but, of 
course, provides a weaker test. If the 
curves of Fig. 1 all have the same rate 
of decay, they are different only because 
they start differently. If they can be 
superimposed by shifts along the horizon- 
tal axis, without any change in their 
vertical values, it would strongly imply 
that the only differences among the 
curves is their initial value. The results 
of this shift are shown in Fig. 5A and 5B 
for auditory and visual digits, respec- 
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Fic. 3. Data from Fig. 1A (A) and 1B 

(B) shifted along the abscissa so that the 

~ rates of forgetting can be compared. (The 
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tively. The transformed curves are in- 
deed very similar. We can compare the 
effect of modality on the forgetting 
curves by shifting the curves of Fig. 1A 
along the horizontal axis until they match 
the curves of Fig. 1B. The results of 
this shift are illustrated in Fig. 3A where 
the 1/sec visual results from Fig. 3B 
have been superimposed on the auditory 
results. This comparison indicates that 
whatever differences exist as a result of 
modality has the greatest effect on acqui- 
sition and, again, not on rate of forget- 
ting. 

We would not expect to find much dif- 
ference in the performance in the two 
modalities, because recent experiments 
indicate that visual material is often re- 
coded into an acoustical storage system 
(Conrad, 1964; Sperling, 1963; Wickel- 
gren, 1965). Under this hypothesis 
stimuli presented in either modality end 
up in the same memory; hence the only 
differences in performance would result 
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from the initial perception and recoding, 
not in the memory itself. 

Why should presentation rate affect 
performance? Yntema, Wozencraft, and 
Klem (1964) found excellent recall of 
short digit list lengths at fast presenta- 
tion rates. Yet in the present experi- 
ments even the last digit in the list was 
sometimes poorly recalled. The main 
difference in these experiments is the 
length of the list. Do the number of 
preceding digits—proactive interference 
—have an effect on the recall of the last 
digits of a list? In order to answer this 
question we need a different experiment. 


Method 


List length and presentation rate on recall 
at the end of a list.—To answer the question 
raised above Ss were asked to recall the last 
three digits of lists as list length and 
presentation rate varied. The experiment 
was done visually with the same conditions 
as before except that no probe was used. 
The presentation and scoring was done by 
the computer. List lengths of 3, 6, 9, 12, 
and 15 digits were presented at the rate of 1, 
4, 7, and 10 digits/sec. The Ss received the 
same rehearsal instructions as before but 
were asked to respond by typing only the 
last three digits they had seen, guessing if 
they did not know. Response times were 
not measured. 

There were 20 different experimental con- 
ditions (five list lengths and four rates). 
Each condition was presented in a separate 
session and each S received a different 
(random) permutation of session ordering. 
Whenever a session lasted more than 20 
min. rests were given. As before, each 
session was preceded by practice trials. No 
data were collected until each S had enough 
practice to get better than 90% of the digits 
correctly in lists of Length 3 at all presenta- 
tion rates. 

Subjects—The Ss were three Harvard un- 
dergraduates, two male and one female (not 
the same Ss used before). 


Results 


The relative frequency that each re- 
sponse was correct and in the correct 
location is plotted in Fig. 4; each point 
represents 300 observations. The fre- 
quency of correct responses drops both 
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as a function of serial position and as 
presentation rate increases. The im- 
portant result for this experiment is 
the change in recall for the next to the 
last item. This corresponds to the 
last digit in the probe-digit experiment 
which is always followed by a probe. 
Note particularly the effect of the list 
length on the recall frequencies. 


Discussion 


The results of this experiment suggest 
that performance depends upon all three 
factors: length of list, position of the 
critical item, and rate of presentation. 
A possible interpretation of these results 
is that with long fast lists, presentations 
outpace the encoding. Aaronson (1965) 
has found that when presentation rates 
exceed the encoding rate, performance 
suffers, The effect is most noticeable 
at high presentation rates and at the end 
of lists, where the cumulative effects of 
insufficient coding time reach a maxi- 
mum. An alternative interpretation of 
these results is suggested by Yntema, 
Wozencraft, and Klem (1964) who sug- 
gest that S’s ability to organize the mate- 
rial determines his later performance. 
Rate of presentation affects performance 
because slow lists give him more time to 
organize than do fast lists. Short lists 
require much less organization than long 
ones, so rate of presentation has less 
effect as list length decreases. Under 
either interpretation length of list and 
rate of presentation affect the acquisition 
of information, whereas position in list 
determines its retention. 


RECOGNITION EXPERIMENTS 


The recognition experiments fol- 
lowed the paradigm of the probe-digit 
experiment. In recognition, however, 
S’s task was to indicate whether he 
recognized the probe as having oc- 
curred in the list. The lists consisted 
of 15 items at the rate of 1 item/sec. 
The last item was followed immediately 
by a high-frequency tone and then the 
probe. The probe had occurred in the 
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Fic. 4. Relative frequency of recall of 
the last three digits from lists of various 
lengths presented at different rates. (Each 
point is the average of three Ss for a total 
of 300 observations. The data are not cor- 
rected for guessing.) 


list exactly half of the time and, when 
it did occur, it did so exactly once. 
The S responded after the probe by 
writing down yes (the probe had oc- 
curred) or no, followed by a number 
from one to five indicating his confi- 
dence in his response. 


Method 


The sequence of items and conditions and 
the scoring of the data were prepared and 
typed by the digital computer. The se- 
quences were then recorded onto magnetic 
tape (by E, not the computer), and Ss 
listened to these tapes individually with a 
tape recorder and high-quality earphones. 
Except for the necessary changes in the 
probe and responses, the procedures and in- 
structions were identical to those used for 
the other probe-digit experiments. 

Subjects.—Two Harvard undergraduates 
(one male) served as Ss. One (52, female) 
had also served in the recall experiments. 

Conditions.—Three different types of items 
were used in these experiments: single digits, 
paired digits, and nonsense sounds. The 
single digits consisted of the numbers 0 
(pronounced “oh”) through 9, with the 
critical digit at Positions 2 through 6 from 
the end of the list (counting the probe). 

In the paired-digit list, the same format as 
the single digits was used except that each 
of the 15 items in a list consisted of two 
numbers read as a pair without any pause 
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Fic. 5. Relative frequency of correct and false recognition for the three types of stimuli. 
(The recall data from the one per second auditory and visual probe digit experiments are 


included for comparison purposes.) 


between the two digits. The two digits 
of each pair, however, were pronounced indi- 
vidually. The probe was also a pair of 
digits. The critical items were in positions 
two through six items from the end of the 
list (counting the probe) so that there were 
4, 6, 8, 10, and 12 digits intervening between 
presentation of the critical pair and its test. 

The nonsense sounds also used the same 
format as the single digits except that each 
item was a digit played backwards through 
the tape recorder. This created a list made 
of 10 distinctive sounds. There was no 
trouble in distinguishing the sounds of one 
backwards digit from any other and the 
original source of the sounds was not iden- 
tifiable by Ss. In all conditions each tape 
had 108 lists: five test positions, with 10 
trials at each position, 50 trials in which the 
probe had not occurred on the previous list, 
and eight practice lists. Each S listened to 
each tape 10 times, but never twice on the 
same day. 


Results 


The forgetting curves for the three 
recognition conditions and the average 
of the visual and auditory 1/sec curves 
from the recall experiments are shown 
in Fig. 5. Each point of the correct 
recognition data comes from 100 obser- 
vations and the false-recognition rates 
have 500 observations each, 


Operating characteristics were ob- 
tained for each list position tested: a 
total of 30 curves for the two Ss. The 
operating characteristics were all nearly 
symmetrical about the line y = —4 on 
linear graph paper and straight lines 
with slopes of 1 on normal-normal 
probability paper. This last result 
means that the memory-strength distri- 
butions for old and new items can be 
approximated by normal distributions 
with equal variances. ^ Recognition 
data have not been averaged across Ss 
because both Fig. 5 and the operating 
Characteristics. show large individual 
differences which were not observed in 
the recall studies. 

To analyze these data further it is 
necessary to transform the probability 
measures into strengths. This can be 
done from the location of the operating 
characteristics on probability paper; 
the sum of the x and y coordinates of 
the intersection of the curves with the 
line y = —x gives the separation of the 
two distributions in standard-deviation 
units (Swets, Tanner, & Birdsall, 
1961). These strengths are plotted in 
Fig. 6 along with d(i) values esti- 


- Equation 2. 
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“mated. for the average of the 1/sec 


auditory and visual recall data, using 
Some operating charac: 
teristics were too high to give statis- 
tically reliable estimates for d(i) and 
were not included. 

Let us assume, as we did before, 
that the strength measures of Fig. 6 are 
a result of two aspects of the memory 
process: the initial level of strength 
at the time of acquisition and the re- 
duction in that strength caused by the 
forgetting process. Estimates of slope 
and initial value for Equation 2 are 
shown in Table 2. The level of each 
curve indicates the strength acquired in 
acquisition and the slope indicates the 
rate of forgetting. Estimates of the 
acquisition and retention parameters of 
Equation 2 for these data were found 
by determining the best linear regres- 
sion line to the logarithms of the 
strength values. ‘The acquisition and 
retention parameters are the anti- 
logarithms of the intercept and slope 
coefficients for the regression lines. 
These estimations are presented in 
Table 2 along with the 95% confidence 
limits on the retention parameters. 
Some of the confidence limits on the re- 
tention parameters are rather large be- 
cause of the limited number of strength 
estimates available for some of the rec- 
ognition conditions. An F test of the 
differences in the regression-line slopes 
(calling each recognition curve of Fig. 
6 a condition, ignoring the noninde- 
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the data of Fig. 5. 


TABLE 2 


ACQUISITION AND RETENTION PARAMETERS 
IN RECOGNITION 


95% 
Cond. SN con Ded ders 
on ġ 
Single Digits S2 | 13.4] .77 | .66-.90 
S4 | 11.5} .80 | .71-.89 
Double Digits S2 | 9.6] .78 | .61-1.00 
S4 | 7.8) .87 | .82-.91 
Backwards Digits] S2 | 7.7] .71 | .55-.93 
S4 | 7.2] .73 | .65-.82 


pendence of Ss) indicates that the 
slopes do not differ significantly, F (5, 
19) = 1.92, p> .05. The acquisition 
parameters vary considerably with the 
type of material and S. 


Discussion 


The results from the recognition and 
recall experiments suggest that the initial 
level of strength of the memory trace 
depends upon the type of material and its 
rate of presentation but that the rate of 
forgetting is relatively independent of 
these factors. (A third factor, biases 
introduced in the decision process, must 
also be considered in recognition experi- 
ments, but the operating characteristics 
allow us to separate these biases from 
memory strength.) 

The retention parameters estimated 
from the recognition data are very simi- 
lar in value to the parameters estimated 
in the recall experiments. It is difficult 
to decide statistically whether they differ 
significantly: the two sets of data have 
undergone complex and different trans- 
formations; one S in the recognition ex- 
periments also took part in the recall 
experiments ; different test positions were 
used with varying numbers of observa- 
tions. But regardless of the statistical 
issues, there is little question that the 
strength analysis indicates the decay rates 
to be very similar in both types of ex- 
periments—much more similar than the 
probability data of Fig. 5 would indicate. 
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The single-digits recognition experi- 
ment is almost the exact analog of the 
recall experiments. If the recall theory 
of Equations 1 and 2 were correct, 
strengths from recall ought to agree 
closely with those from single-digit rec- 
ognition. They do not. In Fig. 6 (and 
Tables 1 and 2) recognition strengths 
are generally above recall strengths. 
Thus, although the curves are similar 
in rate of decay, they differ in height. 
If the strengths measured in the recog- 
nition experiment reflect different types 
of associations from those measured in 
recall, it is not surprising that the acqui- 
sition absolute level of strengths differ. 
It is encouraging to find that strength 
values for both types of experiments are 
reasonably well described by Equation 2 
and that the estimates for rate of decay 
appear to be similar. This implies that 
although the details of the relation be- 
tween recall and recognition are wrong 
the principle may be correct. The tenta- 
tive conclusion from this analysis is that 
the rate of forgetting is the same in both 
recall and recognition. This conclusion 
requires that modifications of the recall- 
recognition relationship will change only 
the height of the recall strengths, not the 
slope of the function. 

One assumption made in applying the 
model is that all the strength distributions 
except the correct one have a mean 
strength of zero. Were this true, all 
errors in recall would be distributed 
equally among the possible alternatives, 
The error analysis of the recall experi- 
ments shows this is not the case. If this 
assumption were relaxed we would, in 
fact, have gotten higher strength values 
from the recall probabilities, Unfortu- 
nately, it is not easy to carry out this 
analysis because the published tables can- 
not be used for this purpose and the 


numerical calculations required are very 
difficult. 


GENERAL Discussion 


In this paper, performance in short- 
term memory experiments has been at- 
tributed to interactions among three 
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different processes: (a) Acquisition 
which determines the initial strength of 
an item in memory; (5) Retention which 
describes the rate that items are forgotten 
from memory or, equivalently, describes 
the decrease in strength value of the 
memory trace; (c) Decision which de- 
scribes the strategy and biases used by S 
in determining his response from memory 
traces. 

Rate of presentation and length of list 
in the recall experiments of this paper 
seems mainly to effect the initial acqui- 
sition of items into memory: the rate of 
forgetting depends only upon the number 
of items presented between the critical 
item and its test. Similar results are 
found in the recognition studies, Rate 
of forgetting is similar among the vari- 
ous types of stimuli tested, but the initial 
strength values are quite different. 

Although recall and recognition ought 
to be directly related, the model studied 
in this paper fails to predict recognition 
strengths from recall probabilities. The 
model does suggest, however, that the 
rate of forgetting is the same in both 
types of experiments. 
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RUN STRUCTURE AND PROBABILITY LEARNING: 
DISPROOF OF RESTLE’S MODEL 


FRANK RESTLE ! 
Indiana University 


In 2 experiments, Ss predicted on each trial which of 2 lights would come 
on. The sequences were constructed of only 2 run lengths, the runs 


occurring in unpredictable order. 


length 2 and 5. After 2 ina row, 
beof20r5. Restle's theory says 
perseverate response) more often th 
quency of 5's, and the quantitative 
In Experiment II, 4 groups had se 


In the 1st experiment, runs were of 
S must predict whether the run will 
that Ss will guess a run of 5 (and 
an is justified by the objective fre- 


predictions were close to the data. 


quences, each consisting of only 2 


run lengths: (1,4), (2,3), (2,8), and (6,9). Again, all groups predicted 
the longer run more often than it occurred inthesequence. However, 
in Restle's theory the discrepancy should be a function of the ratio of 


the 2 run lengths, and in the data, 


the observed discrepancy depended 


on the sum of the run lengths, that is, the overall conditional probability 
that an event would follow itself in the sequence. 


Goodnow and Pettigrew (1955) 
and Nicks (1959) among others have 
hypothesized that Ss notice and use 
the distribution of run lengths in 
probability-learning tasks. Any se- 
quence of binary events, such as 
blue (6) and yellow (y), can be char- 
acterized by the sequence of runs; 
for example, the sequence yybbbbyyybb 
*** has the run sequence, 2-5-3-2. If 
long runs are relatively frequent, the 
Sequence shows high probabilities 
P(b|b) and P(y|y),2 whereas if short 


‘This research was supported in part by 
Research Grant GB-2848 from the National 
Science Foundation. Data were collected by 
Kenneth Hillner and James Jolly, and ana- 
lyzed with the aid of Frances Davis. The 
writer is indebted to James Greeno for many 
valuable suggestions. 

? The expression P(B|b) should be read, 
"the probability of response B after a single 
event b,” that is, after segment ...yb. Simi- 
larly, P(B |bbbb) is the probability of B after 
the segment ybbbb, the order of events being 
written from left to right. The usual first- 
order conditional probability is written 
P(B|b), the probability of B after a run of 
atleast one b. On critical trials at which the 
next event is not Predictable, S may choose 
to predict the same event he has most re- 
cently seen or the opposite. The probability 


runs are more frequent, the sequence 
tends to alternate. Run-length 
models of binary prediction say that 
S learns his response (B or Y), not 
to just any sequence of past events, 
but only to the longest sequence of 
homogeneous events, i.e., the length 
of run. 

One possible theory, devised by 
J. G. Greeno? considers the run 
length to be a stimulus. Whenever 
Event y follows a run of say yyy, 
then the connection of Response Y 
to the stimulus of a run of 3 y's in- 
creases. Response Y may also be- 
come conditioned to the shorter runs 
of y. Whenever Event y follows yyy 
then the stimuli y and yy as well as 
yyy are available to be conditioned 
to Response Y. Hence, in the se- 
quence yyy, Stimulus y will be avail- 
able to conditioned three times, yy 
twice, and yyy once. 


that the next event actually is the same as 
the most recent one, is called P(stay). In 
the usual, bulkier notation (Anderson, 1960; 
Engler, 1958), y and b would correspond to 
Ei and Es, and Y and B would correspond 
to A, and A». 

5 Personal communication, December 1964. 
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The writer (Restle, 1961) has 
offered the theory that all past trials 
are retained but cannot easily be 
recovered, 


Imagine a subject at some late point in a 
guessing experiment having just experi- 
enced & blue cards in a row, and deciding 
whether to respond B or Y. Suppose that 
he compares his present situation, say ybbbb 
(k = 4), with previous runs. Any runs 
shorter than four, such as yby, ybby, and 
ybbby, are irrelevant to his decision. A 
previous run of exactly four, ybbbby, indi- 
cates he should switch to response Y, hav- 
ing already used up the four bs. A previ- 
ous run of length 5 or more (ybbbbby, 
ybbbbbby, ...) indicates that he should 
make response B, for there are more bs to 
come [Restle, 1961, pp. 113-114]. 


Restle supposed that S enters his 
memory at some random event, and 
then is able to recall the length of the 
run about that point. Long runs 
leave relatively large targets for the 
random memory search and are more 
likely to be remembered than short 
runs. One way of expressing the 
probability P(b|bbb), for example, is 
the number of runs of length 4 or 
greater, divided by the number of 
runs of length 3 or greater. Each 
run counts once. According to 
Restle’s theory, longer runs count 
more im memory, hence Ss respond 
as if they expect longer runs than 
occur; there is a perseverative bias. 

Empirical studies have shown that 
Ss can use past events as cues to 
present responses. For example, 
Engler (1958) and Anderson (1960) 
had college students attempt to pre- 
dict binary events having first-order 
probabilities, P(b|b), different from 
the marginal overall probability of 
the event P(b). Such sequences, 
random except for the first-order 
constraint one trial back, lead to 
predictions reflecting the contingency; 
that is, P(B|b) is strongly affected 
by P(b|b). This shows that Ss are 
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using past events. Other relevant 
evidence is that if the events occur 
in periodic sequences, hence are per- 
fectly predictable (a sort of temporal 
maze for humans), college students 
master sequences at least of length 
12, perhaps much longer (Keller, 
1963; Restle, in press). 

Anderson and Engler both found 
that Ss did not match overall first- 
order conditional probabilities; con- 
sistently, P(B|b) is higher than 
P(b|b). Descriptively, Ss act as if 
they expect runs longer than are 
found. This is explained by Restle's 
theory on the idea that longer runs 
have larger or more probable repre- 
sentations in memory. 

Greeno's idea also leads to a high 
probability of choosing B after bb; in 
his case because the third 5 and all 
following b's are opportunities to 
condition the response B to the stimu- 
lus bb, whereas if y occurs it is the 
only opportunity to condition 
Response Y. 

A third general possibility is that 
S responds partly to the past situa- 
tion, as a cue, and partly either by 
guessing or generalizing between run 
lengths. For example, one may im- 
agine an S responding on the basis of 
the current run (if he remembers it), 
or on the basis of just the previous 
event (if he remembers it), or on some 
general basis, if he does not remember 
even the past event. In simple ran- 
dom sequences, as used in conven- 
tional probability-learning experi- 
ments, it is extremely difficult to 
separate these three possibilities. 
The reason is that after, for example, 
three bs in a row, 


P(b|bbd) = P(b|b) = P()) = 7. [1] 


That is, all of these cues lead to pre- 
cisely the same conditional probabil- 
ity of b. If S responds in the same 
way to different kinds of probabilistic 
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cues, we cannot separate the possibi- 
lities. 

In sequences taken from Markov 
generators with a fixed P(b|5), where 
P(b|b) differs from P(»), it is possible 
to separate the responses without 
memory from responses with mem- 
ory of the past event. However, for 
all runs of r b's, 


P(b|r b’s) = P(b|b) [2] 
hence it is not possible to tell whether 
Sis using run cues. Of course, S may 
respond differently to different length 
runs although they receive the same 
reinforcement, but the fact that such 
responses are not controlled. by E 
makes their interpretation hazy. 

A simple way to distinguish whether 
an S used run cues or merely responds 
to the previous event, is to use se- 
quences having extremely "'nonran- 
dom" distributions of run lengths. 
This paper reports the results of two 
such experiments. 


EXPERIMENT I 


In this experiment each S had a se- 
quence of events like bb yyyyy bbbbb 
yy bb yyyyy bb yyyyy---, in which the 
two events always occur either as a 
run of 2 or a run of 5, and no other 
run. The two events, b and y, were 
treated just alike. 

If S can master the sequence, then 
after one b (or y) he should always 
predict B (or Y), after bbb, or bbbb, 
he should always predict B, and after 
bbbbb he should predict Y. After bb 
S has a “critical trial” in which he has 
à probabilistic choice. The objective 
probability that the event will "stay," 
that is, P(b|bb) = P(y|yy), was 
varied in two groups. For one group, 
P(b|bb) — .500, and for the other 
group, P(b|bb) = .286. 

By Restle's theory, the first group 
with P(b|bb) = 3, has a probability 
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2/7 of selecting from memory a run 
of length 2, and a probability 5/7 of 
selecting a run of length 5. Hence, 
P(follow) — 5/7 — .714 is predicted. 
TThe second group has enough more 
runs of length 2 to make up for the 
greater size of runs of length 5, and 
the theory predictions P(follow) 
= .50. Notice that the predictions 
depend upon the ratio of 5/5 + 2. 

By the Greeno model, P(follow) is 
greater than 3, in the first condition, 
because of the fact that there are 
three opportunities to Cond. B and 
only one to Cond. Y. Overall, the 
probability should be given by the 
equilibrium of two operators. Con- 
sidering a linear model, whenever the 
run is of length 2 (and letting 
P[follow] = p), 


Q2(b) = p(1 — 0) 
and when the run is of length 5, 
Qs(6) = 1 — (1 — p)(1 — 0r 


These are both linear operators. If 
the two operators are applied with 
probabilities m and 1 — r, 


Q(») =p (1—0) 
+(1—n)[1—(1—p)(1-0)*] [3] 
the mean operator. It can be shown 
that in the case of linear operators, 
OP) =p(1—6) 
*ü-sp--(-e*] [4] 
a recursion on the means. At asymp- 
tote, Q(p) = p, whence 
p=1 
as TO 
1—r(1—6)— (1—7) (1—90): 


[5] 


This theory can predict values quite 
near those predicted by the Restle 
theory. For example, with r = .500, 
and 8 = .20, the Greeno theory pre- 
dicts = .710, very close to the 
Restle-theory prediction of .714. 


y 
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When Qs (a run of 2) is applied 
with probability t = .714, and 6 = 
.20, the predicted p = .490, very 
close to the prediction from the Restle 
theory of .500. 


Method 


Subjects.—Eighty students from introduc- 
tory psychology courses at Indiana Univer- 
sity served as part of course requirements. 
Most were run in platoons of four, all work- 
ing on the same problem at the same time. 
Assignment of platoons to the two conditions 
was in a prearranged erratic order. 

Apparalus.—Four Ss sat side by side at a 
long table, separated by plywood dividers. 
A gray metal box was before each, with a 
centered amber ready light, two green event 
lights, and two red response buttons. Events 
were presented and responses recorded auto- 
matically by a system of timers and relays 
and an IBM summary card punch. 

Procedure.—The Ss were told to predict 
which green event light would come on next, 
by pressing the corresponding button. Time 
to respond would be indicated by the amber 
ready light. The sequencing was automatic; 
1 sec. after the last of the four Ss responded, 
the event light for all would go on for 1 sec. 
One second later, the ready light would come 
on, permitting a new response. Pushing a 
button doused the ready light for that booth. 
The response interval was group paced, the 
interval being the longest of the four la- 
tencies plus 1 sec. This procedure permits Ss 
to work rapidly without time pressure, yet 
(because of obvious social pressures) en- 
courages relatively rapid, steady responding. 
All groups responded steadily with mean 
maximum latencies of about 1 sec. 

Experimental design.—Group I, of 40 Ss, 
had a sequence made up of 100 runs of 5 
and 100 runs of 2. The runs of 5 and 2 
occurred on left and right (Events b and y) 
equally often and were in "random order"— 
more specifically, arranged by hand in a 
sequence of 2s and 5s having statistical prop- 
erties of the expectation of a random se- 
quence. Each platoon made 700 predictions 
or as many as could be made in an experi- 
mental period of about 35 min. 

Group II, also of 40 Ss, had a sequence 
made up of 175 runs of 2 and 70 runs of 5, 
arranged in the same sort of “random order.” 


Results 


The proportion of predictions of 
the most recent event, P (follow) were 
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Fic. 1. Proportion of following responses 
on indeterminate trials (after bb or yy) in 
Exp. I. 


predicted by the Restle theory to be 
at .714 for Group I and .500 for 
Group II. The conditional event 
probabilities P(stay) (for an S who 
might match conditional probabilities) 
were .500 and .286, respectively. 

The learning curve of P(follow) for 
Group I, beginning after Trial 200, is 
very close to the prediction from 
Restle's theory. The curve of Group 
II begins above and ends below the 
prediction from Restle's theory, but 
does not, in the 600 trials available 
for all Ss, approach the probability- 
matching value of .286. These results 
are shown in Fig. 1. 

Learning on the other locations 
ie, P(B|b), P(B|bbb), P(B|bbbb) 
and P(B|bbbbb), was rapid but never 
stabilized at a high level. See Fig. 2. 

Notice also the consistent ordering; 
Ss were most successful in making 
the second response, and P(B|b) is 
greater than .90. The next most accu- 
rate discrimination was P(B |bbb), 
then P(B |bbbb), which took values of 
about .88 and .82, respectively. 
However, the shift after five-in-a-row 
was frequently missed; the probabil- 
ity of the correct response, y, after 
bbbbb, never became higher than 


386 FRANK RESTLE 


o 


9 i 
ML <5 2 
eee 
Th oo A 
E ^ 
s} P(Bibbbb) 
ap PlBlbbbbb) 
3 
2 
a 


o 10 200 300 400 500 600 
TRIALS (IN BLOCKS OF I00) 


PROPORTION OF PREDICTIONS OF MOST RECENT EVENT 


Fic. 2. Proportion of following responses 
on determinate trials in Exp. I. (Perfect 
discrimination would put the top three curves 
at 1.0, the bottom curve at 0.) 


about .70. Also notice that the most 
accurate performance occurred from 
Trials 200—500, and there is a general 
slight drop in accuracy during Trials 
300-600. 


Discussion 


Frequencies of choice on the critical 
trials, after bb, are remarkably in ac- 
cord with the Restle model, also with 
Greeno's hypothesis with 0 = 20.5 How- 
ever, the shaky performance by many 
Ss at other points in the sequence, at 
which the next event could be predicted 
with perfect accuracy, was disquieting. 
Both the Restle and Greeno models say 
that S is responding accurately to run 
lengths, and yet Ss reported were mak- 
ing errors, and particularly were show- 
ing a relatively high P(B|bbbbb), though 
the sixth event in a row they were pre- 
dicting never happened. The Ss might 
have been showing a tendency to per- 
severate responses. In the overall se- 
quence, the proportion of times b follows 
b is given by the following formula; 
letting 7 be the run lengths, and E(r) 
be the mean run length, 


Pls) =1 Me [6] 


With runs of length 2 and 5 equally 
likely, mean run length is 3.5, and 
1 


dm i-:.286 
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= 714. 


With the run of length 2 having prob- 
ability 5/7 as in Group II, E(r) = (5/7) 
X 2+ (2/7) X 5 = 20/7, whence 
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These values are not very far from the 
observed proportions after 600 trials of 
training. 

This calculation shows that Ss might 
respond to run lengths when run lengths 
are reliable cues, but when faced with 
a cue like bb, after which both events 
have occurred, might fall back on simple 
first-order conditional probabilities. 

Recall that Ss did far better than 
chance on trials after b, bbb, bbbb, and 
bbbbb, that is, on trials in which the next 
event could be predicted perfectly. This 
is evidence that they are doing some- 
thing at least as complicated as using 
run lengths as cues. However, per- 
formance on the indeterminate trials in 
Exp. I can be accounted for by three 
different theories. Therefore, a differ- 
ential test of the three hypotheses was 
sought. 


P(b|b) = = .350 


EXPERIMENT II 


This experiment used four se- 
quences, each treating the two events 
alike and each using exactly two run 
lengths. In this context, Exp. I 
used only the runs 2 and 5, and might 
be thought of as Group (2,5). The 
second experiment uses Groups (1,4); 
(2,3), (2,8), and (6,9). 

Notice that according to Restle’s 
theory, the probability P(follow) on 
a critical trial, in such a sequence, 
depends upon the ratio of the two 
run lengths, for it is the relative 
length of the two runs that deter- 
mines the probability of hitting a 
longer or shorter run upon the ran- 
dom entry into memory. 
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According to Greeno’s model, the 
probability of continuing at the cri- 
tical trial depends upon the difference 
between the two run lengths, for the 
conditioning operator is applied on 
all trials of a run after the critical 
trial. 

If P(follow) depends upon the 
overall probability P(b|b), which by 
Equation 6, depends upon E(r), a 
third theoretical possibility arises. 
In the experiments reported, the two 
run lengths each have probability 
a = 4, whence E(r) depends solely 
upon the swm of the two run lengths. 

'This gives us three theories, pre- 
dicting that the probability of follow- 
ing the most recent event on the criti- 
cal trial will depend upon the ratio, 
difference, or sum of the two run 
lengths. The pairs of numbers, (1,4), 
(2,3), (2,8), and (6,9) are usefully 
arranged to test the three hypotheses. 
If the Restle ratio theory holds, then 
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Fic. 3. Proportion of following responses 
on indeterminate trials in Exp. II. (Panel A 
shows that groups with the same run ratio 
do not yield the same response probability. 
Panel B shows that groups with the same 
run difference do not yield the same response 
probability. Panel C suggests that response 
probabilities fall between P(b |b) and .500.) 
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Fic. 4. Proportion of following responses, 
on all trials, as a function of length of runs, 
in Exp. II. 


Groups (1,4) and (2,8) will perform 
alike, as will (2,3) and (6,9). If the 
Greeno difference hypothesis holds, 
then (1,4) and (6,9) will be alike, 
(2,3) showing a lower probability of 
following, and (2,8) a higher. If, 
finally, the probability of continuing 
depends upon the overall P(b|D), 
hence on the sum of the two run 
lengths, then (1,4) and (2,3) will be 
alike, (2,8) will be higher, and (6,9) 
highest yet in P(follow). 


Method 

The four groups, (1,4), (2,3), (2,8), and 
(6,9), had a total of 22, 16, 19, and 17 Ss, 
respectively. The Ss were drawn as in Exp. I, 
but in a different semester and, in part, from 
an elementary laboratory course as well as 
introductory psychology. Apparatus and 
procedure were exactly as in Exp. I. 


Results 


Figure 3 shows that the probability 
of following on critical trials depends 
on the overall P(b|b), whereas the 
Restle ratio hypothesis and the 
Greeno difference hypotheses appear 
wrong. 

Figure 4 gives a picture of discrim- 
ination of run lengths. Note the 
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relatively poor accuracy on points 
that can be learned, and the high 
frequency of overrunning the longest 
run length. 

It might be thought that the run- 
length theories would fit the data, 
except that the data are contaminated 
with early trials and with the per- 
formance of Ss who for some reason 
do not discriminate run lengths. To 
test this possibility, although imper- 
fectly, parts of the data were selected. 
Each S’s data were divided into 100- 
trial blocks. Any 100-trial block in 
which S performed above 75% cor- 
rect on all points other than indeter- 
minate trials, was classified as reach- 
ing the criterion of run discrimination. 
This criterion is more severe for the 
(6,9) group than the (1,4) or (2,3) 
groups, because the (6,9) group must 
pass more points and has fewer, more 
widely spaced trials on them. How- 
ever, the method gives a reasonably 
homogeneous class of data consisting 
of very accurate performance pep- 
pered with what appear to be careless, 
occasional ^ errors. Approximately 
half the data passed the run-discrim- 
ination criterion. 

The proportion of predictions of 
the most recent event on critical 
trials, for selected data, lies closer to 
the objective value of 3 than do un- 
selected data. Otherwise, the pat- 
tern of results is the same for selected 
and unselected data; P(follow) is, 
for Group (2,3), .554; for (1,4), .548; 
for (2,8), .584; and for (6,9), .617. 
If plotted, these values give the same 
general picture as Fig. 3. 


Discussion 


The fact under investigation here is 
that when faced with a probabilistic 
choice, Ss tend to predict most recent 
event more than they should. That is, 
P(follow) is greater than P(stay). Al 
sequences of both experiments showed 
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this bias; in Exp. II, on critical trials 
the probability of following is above .50 
for all groups, though the objective 
probability that the same event would 
recur on the critical trial was .50. 

Three general explanations were under 
consideration.  Restle's hypothesis was 
that S searches his memory for past 
events, and long runs have higher prob- 
abilities of retrieval than shorter runs. 
This theory leads to the conclusion that 
in the 2-run experiments performed, 
P(follow) on critical trials should de- 
pend upon the ratio of the two run 
lengths. Figure 3A disposes of this 
claim, for sequences with the same ratio 
of run lengths, e.g., (2,3), and (6,9), 
also (1,4) and (2,8), lead to very differ- 
ent probabilities of following. 

Greeno's hypothesis was that on con- 
tinuations of the run after the critical 
trial S may condition responses B to 
shorter run lengths, and in particular 
to the critical run length. This hy- 
pothesis led to the conclusion that 
P(follow) would depend upon the num- 
ber of such trials, i.e., on the difference 
between the two run lengths. Figure 3B 
disposes of this claim, for the two se- 
quences having the same difference of 
run lengths, (1,4) and (6,9), are, respec- 
tively, the lowest and highest in the 
probability of staying. 

The third hypothesis is that on criti- 
cal trials, S may either discriminate the 
run length and match the probability of 
response to the conditional probability 
of the event, or (presumably if he has 
lost count) may match his response to 
the average first-order response proba- 
bility, having P(follow) — P(b|b). Fig- 
ure 3C shows that the data are in perfect 
accord with this hypothesis. 

Unfortunately, this hypothesis does 
not explain the fact we began with, 
namely, that when events are generated 
by a first-order Markov generator, S 
tends to follow events more often than 
he should, more precisely, P(B|b) is 
greater than P(b|b). A good first-order 
Markov generator has P(b|b) = P(b|bb) 
= P(b|bbb) —..., and all trials are 
critical. The hypothesis that P (follow) 
= P(b|b) predicts perfect, not biased, 
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matching of the first-order conditional 
probability. 

A possible explanation is suggested by 
the pattern of errors made by Ss on 
trials where perfect performance is pos- 
sible. At the end of his longest run S 
may err by anticipation (shifting re- 
sponse before the run is over) or by per- 
severation (staying with the same re- 
sponse after the run is over). Of these, 
perseveration is the more common error. 
In Exp. I, there were .21 errors of an- 
ticipation, and .42 errors of persevera- 
tion. In Exp. II there were .17 antici- 
pation to .26 perseveration in Group 
(1,4), .18—43 in Group (8,2), and .26- 
.41 in Group (6,9). This bias cannot be 
attributed to the influence of longer runs 
of events. Figure 2 suggests that the 
effect neither increases nor decreases 
much in the course of 600 trials, Errors 
become more frequent after 500 trials 
but the ratio of perseverative to antici- 
patory errors does not change markedly. 
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CONFLICT AS A FUNCTION OF FOOD-DEPRIVATION TIME 
DURING APPROACH TRAINING, AVOIDANCE TRAINING, 
AND CONFLICT TESTS * 


JUDSON S. BROWN; D. CHRIS ANDERSON,’ ann CONRAD S. BROWN * 
University of Oregon Medical School 


"The effects of severity of food deprivation on behavior in an approach- 
avoidance conflict situation were studied in 96 rats by a 2X2X3 
factorial design involving 1 and 44 hr. of deprivation during approach 
training and during shock-induced avoidance trials, and 1, 14, and 44 
hr. of privation during conflict tests. Degree of hunger during 
approach training affected performance at that time but not on conflict 
tests due to its interactions with avoidance-training deprivation and 
testing-time deprivation. These latter 2 variables emerged as 
significant sources of variance. Ss shocked when hungry were less 
deterred from approaching the bivalent goal than Ss shocked when 
not very hungry. Hunger at testing time greatly facilitated 
approach. The results were related to several theories dealing with 
motivational variables in conflict situations and the outlines of a 


graduated-drive theory were presented. 


One of the major problems facing 
the student of conflict behavior is that 
of how organisms’ reactions to an ambi- 
valent situation are affected by vari- 
ations in the level of one or another 
motivational variable. Relevant em- 
pirical data (Miller, 1959), though 
sparce, indicate that increases in 
hunger tend to drive an organism 
closer to a bivalent goal at which both 
food and shock have been administered 
and that increases in severity of pun- 
ishment tend to augment the repellant 
power of that goal. 

Attempts to explain behavioral ef- 
fects of this kind (e.g., Brown, 1942; 
Hull, 1952; Miller, 1944) have usu- 
ally involved the assumptions that 
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gradients in the strength of approach 
tendencies are established by food re- 
wards, that punishment sets up gradi- 
ents in the strength of avoidance 
tendencies—these being generally the 
steeper of the two kinds—and that 
behavior in space is determined by the 
joint influence of these incompatible 
tendencies, Changes in a motivational 
variable such as time of deprivation 
have been thought to exert a differ- 
ential effect upon the two tendencies, 
altering the positive but not the nega- 
tive. Thus an increase in hunger has 
been regarded as elevating the ap- 
proach gradient relative to the avoid- 
ance gradient, thereby shifting the in- 
tersection toward the goal and leading 
S to approach it more closely. It has 
also been assumed that variations in 
punishment affect the avoidance tend- 
ency without altering approach 
strength. 

This basic supposition, that a source 
of drive relevant to a particular re- 
active tendency affects that tendency 
alone, will be described hereafter as 
the selective-drive assumption. While 
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CONFLICT AND DEPRIVATION TIME 


this somewhat ad hoc notion fits the 
available data, the expectations it yields 
differ from those provided by Hull’s 
(1943) multiplicative drive (D X H) 
assumption and from the predictions 
of driveless theories (Brown, 1961; 
Estes, 1958) that rely upon associ- 
ative and generalized tendencies be- 
tween one or both responses and the 
internal stimuli produced by the op- 
erations of feeding and fasting. For 
instance, Hull’s view generates the 
conclusion that a deprivation-induced 
increase in general drive (D), subse- 
quent to the acquisition of approach 
and avoidance reaction tendencies, in- 
creases both tendencies equally through 
its nonselective multiplicative effect. 
Therefore, the locus of conflict, where 
the two gradients cross and are of 
equal strength, would not be shifted 
toward the goal, and closer approach to 
the goal would not be predicted. 

Unless bolstered by special-purpose 
postulates, the purely associative views 
also fail to provide predictions con- 
sonant with empirically based expecta- 
tions. The unembroidered principle of 
stimulus generalization, for example, 
leads to the deduction that, if two 
incompatible responses have become 
conditioned to moderately intense, dep- 
rivation-produced stimuli, either an 
increase or a decrease in deprivation 
will weaken both tendencies. This 
holds as long as the generalization 
gradients for the two tendencies are 
alike and are symmetrical about the 
conditioning point. To derive the 
phenomenon of closer goal approach 
with increased deprivation, and its con- 
verse, the gradient of generalized ap- 
proach must be different from the 
gradient of avoidance, and one, per- 
haps both, must have different forms 
depending on whether the reactive 
tendency is generalizing downward or 
upward along the deprivation-stimu- 
lus-intensity dimension. 
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Champion’s (1961) suggestion that 
Hull’s (1952) stimulus dynamism 
mechanism (7) might provide an ac- 
ceptable solution also encounters diffi- 
culties, since the properties of V are 
exceptionally vague (cf. Koch, 1954) 
and since it sometimes appears to 
contribute to general drive (D). As 
we have seen, D, regardless of its 
source, cannot account for the diferen- 
tial effects an empirical variable may 
exert on competitive response tend- 
encies. 

We have spoken above as though the 
empirical phenomena of conflict be- 
havior are well enough established to 
constrain new flights of theory and 
sieve the old. It is because this is in 
fact not the case, of course, that the 
present study was conducted. In it, 
two large groups of rats were trained 
to approach one end of a straight alley 
under 1 and 44 hr. of food deprivation, 
respectively. Half the members of each 
group were then given avoidance 
training under 1 and 44 hr. of depriva- 
tion, followed by conflict tests for sepa- 
rate subgroups under 1, 14, or 44 hr. 
of deprivation, 


METHOD 


Subjects —The Ss were 96 naive, male 
albino rats of the Sprague-Dawley strain 
purchased locally. They were 80-93 days 
old when training was initiated. 

Apparatus.—The major component of the 
apparatus, described in detail elsewhere 
(Brown & Hoopes, 1965), was a straight 
alley (72 in. long, 12 in. deep, and 3.5 in. 
wide, inside). It was painted a glossy 
white and was covered with hinged Plexi- 
glas panels. A 7.5-w., 117-v. lamp was 
mounted in a hole near the top of one end.of 
the alley so that only 3 of the bulb protruded 
into the interior of the runway. A glass 
furniture caster, which served as a food dish, 
was fastened to the center of the same end 
of the alley about 1 in. above the floor. The 
floor was made of s-in. stainless-steel rods 
mounted in a framework of aluminum and 
bakelite. "This framework, in turn, was sup- 
ported at its midpoint by a knife-edge ful- 
crum, being free to pivot relative to the 
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walls of the alley. Movements away from 
the center in either direction tilted the floor 
by an amount proportional to the rat's 
distance from the center. Displacements of 
the floor were transmitted to a strain gage 
and the resulting resistance changes, trans- 
formed to isomorphic voltage variations, 
were recorded by an ink-writing millivolt- 
meter. These devices provided a continuous 
indication of a rat's position in the 6-ft. 
alley which was accurate within +1 in. 
From the graphic records determinations 
could also be made of the point at which S 
first stopped approaching, of time consumed 
in reaching the food cup, and so forth. 

The runway was located in a windowless 
room that was dimly illuminated by a shaded 
7.5-w. work lamp on a nearby table. Under 
these conditions the lamp above the food dish 
produced a gradient of illumination ranging 
from approximately 14 ftc. at the bright 
end to .44 ftc. at the opposite end. 

The 60-cycle, ac shock administered during 
avoidance training was generated by a 
matched-impedance circuit (Campbell & 
Teghtsoonian, 1958). The open-circuit volt- 
age, measured with a high-impedance volt- 
meter, was maintained at 300 for all shocks. 
This voltage was applied to the grid through 
a series resistance of 150 K. On all trials 
shock duration was fixed at .25 sec. 

Experimental design—The design was a 
2X2 3 factorial, involving 1 and 44 hr. 
of food deprivation during approach training, 
1 and 44 hr. of deprivation during shock-in- 
duced avoidance trials, and 1, 14, and 44 hr. 
of deprivation during the major conflict tests. 
In addition, a second conflict test was con- 
ducted on one replication of S's after they had 
had free access to food and water for 10 
days following the major conflict test. 

Procedure,.—Upon receipt from the sup- 
plier Ss were transferred to group living 
cages and given food and water ad libitum 
for several days. They were then moved to 
individual cages and were placed on a 45-hr. 
deprivation schedule calling for 3 continuous 
hr. of feeding out of every 48. Despite the 
austerity of this regimen, Ss were maintained 
in satisfactory physical condition by enriching 
their drinking water with vitamins and by 
supplementing their diet with fresh cabbage 
leaves on alternate days. 

On the first day of training, Ss were 
habituated to the alley by being permitted to 
explore it in groups of four for 20 min. 
These habituation trials as well as all subse- 
quent approach-training trials were admin- 
istered either 1 or 44 hr. after the scheduled 
3-hr. feeding period, ie, at each group's 


J. S. BROWN, D. C. ANDERSON, AND C. S. BROWN 


nominal time of deprivation (Td). The 
data were gathered in two successively run, 
identical replications, each involving 48 Ss. 
Half the Ss of each replication were run on 
odd-numbered days and the remaining half 
on even-numbered days. At each scheduled 
running time 12 rats were run in a squad. 
All Ss were randomly assigned to experi- 
mental groups with the single restriction that 
each of the 12 groups be equally represented 
within each half replication. 

On the second day, a temporary barrier 
was installed in the alley 12 in. from the goal 
and each S was placed in the small com- 
partment thus formed and permitted to eat 
10, .045-gm. Noyes pellets. If S failed to 
consume these pellets within 5 min. it was 
removed and given a second trial 30 min. 
later. 

The third training day involved two ap- 
proach trials from a distance of 24 in., three 
from 36 in., three from 48 in., and two full- 
length trials from 72 in. On each of these 
10 trials, three Noyes pellets served as 
reward. The 12 Ss of a squad were run in 
sequence with an intertrial interval of about 
6 min. Ten full-length trials were run on 
each of the next 4 training days. By the 
end of the seventh day, therefore, all Ss 
had had 42 reinforced trials in traversing 
the entire length of the alley for food. On 
all full-length trials Ss were placed in the 
alley at the starting end facing away from 
the goal. 

The Ss were trained to avoid by placing 
them in the alley at the goal end, facing the 
empty food cup, where they received a .25- 
sec, 300-v. shock. Five such shock trials 
were given each S at an intertrial interval of 
about 5 min. For half the Ss shocks were 
administered under 1-hr. Td and for the 
other half under 44-hr. Td. Shock onset 
on each trial coincided with the closing of 
the alley lid immediately after $ was gently 
placed on the grid floor. 

On the major conflict test each S was put 
into the alley at the starting end, facing that 
end, and was permitted free access to the 
runway for a 5-min. period during which all 
movements were continuously recorded by 
the millivoltmeter. ^ During these tests, 
deprivation levels of 1, 14, and 44 hr. were 
used for the appropriate subgroups of 32 Ss 
each. For all Ss the tests were conducted 
at the same time of day at which the habitua- 
tion and approach-training trials had been 
given. In order to control conflict-testing 
time in this manner it was necessary to 
permit the interval between shock and the 
conflict tests to vary slightly. This interval 
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was 48 hr. for half the Ss (those trained 
to approach under 1 or 44 hr. and shocked 
under 44 hr.) and 43 hr. for the other half. 
It was assumed that with intervals of this 
magnitude a discrepancy of 5 hr. would not 
be significant. 
A second conflict test was given to half 
` the Ss, as an afterthought, 10 days after the 
first test. During the interval and at the 
time of the second test Ss were on an ad 
lib regimen. 


RESULTS 


Because of the complexity of the 
design, a method for readily identifying 
each of the 12 groups is required. 
Hereafter, each group will be desig- 
nated by a series of three numbers, 
the elements of which correspond to 
the deprivation times in effect during 
approach training, avoidance training, 
and conflict testing, respectively. Thus 
Group 1-44-14 denotes Ss trained to 
approach under l-hr. Td, shocked fol- 
lowing 44-hr. Td, and tested under 
14-hr. Td. 

Acquisition performance.—Clearcut 
differences in runway performance 
were observed during the initial ap- 
proach training under 1- and 44-hr. 
Td. The terminal mean running time 
of the 1-hr. Ss was 3.81 sec. whereas 
that of the 44-hr. Ss was 1.87 sec. The 
difference between these means, which 
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were based on scores over the last 5 
trials, was highly significant, £ (94) 
= 2.55, p < .02. 

First conflict-test performance —The 
graphic records of Ss’ behavior in the 
alley were read to determine (a) the 
distance from the starting point at 
which each S first stopped, (b) how 
far each S ran toward the ambivalent 
goal during the first 30 sec. of the 
conflict test, and (c) how much time 
elapsed until the empty food cup was 
reached. From conflict-theory prin- 
ciples (Brown, 1957; Miller, 1944) 
these behavioral indexes should reflect, 
more or less directly, the relative 
strengths of the approach and avoid- 
ance tendencies. 

The initial stopping point was de- 
fined as the point, measured from the 
start, at which forward motion ceased 
for a full 5 sec., or as the point at which 
S stopped for less than 5 sec., pro- 
vided the stop was folowed by a 
backward movement of at least 5 in. 
If S did not leave the starting position 
within 20 sec. then that point was 
taken as the first stopping point. Stop- 
ping points, averaged and combined 
across groups in several ways, are 
shown in the three panels of Fig. 1. 
Panel A presents mean stopping-point 
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Fic. 1. Average distance traversed from the starting end of the alley until the first 


definite stop was made. 


panels to exhibit the several interactions and main effects. 
not be interpreted as expressing the assumption that the 


join the data points should 
functions are linear.) 


(The same data are grouped in different ways in the three 


The straight lines used to 
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Scores as a function of approach-train- 
ing Td, with avoidance-training Td as 
the parameter. Three major conclu- 
sions are represented in this panel, each 
of which is supported by an overall 
analysis of variance. ^ Specifically, 
Td during avoidance-training trials 
emerged as a significant main effect, 
F (1, 84) = 9.93, p < .01, and its in- 
teraction with approach-training Td 
was also significant, F (1, 84) — 3.98, 
$«.05. However, training Td, as 
a main effect, was not (F< 1.0). 
The directions of the differences are 
evident in each case from the figure. 
The same data regrouped in Panel B 
Show that Td during tests interacted 
with approach Td, F (2, 84) = 3.37, 
P «.05, and that Ss approached 
closer to the goal as testing Td in- 
creased, F (1, 84) — 963, p< 01. 
Panel C shows that testing Td did not 
interact with avoidance-training Td 
(F < 1.0) since the three curves are 
essentially parallel. The significant 
main effects of avoidance-training Td 
and testing-time Td are also evident in 
this panel. 

The maximum - distance - forward 
scores paralleled the stopping-point data 
in nearly all respects. Avoidance Td 
was highly significant as a contributor 
to variance, F (1, 84) = 19.1, p 
<.005, as were testing Td, F (2, 84) 
= 10.53, p< .005, and the interac- 
tions of training Td with avoidance Td, 
F (1, 84) —498, »<.05, and of 
training Td with testing Td, F (2, 84) 
= 3.56, p< .05. Training Td was 
not significant (F < 1.0). 

The outcomes of the time-to-food- 
cup determinations were in most re- 
Spects identical with the findings ob- 
tained from the first two measures. 
Specifically, all of the effects, save one, 
that were significant in the case of the 
maximum-distance-forward measure 
were also significant in the time-to- 
food-cup analysis, The one exception 


was the interaction of training Td with 
testing Td which fell far short of cus- 
tomarily accepted levels. Graphic plots 
for the second and third measures 
closely resembled those of Fig. 1. 
Second conflict-test performance.— 
Ten days after the first conflict tests, 
Ss of the second replication, having 
been maintained on the ad lib diet dur- 
ing the interim, were given a second 
5-min. conflict test in the alley. This 
test was conducted after Ss of the 
first replication had been discarded, 
and was designed to determine whether 
treatment conditions effective at the 
time of the original training, or testing, 
or both, would be reflected in discern- 
ible residua even when all 5s were op- 
erating under semisatiated conditions. 
Measurements of maximum distance 
forward during the first 30 sec. of this 
test revealed that the original approach- 
training Td, which had not exerted a 
significant effect in the case of any 
measure during the first conflict test, 
was now highly significant as a source 
of variance, F (1, 36) = 8.79, p 
<.01. In this test, Ss formerly given 
approach training under l-hr. Td 
moved an average of 34.9 in. toward 
the goal, whereas Ss trained after 44 
hr. without food moved only 18.1 in. 
forward. The comparable differences 
obtained on the first test were 42.4 
and 397 in., respectively. Avoidance- 
training Td was also found to be sig- 
nificant, F (1, 36) — 5.36, p< .05, 
with Ss formerly shocked under long- 
term Td going much farther forward 
(mean = 33.0 in.) than Ss shocked un- 
der I-hr. Td (mean = 20.0 in). Pre- 
vious testing time seemed to affect per- 
formance since Ss tested under 44-hr. 
Td went less far forward on the second 
test than Ss given the first test under 
lhr. This effect, though substantial 
(the means for the 1- and 44-hr. groups 
were 344 and 179 in., respectively), 
fell short of significance when the data 
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for the entire 30-sec. period were an- 
alyzed. When only the first 15-sec. 
period was considered, the difference 
became significant, F (2, 36) = 3.28, 
p < .05, for all practical purposes. 
No other effects approached acceptable 
levels of significance. 

The time-to-food-cup data yielded 
comparable results. Those Ss pre- 
viously trained to approach under 1-hr. 
Td reached the food cup sooner than 
44-hr. trained Ss (not significantly so), 
and Ss shocked under 1-hr. Td took 
200.4 sec. to get to the cup while Ss 
punished when 44 hr. deprived took 
136.4 sec. to reach the cup, F (1, 36) 
= 4.71, p< .05. The stopping-point 
data conformed to these outcomes in 
that the directions of the differences 
were the same though none of the 
sources of variance reached conven- 
tional levels of significance. 


Discussion 


While deprivation level during ap- 
proach training significantly affected per- 
formance at that time, with the 44-hr. 
group running faster than l-hr. Ss, its 
effects upon conflict performance proved 
to be specific to the Td in effect during 
avoidance training and conflict testing. 
That is, the main effect of approach Td 
in the overall analysis was not significant 
but the interactions of that variable with 
Td during shock administration and with 
Td during testing were significant. This 
means that the strength of the approach 
tendency was dependent, in part, upon 
the specific characteristics of the depri- 
vation-induced stimuli present during 
approach training. Thus Ss given both 
approach training and testing under iden- 
tical deprivation times exhibited stronger 
approach tendencies than those for whom 
disparate values were used. Similarly, 
the approach behavior of rats that were 
shocked and given approach training at 
the same Td tended to be weaker than 
that of Ss for whom the two deprivation 
values were different. The importance 
of deprivation-stimulus integrity is also 


395 


indicated by data obtained during the 
second test. On this test, Ss formerly 
trained under 1 hr. went significantly 
farther forward than the 44-hr. trained 
Ss. Ostensibly, the internal stimuli due 
to the ad lib conditions at the time of the 
second test were very like those present 
when the l-hr. Ss were given approach 
training. 

In general, therefore, these findings 
add credibility to the supposition that the 
internal stimulus accompaniments of the 
deprivation-time variable do indeed affect 
instrumental behavior. Nevertheless, the 
similarity of the internal cues on different 
occasions is probably a factor of only 
modest importance, since the interaction 
of approach Td with test Td was not 
significant in the case of the time-to-cup 
score and since none of the various inter- 
acting functions involving 1- and 44-hr. 
Td actually crossed. To be specific, the 
ordinal positions of the right-hand points 
of Panel A in Fig. 1 would have been 
inverted had the similarity of training- 
and shock-Td stimuli been of outstanding 
significance. A high degree of trans- 
situational deprivation-stimulus similarity 
would also have produced an inversion 
in the order of the three points at the 
left of Panel B, and comparable effects 
would have been apparent in the analo- 
gous functions derived from other meas- 
ures, Evidently, other factors, notably 
Td at test time, were capable of over- 
riding the similarity effects. 

One of the most consistent findings of 
the study was that rats shocked under 
l-hr. Td were less prone to approach the 
bivalent goal than those shocked when 
44 hr. deprived. Variations in punish- 
ment can hardly provide an explanation 
of this outcome since all $s were given 
the same number of shocks of equal 
intensity. Neither does it make much 
sense to suggest that the more recently 
fed Ss were more sensitive to shock than 
the hungrier Ss. It may be reasonable 
to suppose, however, that the 44-hr. Ss 
had a stronger tendency to seek food at 
the goal and hence did not run as far 
therefrom when shocked. If so, they 
might have acquired weaker avoidance 
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Fic. 2. A purely associative conflict model 
avoidance gradients after training under low-level deprivation. 
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representing hypothetical approach and 
(The conditioned reactions 


are shown as generalizing along both the internal deprivation-stimulus dimension—from left 
to right—and along the spatial stimulus dimension provided by the straight alley—from back 
to front. The two gradients originating at the testing point represent tendencies assumed 


to be present if a conflict test were carried out under moderately severe deprivation. 


This 


model fails to predict that the spatial location of the intersection of the approach and 
avoidance gradients, which determines how closely S will approach the bivalent goal, will 
be shifted toward the goal by increased deprivation following training at a lower level 


of hunger.) 


responses than Ss that were shocked 
when 1 hr. hungry. In any event, our 
data suggest that a weaker avoidance 
tendency results from a punishing shock 
if it is administered when a strong com- 
peting approach tendency is present than 
when it is not, 

As has been observed, our data bear on 
the relative utility of several theories 
dealing with the effects of motivational 
variables upon the comparative strengths 
of incompatible tendencies to action. One 
such theory, purely associative in char- 
acter, would hold that no drivelike con- 
Struct is necessary and that all the 


consequences of changes in the “motiva- 
tional” variable of deprivation could 
properly be attributed to altered internal 
stimuli, In its simplest form (cf. Brown, 
1961) such a theory would rely on the 
Principle of stimulus generalization to 
determine the manner in which changes 
in deprivation time, and hence in internal 
stimuli, affect the associative strengths 
of approach and avoidance reactions. The 
essential elements of one version of such 
a theory are represented in Fig. 2 where 
the spatial dimension of the conflict alley 
is shown extending from front to back 
and the deprivation-stimulus dimension 
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from left to right. Consistent with cus- 
tom, the negative gradients that extend 
along the spatial stimulus dimension have 
been plotted as steeper than the positive 
ones. But the positive and negative gra- 
dients along the deprivation-stimulus con- 
tinuum have been derived from a single 
equation. In every case the equations 
are instances of Hull’s (1952) exponen- 
tial generalization function (gH = sHp 
X 10-44) where gH, is the strength of 
the habit at the conditioned stimulus, 
sHg is generalized habit strength, d is 
the distance in jnd units between the 
conditioned and generalized stimuli, and 
j is an empirical constant. The location 
of the positive and negative gradients at 
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the left of the figure indicates that the 
conflicting reactions were both assumed 
to have been acquired under conditions 
of short-term deprivation. The second 
set of gradients at the testing point de- 
picts the expected consequences of an 
upward shift in deprivation duration at 
the time of the conflict test. Obviously, 
two intersecting three-dimensional sur- 
faces would be required to represent all 
possible values of the conditioned and 
generalized tendencies. 

Figure 2 indicates that two orthogonal 
stimulus dimensions are needed for a 
conflict-behavior model in the situations 
under consideration. More importantly, 
the figure shows that given the basic as- 
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Fic. 3. This alternative form of an associative theory is like that of Fig. 2 save that 
the approach tendency generalizing from left to right across the back surface of the diagram 
(the thin solid line with open circles) has been developed from a different equation so as 


to make it less steep than its companion negative gradient. 


(This produces a goalward 


shift of the intersection of the gradients at the testing point relative to its position at the 
training point. Substituting the dashed-line gradient joined by squares for the gradient 
immediately above it yields a third model in which the difference in steepness is the result 
of the introduction of a drive factor that multiplies approach but not avoidance.) 
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sociative assumptions, the position of the 
intersection of the gradients along the 
spatial stimulus dimension is not affected 
by the upward shift in deprivation sever- 
ity. The same conclusion would be 
reached were one to suppose that train- 
ing had been conducted at the strong end 
of the deprivation stimulus continuum 
with subsequent tests carried out at lower 
values along that dimension. Thus if it 
is assumed that the avoidance gradient is 
steeper than the positive along only one 
of the two stimulus dimensions, i.e., the 
spatial one, the resulting associative the- 
ory fails to predict, as the data of this 
study require, either closer approach with 
increased Td or lesser approach with 
decreased Td. 

Figure 3 is a modified version of Fig. 
2 in which the avoidance gradient is 
shown as steeper than the positive (i.e., 
the value of the constant j is larger) 
along the spatial dimension, and steeper 
also along the deprivation-stimulus con- 
tinuum. (The dashed-line gradient is 
irrelevant at this juncture.) This sec- 
ond associative model seems to be a step 
in the right direction since it correctly 
predicts that an increase in deprivation 
will shift the spatial location of the inter- 
section of the gradients toward the goal. 
Unfortunately, it also generates the pre- 
diction that a decrease in Td from the 
training level will have precisely the same 
effect. The sets of gradients underlying 
this incorrect prediction would look es- 
sentially like a mirror image of Fig. 3. 

Evidently, a different conceptual model 
is required having the following two 
major properties: (a) that the difference 
between positive and negative gradients 
must diminish when the gradients extend 
from a given training point toward 
stronger values of thé deprivation-stimu- 
lus dimension, and (b) that when the 
gradients extend toward the weaker val- 
ues of the deprivation dimension the dif- 
ference between them must increase. 
Given the requirement of a negative gra- 
dient that is steeper than the positive 
along the spatial dimension, the two fore- 
going prescriptions must be fulfilled if 
closer approaches to the bivalent goal 
with increased deprivation, and less close 


J. S. BROWN, D. C. ANDERSON, AND C. S. BROWN 


approaches with decreased deprivation, 
are to be deduced from the model. These 
conditions could be met by assuming (a) 
that the gradients of generalized habit 
strength for approach and for avoidance 
are different, and (b) that the shapes of 
the gradients also differ as a function of 
whether they extend upward or down- 
ward along the deprivation-stimulus di- 
mension from the training point. Alter- 
natively, one can assume that the con- 
stants of the habit-strength generalization 
equations are all identical (along the 
deprivation dimension) and that the ap- 
proach, but not the avoidance, gradient 
is multiplied by values of drive which 
increase progressively as deprivation be- 
comes more severe. This is, of course, 
the selective-drive hypothesis. The ap- 
plication of this conception is represented 
in Fig. 3 provided the dashed-line ap- 
proach gradient with square points is 
regarded as replacing the thin-line gra- 
dient immediately above it. The new 
gradient of excitatory potential, which 
in this illustrative example happens to 
differ little from the gradient it replaces, 
was obtained by first computing the re- 
quired habit-strengths from the equation 
2X 10-4, (This equation, it will be 
seen, has the same constant as does the 
equation for the avoidance gradient above 
it.) The values thus obtained were then 
multiplied by arbitrarily chosen levels of 
D ranging from 1.0 at the left to 2.0 at 
the right in a linear progression. This 
variable multiplier is designated by the 
symbol xAD appended to the equation 
in the figure. These assumptions thus 
lead to the eonclusion that increased time 
of deprivation will bring about closer 
approach to the bivalent goal, even 
though the generalized habit-strength 
gradients of approach and avoidance are, 
in essence, identical. 

In Fig. 4 the selective-drive hypothesis 
has been extended to the case involving 
training under high deprivation and tests 
under reduced deprivation. Here the 
points computed from the equation 1 X 
10-14 have been multiplied by a linearly 
graded set of D values ranging from 3.0 
at the right to 1.0 at the left, Also, the 
Strength of the avoidance has been re- 
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duced slightly to coincide with our find- 
ing that the introduction of a punishing 
stimulus when a strong competitive ap- 
proach tendency is presumed to be pres- 
ent produces a weaker avoidance than 
when the approach is weak. This latter 
adjustment, incidentally, finds additional 
support in Hoffman’s (1965) conclusion 
that inhibition generalizes in inverse re- 
lation to excitation. The two gradients 
produced by these manipulations, as Fig. 
4 shows, are more widely separated at 
the weak end of the deprivation-stimulus 
dimension than at the strong end. From 
this characteristic and from the assump- 
tion of the greater steepness of the avoid- 
ance gradient along the spatial dimension, 
less close approach to the bivalent goal 
under reduced deprivation can be derived. 
This model also yields the required out- 
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come that Ss trained, shocked, and tested 
under severe deprivation should approach 
closer than Ss for whom the same three 
treatments occurred under mild depriva- 
tion. 

While the selective-drive notion gener- 
ates the sort of asymmetrical generalized 
excitatory tendencies our data require, 
comparable asymmetries could be pro- 
duced by other factors. For one, the 
continuum of deprivation-stimulus inten- 
sity is quite unlike external stimulus di- 
mensions in that no value on the internal 
dimension, save for the weakest, can be 
experienced without all other weaker 
values in the same series being encoun- 
tered first in serial order. This means 
that if Ss have been maintained on a 
regular feeding schedule, they will have 
been repeatedly exposed to a series of 
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Fic. 4. The selective-drive conception which produced the third model (Fig. 3) has 
been extended here to the case in which training is carried out under long-term deprivation 


and tests under less severe privation. 


(The gradients along the back surface have been 


constructed so as to diverge from right to left, making possible the deduction of a backward 
shift of the intersection and hence less close approach to the goal, with decreased 


privation.) 
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internal cues of rising intensity. Rein- 
forcement will have occurred in the pres- 
ence of the most intense cues, but rarely 
in close temporal proximity to weak 
deprivation stimuli. Thus any general- 
ized reaction tendencies evoked by weak 
stimuli would tend to become extin- 
guished. A regularized feeding schedule, 
therefore, must inevitably function as 
a differential reinforcement procedure 
which tends to steepen the gradients 
extending from strong to weak stimuli. 

To facilitate the exposition of the fore- 
going interpretation it has been assumed 
that changes in hunger-produced drive 
do not change the strength of the avoid- 
ance response. This selective-drive no- 
tion is probably an oversimplification, 
however, since hunger is known to affect 
the acquisition of fear-motivated be- 
havior, rate of cortical self-stimulation, 
and other nonhunger related reactions 
(Brown, 1961). Moreover, Hull's multi- 
plicative-drive conception which holds 
that each and every habit strength is 
multiplied by the same numerical value 
of D, regardless of the source of drive, 
is doubtless also only a rough approxi- 
mation, Perhaps it is time to reject both 
these views and propose a third, namely, 
a graduated-drive theory. This concep- 
tion embodies the assumption that all 
habits are multiplied by the drive arising 
from any source, but the numerical value 
of the multiplier is graduated according 
to the relevance of the response to the 
motivational variable. Specifically, a 
habit is described as highly relevant to 
a particular source of drive if the re- 
sponse has led to the diminution or 
elimination of the drive source. The 
greater the relevance of response to drive 
source, the larger the multiplier provided 
by a given motivational variable. Thus, 
as in our experiment, responses of ap- 
proaching a given region for food that 
have been reinforced should be intensified 
substantially (multiplied by large values 
of D) by increased food deprivation. 
But reactions of avoiding that region, 
even though evoked in the presence of 
the same internal deprivation stimuli and 
external cues, should only be slightly 
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augmented by increased Td. Conversely, 

avoidance responses that have effectively 

eliminated or reduced shock should be 
enhanced by drive increments attributable 
to noxious stimulation, whereas food- 
seeking responses should be modified but 
little. 
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INFERENCES ON THE BASIS OF CONDITIONALLY 


NONINDEPENDENT DATA * 
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The purpose of this experiment was to evaluate the ability of Ss to 
exploit conditional data nonindependence in making inferences in the 
form of subjective posterior probability estimates. Ss’ posterior 
probability estimates were compared with those prescribed by Bayes’ 
theorem. 2 of 3 S groups processed data exhibiting conditional 
nonindependencies. These nonindependencies, if recognized and ex- 
ploited, had significant inferential value, Ss’ posterior probability 
estimates on the basis of the nonindependent data were in close 
agreement with those prescribed by Bayes’ theorem. The conserya- 
tism frequently observed when Ss estimate posterior probabilities was 


absent in the present experiment. 


A substantial amount of research on 
human inferential skills has been gen- 
erated as a result of renewed interest 
in subjective or personal probabilities 
and in Bayes’ theorem as a normative 
model for inductive inference. Several 
experiments have indicated that Ss are 
conservative in revising subjective 
probabilities because they fail to ex- 
tract from data as much consistency 
as is theoretically indicated by Bayes’ 


1 The research reported in this paper was 
carried out in the Laboratory of Aviation 
Psychology and was sponsored by the Aero- 
space Medical Research Laboratories, Aero- 
space Medical Division, Air Force Systems 
Command, Wright-Patterson Air Force 
Base, Ohio, under Contract No. AF 33(657) - 
10763 with the Ohio State University Re- 
search Foundation. Further reproduction is 
authorized to satisfy needs of the United 
States Government. The author is pleased 
to acknowledge the advice and assistance 
rendered by Ward Edwards of the Univer- 
sity of Michigan. In addition, the author is 
indebted to Jack F. Southard and Edwin R. 
Lassettre for their advice and critical com- 
ments and to Joseph Emanuel who assisted 
in the collection of data. Jack F. Southard 
developed the computer programs necessary 
for the large amount of computation per- 
formed during the experiment. Lonnie D. 
Whitehead developed computer programs for 
certain portions of the experimental data 
analysis. 


theorem (Edwards & Phillips, 1964; 
Peterson, Schneider, & Miller, 1965). 
In another series of experiments Ss 
were less conservative and frequently 
revised their probabilistic opinions 
about the truth of hypotheses more 
drastically than the data justified 
(Schum, Goldstein, & Southard, 1966). 

One issue not investigated in previ- 
ous studies but of considerable im- 
portance in an evaluation of human 
inferential skills concerns interdepend- 
encies among data upon which infer- 
ences are to be based. Part of the 
task of weighing evidence for infer- 
ential or diagnostic purposes includes 
appraisal of the joint impact of various 
items of evidence being considered. 
Frequently, knowledge of the form, 
extent, and probable cause of inter- 
relationships among two or more items 
of evidence makes the joint impact ol 
the evidence different from the total 
impact resulting from successive but 
independent consideration of each 
item. In some cases the dependence, 
or more appropriately nonindepend- 
ence, of two or more items of data is 
conditional upon the truth of some hy- 
pothesis (H;) being considered. In 
other words, the factor or process 
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mediating the nonindependence is the 
truth of one of the hypotheses being 
entertained to explain the occurrence 
of the data. Formally, two items of 
data (Dı, Dz) are stochastically inde- 
pendent in the light of H; if 


P(D3|H;,D;) = P(D2|Hi) [1] 


This expression describes a condition 
under which the probability of observ- 
ing D» given H, is the same whether 
or not D, has been observed. Equa- 
tion 1 implies that 


P(Dy,Da|H;) 


= P(D,|H;) P(D;|H;) [2] 


Data D; and Ds are nonindependent in 
the light of H, if the equalities ex- 
pressed in Equations 1 and 2 are vio- 
lated. Considering two hypotheses (Hg 
and Hy), suppose that P(Ds|H,,D) 
> P(D:|Ha) but P(Da3|HyDi) = 
P(Ds|H;). The observation of D; can 
be said to have affected the impact of 
Dz when H, is assumed. In other 
words there is sóme interdependence 
between D; and Ds conditional upon 
or mediated by the truth of H,. Fre- 
quently, certain data or classes of data 
exhibit some recognizable interrela- 
tionships when a certain H, is true. 
Knowledge of the form and extent of 
these conditional nonindependencies 
can often be of significant value in 
revising one's opinion about the truth 
of some hypothesis. 

Following is an example of noninde- 
pendence mediated by or conditional 
upon a certain hypothesis. Even in 
what one refers to these days as 
"peace" there are occasional acts of 
sabotage and attempted or actual as- 
sassinations of key personnel. If there 
is peace, one more often expects these 
events at unrelated times. However, 
if some potential adversary is planning 
to wage war in the very near future, 
the simultaneous occurrence of these 
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events might be rendered more highly 
probable. Suppose that one receives 
the following data: D, is the report of 
suspected acts of sabotage at a certain 
military installation and D» is the re- 
ported assassination of a member of 
the Joint Chiefs of Staff. For the sake 
of simplicity consider only two hy- 
potheses, war (Hw) and peace (Hp). 
First, one might judge that P(Di|Hw) 
is somewhat greater than P(Di|Hj) 
and, similarly, that P(Ds|H,) is some- 
what greater than P(D.|H,). These 
statements imply that P(D,|Hw) - 
P(D2|Hw) > P(Di|Hy) | PCDa|H;). 
Now, if one suspects that Dı and Ds © 
are nonindependent in the event of Hy, ^ 
then P(Ds|Hw, Dı) may be greater 
than P(Ds|H,). This implies that 
P(D1,D2|Hw) > P(Di\Hw) — P(Da| | 
He). In other words, the diagnostic l 
value of Dz has been altered by knowl- 
edge of Dı when H, is assumed. One 
may thus be led more quickly towards 
acceptance of Hw by suspicion of inter- 
dependence between D, and D» because 
it follows that: 


P(D,,D2|Hw) > P(Di|Hy) 
P(D2|Hw) > P(Ds|Hy) P(Ds|Hy) - 


The purpose of the present experi- ^ 
ment was to evaluate the extent to 
which Ss can exploit simple conditional — 
data nonindependencies in making in- - 
ferences from data. These inferences - 
were in the form of subjective pos- - 
terior probability estimates [P (H |D) ]; 
ie, the probability that some hy- 
pothesis had generated the sequence of — 
data being observed. The independent 
variable in the experiment referred to : 
the conditional nonindependence be- - 
tween certain items of data Ss were 
required to evaluate. For two groups - 
of Ss there were recognizable relation- 
ships between certain items of data 
when one or the other of two hy- 
potheses being considered was true. - 


INFERENCE BASED ON CONDITIONAL NONINDEPENDENCY 


These relationships, if recognized and 
exploited, had definite inferential or 
diagnostic value. Data presented to a 
third group (control) exhibited no con- 
ditional nonindependencies. Estimates 
of P(H|D) produced by Ss in each 
group were compared with three types 
of calculations of P(H|D) from Bayes’ 
theorem. Each type of calculation was 
based upon different assumptions about 
the data-generation process. 


METHOD * 


The Ss performed these probabilistic in- 
ference tasks in a simulated threat-diagnosis 
context. During the experiment 240 samples 
of data were presented to each S in the three 
groups. These data samples, termed 
“scenarios,” were fictitious intelligence re- 
ports describing deployments of the military 
forces of a hypothetical adversary. There 
were six classes of data which described the 
characteristics of these deployments. Each 
of these data classes was divided into either 
three or four subclasses. For example, a 
certain data class described the number of 
“heavy tank units” represented in a deploy- 
ment. In this particular data class there 
were four subclasses, each of which repre- 
sented a certain amount of these units. 
There could, in fact, be either one, two, 
three, or four heavy tank units in each de- 
ployment. Each scenario or data sample 
presented to Ss consisted of six data items. 
Each item was a subclass selected by E, 
one each from each of the six data classes 
(E's selection procedure is described below). 
The task for each S was to estimate, for 
each scenario, the probability that one or the 
other of two hypotheses had generated the 
data in that scenario. The two hypotheses 
referred to whether a deployment (as de- 
picted by a certain scenario) represented a 
forthcoming attack by the adversary (Ha) 
or merely represented a rehearsal or 
maneuver (H»). All Ss were told that, 
prior to the evaluation of the data in each 
scenario, they could consider either hypothe- 


2A more detailed report of the present 
experiment including additional details of the 
data generation procedure, a more specific 
description of the process of calculating 
P(H|D), and additional experimental data 
analyses are available as a Laboratory of 
Aviation Psychology (Ohio State Univer- 
sity) research report (mimeo). 
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TABLE 1 


Basic DATUM-HYPOTHESIS RELATIONSHIPS 
[P(Djx| H2] 


Hypothesis 
Data Class Subclass 

Ha He 

Di 1 10 10 
2 10 .20 

3 .20 .30 

4 .60 40 

Da 1 40 10 
2 10 .20 

3 .20 .30 

4 .60 40 

D; 1 :28 AS 
2 .30 .30 

3 45 25 

Ds 1 25 ES 
2 30 .30 

3 45 25 

Ds 1 10 .20 
2 10 .20 

3 30 30 

4 50 .30 

Ds 1 .20 .30 
2 .30 40 

3 .50 30 


sis as being equally likely. (During the 
experiment each hypothesis was in fact true 
an equal number of times.) These subjec- 
tive estimates of P(H|D) thus represented 
revision of opinion about the likelihood that 
either hypothesis could explain the occur- 
rence of the observed sample of data. 
Data-generation models.—The independent 
variable, conditional data nonindependence, 
was manipulated through the design of the 
data-generation models underlying the con- 
struction of scenarios presented to each 
group. A scenario to be presented to a 
particular group was generated by E by 
selecting one subclass from each of the six 
data classes according to the data-generation 
model appropriate to that group. 'The basic 
model, shown in Table 1, was appropriate 
to the scenarios in all three groups. It 
illustrates, for each data class considered 
separately, the probability of occurrence of 
each individual subclass when either of the 
hypotheses was true. These probabilities 
are of the form P(Dy|H:) where j is a 
particular data class and k is a particular 
subclass of Ds. However, in order to 
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present scenarios in which certain condi- 
tional data interdependencies existed, it was 
necessary to construct matrices of joint con- 
ditional probabilities whose internal entries 
would reflect the desired interdependencies. 
The following rule prescribed the design of 
the data-generation models for Groups 1 and 
2 (whose scenarios were to depict data inter- 
dependence): when Ha was true, Data 
Classes Ds and D. would exhibit interde- 
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pendence, and when H» was true, Data 
Classes Dı and Ds would exhibit interde- 
pendence. Thus, interdependence between 
pairs of data classes constituted the highest 
order of conditional relationship instituted in 
the models. The data-generation model for 
Group 3 was designed to reflect stochastic 
independence (see Equation 2) between the 
data classes in each of these two pairs re- 
gardless of the hypothesis being considered. 


TABLE 2 

Dara-GENERATION MODEL ror Data CLASSES D; AND D» 
Ha True (120) H» True E 4 : 

TES ho T E 

1 2 3 4 1 2 3 4 

E E E E D ap | G2 
JU ERR ae P28 ARE a | GP 
[ote | 0 | 821 GP | GO| old? | a6 | G9 | of? | G9 
1 [98 [4e | G2 | GP | GO oP of oS? | GP | d» 
ao | GO) | G0 | o EO 02 | R^ | Go] go | cam 
"ito | BY 8 | DP ce oo | a | ae 
"pe oP |] e SP | oP] of) GP] GP 
"18 | 9 4 d» GPs ug oP | 22 | do 
"e e e Gu cogo | ap 
40 [do |o | do |o | ^| 20] Go| go | e 
EE e g] Bele) lel ol a 
"os ELR] D Pu» uo go] a» 
"LO POP | uo| eo wo] uo] ao 
"I| | GP] | GO 9| go] co A | Go 
a0 | 0 | GP | GD | GBP) 42) go) Go| a | gap 
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Table 2 illustrates the generation models 
for the three groups with respect to Data 
Classes D; and Ds. 

The probability values in the cells of each 
matrix in Table 2 refer to the probability of 
joint occurrence of subclasses from Dı and 
Ds given Ha or H». These values are in the 
form P(Dge,Dny|H:), where g and h are 
data classes and x and y are subclasses of 
D, and Da. In the Group 1 model, for 
example, P(Dius,Da|H»)-—.29. The mar- 
ginal probability distributions for each data 
class under Ha or H» are the same as those 
listed in Table 1 for these classes. Further, 
given Ha or Ho, the marginal probability dis- 
tributions for each data class are the same 
for all three groups. The numbers in 
parentheses in the cells of each matrix refer 
to the frequency, during the experiment, of 
joint occurrence of particular subclasses from 
D; and Ds. In the Group 1 model, for 
example, on the 120 occasions on which H» 
was true, Dıs and Ds occurred jointly on 33 
occasions. Given either hypothesis, the 
marginal frequency distributions for each 
data class are the same for all groups. 

The differences between groups, in terms 
of the experimental variable, can now be 
specified in detail. Table 2 illustrates the 
powerful diagonal relationship between Dı 
and Ds for Group 1 when H» was true. A 
similar diagonal relationship between Ds and 
D, (not shown) for Group 1 existed when 
Ha was true. Table 2 also illustrates the 
off-diagonal relationship between Dı and Ds 
for Group 2 when H» was true. Similarly, 
Ds and D, exhibited an off-diagonal rela- 
tionship when Ha was true. For Groups 1 
and 2 Data Classes Dı and Ds were stochas- 
tically independent when Ha was true and Ds 
and D, were stochastically independent when 
Hy was true. For Group 3 all matrices in- 
volving the pairs, D;,Ds and D3,D, exhibited 
stochastic independence regardless of the 
truth of Ha or H». 

Strong diagonal relationships thus char- 
acterized the conditional nonindependence 
for Group 1. A less obvious and some- 
what weaker off-diagonal relationship char- 
acterized the conditional nonindependence for 
Group 2. It is important to recognize that 
if each of the data classes were considered 
independently, the diagnostic situation would 
have been identical for all three groups, i.e., 
P(Dj|H) values were identical for each 
group. 

Of the 240 scenarios presented to Ss in 
each group, 120 were representative of Ha 
and 120 of H». In order to generate a spe- 
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cific scenario of either type for a particular 
group, E selected one subclass each from 
Pairs D:,D2 and from D,D: randomly with- 
out replacement from the joint frequency 
matrices appropriate to the group. Single 
subclasses, one each from Ds and De, were 
then drawn at random without replacement 
from frequency distributions based upon in- 
dependent consideration of the P(Ds|H:) 
and P(H|H:) distributions shown in Table 
1, Data Classes Ds and De were included in 
the scenarios merely as “filler” items. The 
diagnostic value of these data were set in- 
tentionally very low so that their effects 
would not diminish the impact of the specified 
data interrelationships. 

A measure of conditional nonindependence. 
—]t is necessary to characterize more spe- 
cifically the strength and significance of the 
conditional data class interdependencies insti- 
tuted in the data-generation models for 
Groups 1 and 2. One convenient indication 
of the interdependence between two variates, 
making no assumptions about the metric 
properties of the variates, is “contingent 
uncertainty” (Garner, 1962). Contingent 
uncertainty [U(+:y)] in a bivariate matrix 
is a measure of the amount of uncertainty re- 
duction (in bits) due to correlation or 
contingency between Variates x and y. The 
following U(x:y) values characterize the 
strength of the conditional interdependencies 
instituted into the data-generation models for 
Groups 1 and 2. U(D1:D2) was equal to 
1.24, 0.36, and 0.00 for Groups 1, 2, and 3, 
respectively. U(Ds:D.) was equal to 0.95, 
0.56, and 0.00 for Groups 1, 2, and 3, re- 
spectively. All U(x:y) values for Groups 
1 and 2 were significant (p<.001). (Tests 
of significance of these values were per- 
formed using x? = 1.3863 n[U («:y)], where 
12120) U(D::D:) and U(Ds:Ds) were 
both larger in Group 1 than in Group 2. As 
expected, these values were zero for Group 3. 

Subjects and instructions. —Nine male vol- 
unteer Ss, paid on an hourly basis, were 
assigned, three each, to Groups 1, 2, and 3. 
These Ss were not naive with respect to 
probabilistic inference tasks. Every S had 
served in at least two previous experiments 
in which probabilistic estimates of the form 
P(H|D) and P(D|H) were made. The 
three S groups were given identical instruc- 
tions. The Ss were told that by keeping 
records of the joint occurrence of certain 
data they might observe data interrelation- 
ships when one or the other of the hypothe- 
ses was true. Further, they were told that, 
if there were relationships among data 
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classes, such relationships would occur be- 
tween D, and Ds and between Ds and Da. 
Such instructions were necessary because of 
the large number of possible interrelation- 
ships. It is important to emphasize that Ss 
were given no information about the form 
or extent of the relationships nor were they 
told which hypotheses would mediate the 
relationships. In fact, Ss were told only 
where to look for data class interrelation- 
ships if they should occur. 

Data presentation.—The six items of data 
were transmitted simultaneously for each 
scenario over closed-circuit television to the 
three S groups. As previously indicated, 
120 scenarios were representative of Ha and 
120 of Hə. These two types of scenarios 
were presented in random order throughout 
the experiment. All Ss were provided with 
the P(Dj,|H.) values shown in Table 1. 
These values, it should be recalled, specified 
datum-hypothesis relationships for each data 
class considered independently. ^ Upon re- 
ceiving the six data items for each scenario, 
Ss estimated P(H|D) by evaluating these 
data with reference to the P(Dyg|H.) 
matrix and cumulative records of the joint 
occurrence of the subclasses of D;,Ds and 
Ds,Di. The Ss in all groups were required 
to maintain these accumulations of joint 
frequency. After the P(H|D) estimations 
had been made for a scenario, Ss were told 
which hypothesis was true. Under this 
hypothesis category, on an appropriate form, 
Ss plotted the joint occurrence of the ob- 
Served states of the above pairs of data 
classes. Subsequent knowledge of the true 
hypothesis was thus essential in this ex- 
periment. With this joint relative fre- 
quency information Ss in Groups 1 and 2 
were able to establish the conditional data 
interdependencies which emerged as the ex- 
periment progressed. The Ss in all groups 
were required to normalize P(H |D), ie, 
make their P(H|D) estimates have unit sum 
for each scenario, 

Bayes’ theorem calculations of P (H|D).— 
The outcome of a P(H |D) calculation from 
Bayes’ theorem in the present experimental 
situation will depend upon whether or not 
conditional nonindependencies among the data 
being Processed are taken into consideration, 
In addition, the outcome of these calculations 
will depend upon the assumptions one makes 
with regard to the stationarity of P(Dy|H.) 
in the basic data-generation model (Table 
1). Three different calculations of P(H|D) 
were performed upon the scenario data pre- 
sented to each group. In all of these cal- 
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culations the prior probability term [P (H4)] 
in Bayes theorem was set at .5 before 
processing the data from each scenario. This 
corresponded to the instructions given to 
all Ss. The three different types of calcula- 
tions can be described in the following 
manner : 

Calculation 1 incorporated the emerging 
data class nonindependencies in the scenarios 
presented to Groups 1 and 2. A computer 
was programmed to incorporate, into Bayes' 
theorem, joint conditional probabilities of 
the form P(Dyz,Dsy|H:) and to revise or 
update these values on the basis of accumula- 
tions of joint relative frequencies in the data 
class pairs D;Ds and DD. This pro- 
grammed calculation incorporated instructions 
analogous to the instructions given to all Ss. 
Before data from the first scenario were 
processed, the joint conditional probabilities 
entered into Calculation 1 for Data Classes 
D,D: and DaD, were based upon the as- 
sumption of independence and were merely 
products of appropriate values shown in 
Table 1 [eg, P(DisDa|H«) = P (Dis|Ho), 
P(Da|H.)]. The sample size upon which 
the values shown in Table 1 were based was 
not specified to Ss. The procedure for up- 
dating in Calculation 1, in effect, assumed 
that this sample size was one since decimal 
numbers rather than relative frequencies 
were specified in Table 1. This updating 
procedure, on the basis of accumulated rela- 
tive frequencies caused some fluctuation in 
P(D;sDi,|H:) values which soon dampened 
out as sample size during the experiment 
(ie, number of scenarios) increased. The 
only discrepancy between instructions to Ss 
and to Calculation 1, therefore, occurred with. 
respect to "prior »" or the sample size upon 
which P(D;,|H«) values were based. The 
data gathered in the experiment show that 
effects of this discrepancy are only apparent 
in the early scenarios. 

Calculation 2 represents optimal P(H|D) 
under the assumption of stationarity of the 
P(Dn|H:) values in Table 1 and inde- 
pendence of data classes. This calculation 
thus specified optimal P(H|D) if conditional 
nonindependencies were not exploited. 
„Calculation 3 represents optimal P(H|D) 
if the P(Djx|H:) values were updated but 
data classes were assumed to be independent 
Since this calculation incorporated mo 
P(D,Dw|H.1) values, conditional noninde- 
pendence could not be accounted for in this 
solution of P(H|D). 

An indication of inference ability.—Proba- 
bilistic inferences evaluated during the ex- 
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periment were the estimates or calculations 
of P(H|D) produced for each scenario after 
all six data items in the scenario had been 
processed. The size of the P(H|D) esti- 
mate placed by S in true hypothesis cate- 
gories was taken as a measure of S's 
certainty that these hypotheses accounted for 
the occurrence of the observed scenario data. 
Mean P(H|D) placed in true hypotheses by 
Ss and by the three Bayes' theorem solutions 
was calculated by E in each of 10 successive 
blocks of 24 scenarios throughout the ex- 
periment. Comparisons of these mean values 
for S and Bayes' theorem calculations indi- 
cated S’s inference capability relative to the 
three normative expressions. 


RESULTS 


Between-group | comparisons.—Fig- 
ure 1 illustrates the mean P(H|D) 
estimate placed by the three S groups 
in correct hypotheses in each of 10 con- 
secutive blocks of 24 estimates during 
the experiment. The Ss in Group 1 
placed significantly larger average 
P(H|D) in true hypotheses through- 
out the experiment than did Ss in 
Group 2 (p < .05, Mann-Whitney U 
test, one-tailed) or in Group 3 (P< 
01, U test, one-tailed). The Ss in 
Group 2 also placed significantly 
larger average P(H|D) in true hy- 
potheses than did Ss in Group 3 (P < 
.01, U test, one-tailed). 


o—— Group | Subjects 
Loe---- Group 2 Subjects 
e— Group 3 Subjects 


3 


Mean PI(HIO) in Correct Hypothesis Categories 


Consecutive Blocks of 24 P(H|D) Estimates 


Fic. 1. Mean P(H|D) placed in correct 
hypotheses by S groups. 
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Fic. 2. Mean estimated and calculated 
P(H|D) for Group 1. 


Subjects’ estimates of (P(H|D) 
compared with Bayes’ theorem calcu- 
lations —Figure 2 illustrates mean 
P(H|D) placed in true hypotheses by 
Group 1 Ss (taken as a group) and 
by the three Bayes' theorem calcula- 
tions performed on the Group 1 
scenario data. Beginning in the second 
block of 24 estimates, the average 
P(H|D) estimates placed in true hy- 


potheses by Group 1 Ss were nearly 
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Fic. 3. Mean estimated and calculated 
P(H|D) for Group 2. 
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Fic, 4. Mean estimated and calculated 
P(H)|D) for Group 3. 


identical to the mean P(H|D) values 
resulting from Bayes' Theorem Calcu- 
lation 1. Group 1 Ss placed signifi- 
cantly larger P(H|D) in true hy- 
potheses than did Calculations 2 or 3 
(p < .025, U test, one-tailed). 

Figure 3 indicates fairly close agree- 
ment, beginning in Block 5, between 
mean P(H|D) in true hypotheses 
placed by Group 2 Ss (taken as a 
group) and Bayes' Theorem Calcula- 
tion 1. Group 2 Ss placed consistently 
larger P(H|D) in true hypotheses than 
did Calculations 2 or 3. These differ- 
ences, although small, were significant 
(P < 05 U test, one-tailed), Figure 
4 illustrates the close agreement be- 
tween mean P(H|D) in true hy- 


TABLE 3 


RANK-ORDER CORRELATIONS BETWEEN INDI- 
VIDUAL S's P(H|D) Estimates AND 
P(H|D) RESULTING FROM Bayes’ 
THEOREM CALCULATION 1 


Group 
s 
1 2 3 
1 St E 63 
2 -67 56 34 
3 74 63 m" 
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potheses placed by Group 3 Ss and all 
three calculations. 

A more precise indication of the 
agreement between each individual S's 
P(H|D) estimates and P(H|D) re- 
sulting from Bayes' Theorem Calcula- 
tion 1 (the instructions for which were 
most analogous to those given to all 
Ss) is shown in Table 3. Rank-order 
correlations (rho) were calculated be- 
tween the P(H|D) estimates placed 
by each individual S in true hypotheses 
and the P(H|D) values placed in 
true hypotheses by Calculation 1 for 
each of the 240 scenarios. All of these 
rho values were highly significant (p 
< 001, t test using correction for tied 
ranks, df — 238). 


Discussion 


The primary purpose of the present 
experiment was to evaluate the ability of 
Ss to revise subjective probabilities on 
the basis of data exhibiting conditional 
nonindependencies. The most prominent 
result of the experiment was the ex- 
tremely close agreement between mean 
P(H|D) placed in true hypotheses by 
Group 1 Ss and by Bayes’ Theorem Cal- 
culation 1 (Fig, 2). Graphs plotted for 
individual Ss (not shown) indicated that 
this close agreement was not entirely an 
artifact of data aggregation. The magni- 
tude of the difference between Group 1 
Ss’ mean P(H|D) in true hypotheses 
and the corresponding mean values for 
Calculations 2 and 3 (which assumed 
independence) indicates that Group 1 Ss 
Were not processing nonindependent data 
as though they were independent. Per- 
formance differences between Group 1 
and Group 2 Ss are attributable to dif- 
ferences in the form and extent of non- 
independence in the scenario data for the 
two groups. The strong diagonal rela- 
tionships in Group 1 scenario data were 
apparently easier to establish than the 
Somewhat weaker off-diagonal relation- 
ships in the scenario data for Group 2. 
Figure 3 indicates that Group 2 Ss also 
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were exploiting the nonindependence in 
their scenario data. 

In probabilistic inference tasks highly 
experienced Ss, performing under no ex- 
plicit cost-payoff arrangement, are known 
to maximize P(H|D) in hypotheses they 
judge to be most likely (Schum, Gold- 
stein, & Southard, 1966). Maximization 
of P(H|D) in such a situation seems, 
in fact, to be an optimal strategy if one 
assumes that Ss adopt implicit linear 
payoff functions (ie. they imagine, for 
example, that it is twice as valuable to 
estimate a posterior probability of 1.0 
as it is to estimate one of .5 for some 
true hypothesis). One might argue, 
therefore, that the high mean P(H|D) 
in true hypotheses for Groups 1 and 2 
and the similarity of performance in these 
groups to Calculation 1 could have re- 
sulted from independent consideration of 
data and maximization of P(H|D). The 
results from Group 3 offer some informa- 
tion regarding this issue. The results 
of Calculations 2 and 3 (which assume 
independence) in Fig. 2, 3, and 4 show 
that the overall difficulty of the diagnos- 
tic situations confronting each of the 
three groups was nearly identical if the 
scenario data were processed independ- 
ently. Had Groups 1 and 2 actually 
processed their data in an independent 
fashion, one would expect their perform- 
ance to be similar to that of Group 3. 
The magnitude and consistency of the 
differences between Group 3 and Groups 
1 and 2 (Fig. 1) offer some assurance 
that Ss in Groups 1 and 2 were respond- 
ing to the nonindependencies in their 
data. The degree of similarity between 
block mean P(H|D) values in these 
groups and the results of Calculation 1, 
therefore, does not appear to be spurious. 
The best indication of similarity between 
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Ss’ responses and Calculation 1 results 
across 240 individual scenarios comes 
from the rho calculations. The typically 
moderate size of these rho values cautions 
against overenthusiasm in interpreting 
the similarities in mean P(H|D) be- 
tween Ss and the normative statement 
expressed by Bayes' Theorem Calcula- 
tion 1. 

The present experiment also provides 
additional evidence that not all Ss ex- 
hibit conservatism in subjective proba- 
bility revision. The explanation for this 
result comes from the characteristics of 
the group of Ss used in the present ex- 
periment. All Ss had made similar re- 
visions of subjective probabilities in 
several previous experiments but under 
somewhat more difficult circumstances. 
In the previous experiments (but not in 
the present experiment) Ss were, on 
occasion, given Bayes’ theorem P(H|D) 
solutions as part of their performance 
feedback. 
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STIMULUS SPECIFICITY: NONREWARD * 


E.J. CAPALDI 
University of Texas 


80 rats were trained in the runway over 3 phases: partial reinforce- 
ment acquisition, Extinction I, and Extinction II. In Extinction I, 
nonrewarded confinement (NC) was either the same as or different 
from NC in acquisition; and in Extinction II, NC was shifted to the 
opposite value, again resulting in NC being the same as or different 
from acquisition NC. Early in Extinction I, response decrement was 
greater for different than for same groups, and later, 60-sec. NC 
produced slower terminal running than did 10-sec. NC. Similar 
results occurred in Extinction II. The results suggest that each NC 
occasions a distinctive stimulus and that extinction in the instrumental 
reward situation is to a considerable extent only generalized extinction. 


Recent experimental evidence sug- 
gests that characteristic stimuli are pro- 
duced by reward and nonreward 
(Capaldi & Stanley, 1963), by differ- 
ent durations of delayed reward 
(Capaldi & Poynor, 1966), and by dif- 
ferent numbers of consecutive nonre- 
wards (Capaldi, 1964). Generalizing 
on these findings produces the stimulus 
specificity hypothesis, the assumption 
that each condition of reward and of 
nonreward has an associated and dis- 
tinctive stimulus. These stimulus after- 
effects remain functional for at least 
24 hr. (eg., Capaldi & Spivey, 1964) 
and become part of the stimulus com- 
plex on the succeeding trial. Since the 
stimuli occasioned by nonreward in ex- 
tinction are necessarily different from 
the stimuli conditioned to the instru- 
mental response in acquisition, it fol- 
lows that the instrumental response is 
always extinguished in a stimulus com- 
plex different from that in which it was 
learned. Thus, extinction in the in- 


1 This investigation was supported in part 
by Research Grant HD 00949-02 from the 
National Institute of Child Health and Hu- 
man Development. Gratitude is expressed 
to Larry R. Stanley and David A. Lynch, 
who ran a preliminary version of the present 
investigation, and to Hugh B. Poyner, who 
materially aided the present investigation. 


strumental reward situation is to a 
considerable extent only generalized 
extinction. The corollary of this propo- 
sition is, of course, that even after an 
unlimited number of nonreinforce- 
ments the strength of many of the S-R 
connections established in acquisition 
will remain at nearly their preextinc- 
tion levels. "Thus, the reintroduction 
of these acquisition stimuli, even fol- 
lowing considerable extinction, could 
be expected to evoke the instrumental 
response with considerable vigor. This 
deduction from the stimulus specificity 
hypothesis was employed to determine 
if different durations of nonrewarded 
goal-box confinement have associated 
distinctive stimuli, 


METHOD 


Subjects —The 40 male and 40 female 
Naive rats were about 90 days old when pur- 
chased from the Holtzman Company, Madi- 
son, Wisconsin. 

Apparatus—The runway, painted a flat 
gray throughout, was 82 in. long, 4 in. 
wide, with 9-in. high sides, and was covered 
with i-in. hardware cloth. An 8-in. long 
floor treadle suspended over a microswitch 
constituted the initial portion of the alley, 
and when the treadle was depressed by the 
weight of the rat, whose front paws were 
always placed at its extreme forward edge, 
a 0l-sec. clock was started. Interrupting 
a photobeam 693 in. beyond the treadle and 
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23 in. in front of a brass 2X 4 X 4 in. food 
cup covered by a tightly fitting, sliding, metal 
lid, stopped the clock and automatically 
opened the lid covering the food cup. A 
guillotine door 14 in. from the distal end of 
the alley was manually lowered behind S 
after the photobeam was broken. Through- 
out all phases of training S was allowed a 
maximum of 180 sec. to traverse the runway ; 
when 180 sec. elapsed, S was guided to the 
goal area and the guillotine door was lowered 
confining S for the appropriate time. 
Preliminary training.—On arrival at the 
laboratory Ss were housed in individual 
cages and given complete access to food and 
water for the next 5 days. On Day 6 the 
80 Ss were randomly divided into four 
groups of 20 Ss each and the daily depriva- 
tion cycle consisting of 1-hr. access to pellets 
in the home cage was begun. Since in ex- 
perimental training Ss were to be run indi- 
vidually and at the same time each day, it 
was possible to adjust each S's feeding sched- 
ule to correspond closely to the time it was 
to be run in experimental training by allow- 
ing 6 min. to run each $. The only 
restriction placed on the random procedure 
was that no two Ss from the same group 
were run consecutively. On each of the 
Days 14 through 17 S was allowed 2 min. 
to explore the unbaited alley and was han- 
dled for 1 additional min. before being re- 
turned to the home cage. On each of the 
Days 18 through 21 S was placed in the goal 
box and was removed after consuming six 
045 gm. P. J. Noyes pellets. By Day 21 
all Ss ate readily in the goal box. 
Experimental training. — Experimental 
training consisted of three phases: 20 days of 
acquisition training under partial reinforce- 
ment, 9 days of Extinction I, and 3 days of 
Extinction II. During each phase there 
were three trials each day separated by a 
30-sec. intertrial interval spent in a black 
box. Following the last daily trial S was 
returned to the home cage where it was fed 
the daily ration 10 min. later. In acquisition 
the second daily trial was always nonre- 
warded, the first and third trials being im- 
mediately rewarded with six .045 gm. P. J. 
Noyes pellets. For a random half of the Ss 
nonrewarded confinement in the goal box 
(NC) was 10 sec., for the other half, 60 sec. 
In Extinction I a random half of each acqui- 
sition group was nonrewarded employing 
the same NC as in acquisition, the remaining 
half employing the opposite NC, yielding 
four groups: 10-10, 10-60, 60-60, and 60-10. 
In Extinction II NC for each group was the 
opposite of that received in Extinction I, ie, 


10-10-60, 10-60-10, 60-60-10, and 60-10-60. 
Note that Extinction II, as in Extinction I, 
for two groups NC was the same as in 
acquisition training and for two groups it 
was the opposite. It will be helpful in what 
follows to refer to acquisition NC as AC, 
Extinction I NC as ECI, and Extinction II 
NC as ECII. 


RESULTS 


Analysis of variance on the mean 
log times for the last 5 days of 
acquisition produced F<1. The 
mean log time for each group on each 
day of Extinction I and Extinction II 
is shown in Fig. 1. Figure 1 shows 
that groups for which AC and ECI 
were the same ran considerably faster 
early in training than groups for which 
AC and ECI differed, this effect 
diminishing as training continued. Fig- 
ure 1 also shows that 10-sec. ECI pro- 
duced faster running at terminal level 
than 60-sec. ECI, this effect being 
nonexistent in the early extinction 
trials, increasing as training continued. 
It is clear from an inspection of Fig. 
1 that in the early extinction trials the 
AC x ECI interaction was the domi- 
nant influence on performance whereas 
late in extinction the ECI variable be- 
came dominant. Analysis of variance 
over the first 2 days of Extinction T 
indicated that the AC x ECI inter- 


EXTINCTION TE 


1 3 


B 
DAY 
Fic. 1. Mean log running time for each 


of the groups of each day of Extinction I 
and of Extinction II. 
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action was significant, F (1, 76) = 
6.91, p < .01, with neither AC, F < 1, 
nor ECI, F <1, being significant. 
Over the last 2 days of extinction ECI 
was significant, F (1, 76) =7.13, p< 
.01, with neither AC, F < 1, nor the 
AC X ECI interaction, F = 1.23, being 
significant. 

By Day.3 of Extinction II a com- 
plete re-ordering of the groups oc- 
curred with the fastest group in Ex- 
tinction I being the slowest group in 
Extinction II, and so on. Attending 
to Day 3 of Extinction II it can be 
seen that 10-sec. ECII produced faster 
running than 60-sec. ECII and that 
running was faster if AC and ECII 
were the same rather than different. 
An analysis of variance on Day 3 of 
Extinction II indicated that AC was 
not significant, F < 1; however, differ- 
ences associated with both ECII, F (1, 
76) =8.06, p<.01, and the inter- 
action of AC x ECII, F (1, 76) = 
7.42, p < .01, were significant. 


DISCUSSION 


Consistent with previous findings is 
the early Trial AC X ECI interaction 
(Bloom, 1963; Capaldi & Bowen, 1964; 
Hulse, 1958; Stanley & Rowe, 1954) and 
the slower running as a function of 
longer ECI (Bloom, 1963; Capaldi & 
Bowen, 1964; Klugh, 1961; Stanley & 
Rowe, 1954); however, ECI failed to 
influence performance in investigations 
by Hulse (1958) and Stanley and Clay- 
ton (1955). The present investigation 
appears to be the first to employ ECII. 

The present results support the stimu- 
lus specificity hypothesis in suggesting 
that different durations of nonrewarded 
goal-box confinement produce distinctive 
stimuli and that extinction in the instru- 
mental reward situation is, for those 
S-R connections based on the stimulus 
consequences of NC duration, little more 
than generalized extinction, The basis 
for these inferences may be developed 
as follows. Consider first that 10-sec. ^ 
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NC occasioned. a specific stimulus 
(SN). as did the 60-sec. NC (Seo). 
Accordingly, in acquisition the 10-sec. 
AC group formed the habit SNo H R and 
the 60-sec. AC group, the habit Seo H R. 
Thus, the AC x ECI interaction obtained 
here and in previous investigations cited 
above is understandable since the un- 
switched groups (10-10 and 60-60) were 
extinguished in the presence of the same 
stimulus conditioned to the instrumental 
response in acquisition (direct extinc- 
tion) while the switched groups (10-60 
and 60-10) were extinguished in the 
presence of a new or generalized stimu- 
lus (generalized extinction). - It also fol- 
lows that since the switched groups were 
given only generalized extinction in Ex- 
tinction 1, when they were returned 
in Extinction II to the originally condi- 
tioned and not fully extinguished stimu- 
lus they would tend to run more rapidly 
than the unswitched groups of Extinction 
I which now experienced a new or gen- 
eralized stimulus in Extinction II, The 
amount of generalized extinction: could 
be expected to increase as the stimulus 
similarity between the acquisition stimuli 
and the extinction stimuli increases, It 
may be considered that SN,, and and 
SN, are relatively similar stimuli. How- 
ever, acquisition training under consistent 
reward (SR, stimulus consequent of re- 
ward) followed by extinction training 
(SN), should result in relatively little 
extinction of the habit S2 HR since 
S® and SN differ considerably.. It is 
very probably for this reason that experi- 
ments dealing with the successive acqui- 
sition and extinction of habits (eg, 
North & Morton, 1962) show very rapid 
reacquisition of the habit when reward 
is reintroduced—the habit SR H R, prop- 
erly speaking, was not fully extinguished 
by the nonrewards. It can be seen that 
from the point of view of "reinstating 
the stimulus conditions which prevailed 
in acquisition” the present experiment is 
highly similar to those which have dealt 
with the successive acquisition and ex- 
tinction of habits. 


a A number of trials were required to 


hange performance in Extinction II. 
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For example, Group 10-60-10 showed 
improvement on each successive day of 
Extinction II. Such gradual performance 
change might be regarded as being in- 
compatible with the present hypothesis 
since the hypothesis obviously requires 
a sudden shift in performance when the 
previously conditioned stimulus is re- 
introduced. The gradual shifts actually 
observed can be explained on the follow- 
ing basis. Generally it can be expected 
that the nonreinforced stimulus occa- 
sioned by a particular trial is a function 
not only of that trial but of several pre- 
vious trials as well (Capaldi, 1964). 
Thus, in acquisition 10-sec. NC always 
occurred following a rewarded trial 
whereas in Extinction II, 10-se. NC 
occurred initially against a background 
of 60-séc. NC and subsequently against 
a background of 10-sec. NC. From this 
point of view the significant AC X ECIT 
interaction may be regarded as an Over- 
generous estimate of the amount of gen- 
eralized extinction, since the stimulus 
actually conditioned to the response in 
acquisition failed to occur in its original 
form in Extinction II. A fairer test for 
generalized extinction might have in- 
volved administering several rewards fol- 
lowing Extinction I and preceding Ex- 
tinction II. 

The present results suggest that in the 
instrumental reward situation the mon- 
reinforcement of a reaction is not neces- 
sarily equivalent to the extinction of an 
S-R tendency. Thus, a careful distinction 
Should be maintained between the re- 
sponse decrement occasioned by non- 
reward, on the one hand, and the ex- 
tinction of an S-R tendency, on the other. 
Clearly, a failure to make the instru- 
mental response following some number 
of nonreinforcements may be a quite 
different matter from a completely dimin- 
ished or even a substantially diminished 
tendency to make the instrumental re- 
sponse in the presence of the originally 
conditioned stimuli, Thus, partial rein- 
forcement is commonly stated to produce 
greater resistance to extinction than. 
consistent reinforcement. 


Another views» 
based on the present findings, is tha 
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the consistently reinforced group experi- 
ences greater generalization decrement 
than the partial group and so runs slower 
in the presence of nonreinforcement (re- 
sponse decrement) while suffering, as 
indicated above, less diminution in, the 
strength of certain of the S-R tendencies 
established in acquisition (extinction). 

The present results contain the impli- 
cation that shifting consistently rewarded 
and partially rewarded groups to non- 
reward literally cannot allow a deter- 
mination as to whether a consistently 
rewarded habit (SF H R) is more or less 
resistant to extinction than a partially 
rewarded one (SNHR). All that can 
properly be inferred is that nonreward 
following consistent reward produces 
greater response decrement than non- 
reward following partial reward. Infer- 
ences about the resistance to extinction 
of specific S-R connections must depend 
upon equating all to-be-extinguished 
groups for generalization decrement, an 
outcome precluded from the outset by the 
necessary shift of conditions upon enter- 
ing extinction. In the present investi- 
gation it was possible, presumably, to 
equate the to-be-extinguished groups for 
generalization decrement, and therefore 
it seems proper to talk about resistance 
to extinction as a function of AC. But, 
since the statistical analysis failed to 
show a significant AC main effect, it 
would appear that at least under the con- 
ditions of the present investigation AC 
had no effect on resistance to extinction. 
Quite possibly, this may represent the 
general rule; that is, the acquisition con- 
ditions per se may have no effect on 
resistance to extinction per se and dif- 
ferential performance under nonreward 
may always reflect differential response 
decrement occasioned by differential gen- 
eralization decrement. 
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INVERSE FORGETTING IN SHORT-TERM MEMORY * 
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University of Sydney 


As a test of short-term memory, human Ss were shown a pattern of 
letters, then asked to recall it. Stimuli were presented for less than 
1 sec., while the retention interval varied from 1 to 10 sec. The letter 
patterns used were either meaningless, formed words, or formed 


sentences, 
interval. 


There was no intervening activity during the retention 
Accuracy of recall was higher at longer retention intervals, 


in contrast to the usual fall of accuracy with time which is seen when 


retention is measured over a period of minutes or longer. 


Degree 


of meaningfulness of the stimuli did affect accuracy of recall, but there 
was no interaction with the retention-interval effect. 


Events immediately following stimu- 
lus presentation can have a marked 
effect on information retention. These 
events need not be interfering in the 
sense that, on subsequent requests to 
recall the information in the original 
stimulus, S replies with the interfering 
rather than the stimulus item. Rather, 
the interfering items seem to disrupt 
S’s consolidation of his memory trace 
for the original item. Such disruption 
has been demonstrated in peripheral 
information storage by Sperling 
(1960), who virtually destroyed 
“memory” over a period of fractions 
of a second by flashing a white light 
onto that part of the retinal field 
where stimulus information had been 
displayed. Using the paired-associates 
technique, Peterson and Peterson 
(1959) found marked forgetting for a 
single paired-associates item when 
was asked to perform an interfering 


1 This study was supported by the United 
States Public Health Service, National In- 
stitute of Mental Health Grant MH-07567 to 
the University of Sydney. Permission for 
reproduction of this report by the United 
States Government is hereby granted. 

2In conformity with the custom of the 
physical sciences and mathematics, the 
authors’ names are listed alphabetically with- 
out implication of seniority. Earl Hunt is 
now at the University of California, Los 
Angeles, All correspondence should be 
addressed to him. i 


task (counting backwards) immedi- 
ately following stimulus presentation. 

The Peterson and Peterson experi- 
ments have been repeated and ampli- 
fied on in several other studies (cf. 
Melton, 1963). It has been found 
that the effect depends on the number 
of naturally coded “chunks” (e.g., 
words) into which the items to be re- 
membered fall. For instanċe, a pair 
of three-letter words is no harder to 
learn as paired associates than two 
letters, while three two-letter paired 
associates are more like three two- 
word paired associates, Miller (1956), 
on the basis of a review of studies of 
memory over longer periods, suggested 
that one of the crucial determinants 
of memory is the extent to which the 
items to be remembered fall into 
natural “chunks” of information. Both 
the Peterson and Peterson and the 
Sperling results can be looked upon 
as evidence that memory is damaged 
if the chunking process is disrupted. 
In the paired-associates case, how- 
ever, chunking would presumably be 
occurring at a much higher level in 
the nervous system than the level prob- 
ably involved in Sperling’s experi- 
ments. 

If this analysis is correct it should 
be possible to demonstrate inverse for- 
getting, i.e., an increase in the amount 
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of information recalled as a function 
of the time intervening between stimu- 
lus and recall. To do this, one would 
have to set the stimulus-recall interval 
at a short enough time so that the act 
of recall would itself disrupt the fixa- 
tion process.  Requiring a response 
early in the fixation process would re- 
quire S to retrieve an "unprocéssed" 
piece of information, by analogy simi- 
lar to the problem of locating a journal 
in a library before the journal has 
been bound and filed. At some point 
in time, however, we would expect that 
either the fixation process would be- 
come complete or enter a stage at which 
it was protected from interfering 
events. Beyond this point, one would 
find the monotonic decrease in ac- 
curacy of recall over time which is 
normally observed in studies of mem- 
ory. 

In this experiment inverse forgetting 
was, in fact, found during the first 10 
sec. following stimulus presentation. 


METHOD 


Design.—Sequences of letters were exposed 
for a brief period. Then, within 10 sec, S 
was required to reproduce the sequence just 
shown to him. The time interval and the 
extent to which the letter sequences formed 
familiar units were varied. 

To make up the sequences of letters eight 
sentences were composed, each containing 
eight three-letter words (eg, ASK THE 
OLD MAN FOR THE WAY OUT). In 
the sentences condition the sequences of let- 
ters were presented in the order found in the 
sentences, but without spacing between 
words. In the words conditions eight se- 
quences were made up by rearranging the 
words of each sentence to produce a string 
of words which, insofar as we could tell, was 
both ungrammatical and meaningless. As 
before, there was no spacing between words. 
In the letters condition the eight strings 
were further decomposed by rearranging 
the letters within each word so that no three 
consecutive letters formed a word, 

The Ss were divided into three groups, 
Every S in every group saw all 24 sequences 
(eight of each type). The groups differed 
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only in the order in which they saw each 
type of sequence: letters, sentences, words; 
or words, letters, sentences; or sentences, 


words, letters. 


Stimuli were presented for .7 of a second, 
and recall required either 1, 4, 7, or 10 sec. 
later. The actual time of recall was de- 
termined in the following manner. The 
ordér in which sequences were presented 
within a particular type was fixed. The 
recall interval, however, was varied for each 
sequence. There are 24 possible permuta- 
tions of the four times. Each S in a 
group would see the first four items (Order 
I) with recall intervals set according to 
one of these permutations. The next four 
items (Order II) would be displayed with 
the mirror-image order of recall interval 
times. For example, suppose an S had a 
recall interval of 1 sec. for the first item 
shown, 4 for the second, 10 for the third, 
and 7 for the fourth, The fifth item would 
then be shown with a recall interval of 7 
sec, the sixth with an interval of 10 sec., 
the seventh with an interval of 4 sec., and 
finally the eighth with an interval of 1 sec. 

The resulting design is a generalization to 
four variables of Winer’s (1962, p. 319) 
example of a full factorial study with one 
measurement per S on one variable (groups) 
and repeated measures on the other three 
(type of item, recall interval, and order). 
Learning or warm-up effects across types 
are balanced by the design, since each group 
had each type in a different order. Within 
types, learning effects, and the effects of 
using particular sequences of letters are 
confounded in the order variable. The ef- 
fect of presenting retention intervals of 
different length in any particular order is 
confounded with Ss. 

Procedure—The letter sequences used as 
stimuli were block printed and photographed 
on 35-mm. slides, black on white. They 
were projected from behind onto a milk-glass 
screen approximately 2 ft. in front of a 500- 
W. projector. The screen itself was ap- 
proximately 36 in. from S. 

All Ss were seen individually. Each S 
sat in front of the screen, with a pencil and 
& piece of paper in front of him but not 
obstructing his view of the screen. A small 
red light was set to one side of S, but 
clearly in view. Prior to each presentation 
E set the timing apparatus and alerted S. 
The E then activated the apparatus, which 
initiated the following sequence of events. 
(a) The stimulus item was projected on the 
screen for 7 sec. (b) The projector turned 
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off. (c) After the chosen recall interval 
had elapsed from the point at which the 
projector was turned off, the small red light 
came on. This was a signal to S to pick 


up the paper and pencil and reproduce the ~ 


stimulus item just presented. The S was 
instructed to reproduce the stimulus sequence 
beginning at the far left-hand side. Thus 
S’s normal reading habits were utilized. All 
stimulus sequences occupied just one line. 
The S was allowed as much time as he 
wished to write down the items he recalled. 
When he indicated to E that he was through 
his answer sheet was replaced by a blank 
piece of paper and the next trial was begun. 

Responses were scored for the number of 
letters exactly correct and in place, beginning 
with the leftmost letter and continuing to the 
first error. 

Subjects.—Seventy-two first-year psychol- 
ogy students served as Ss as part of a course 
requirement. 


TABLE 1 


Mean No. or LETTERS RECALLED COR- 
RECTLY AS A FUNCTION OF GROUP, 
STIMULUS ORGANIZATION, AND 

RETENTION INTERVAL 


Type of Stimulus Organization 


GROUP I: Sequence: Letters, Words, Sentences 


Retention Mean 
Interval | Letters | Words | sen, | Over Adr 
Seconds) Stimulus 

1 3.98 7.17 6.65 5.93 

4 4.04 8.31 6.81 6.39 

7 4.19 8.39 7.50 6.69 
10 3.96 8.29 7.94 6.73 
M 4.04 8.04 7.22 


GROUP II: Sequence: Words, Letters, Sentences 


1 4.10 7.69 5.46 5.75 

4 3.87 8.10 6.02 6.00 

7 4.12 8.65 5.73 6.17 
10 3.90 8.29 6.00 6.06 
M 4.00 8.18 5.80 


GROUP III: Sequence: Sentences, Words, Letters 


1 [3:527 | EEE DEPO] ese 
i 3.62 | 6.56 | 627 | 549 

7 3.56 | 6.89 | 6.00 | 5.49 
10 3.87 | 819 | 6.50 | 6.19 
M 3.64 | 724 | 5.99 


AM 


TABLE 2 


Mean No. or Letrers CorreECTLY RE- 
CALLED AS A FUNCTION OF STIMULUS 
ORGANIZATION AND RETENTION 
INTERVAL, AVERAGED OVER 


Groups 
*| Type of Stimulus Organization 
Retention E 
Interval | | s 
Letters | Words | (ones, | M. 
1 sec. 3.877 | 7.40 5.76 5.67 
4 sec. 3.85 7.66 6.37 5.96 
7 sec. 3.96 7.98 6.42 6.12 
10 sec. 3.91 8.26 6.81 6.33 
M 3.90 7.82 6.34 
RESULTS 


Summaries of the mean number of 
correct responses are presented in 
Tables 1, 2, and 3. Since a complete 
tabular report of the study would be 
burdensome, it is most economic to 
examine Table 4 first, since it provides 
us with an argument for averaging 
over some of the different effects, in 
order to present a more readable re- 
port in the other tables. 

Table 4 is an analysis of variance 
of the number of letters correctly re- 
ported. As indicated above, the analy- 
sis is based on a generalization from 
Winer’s (1962) recommendation for 
the analogous three-variable case. All 
variables were considered to be fixed 
effects. ‘Four effects reached conven- 


TABLE 3 


No. or LETTERS RECALLED AS A 
FUNCTION OF TYPE AND ORDER 
or ITEM TYPES 


Type of Stimulus Organization 


Order in 
le! 
De. | Letters | Words | S5. | ar 
First half 3.83 7.02 | 6.06 | 5.64 
Second half | 3.96 8.62 | 6.62 | 6.40 
M 3.90 7.82 6.32 


+ 
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TABLE 4 
ANALYSIS OF VARIANCE OF THE No. OF LETTERS CORRECTLY REPORTED 


Error 


MS | F 


Source | df Term 
[ENSEM RC) Pee TD ai zi 
Ern cious E 9451 247 2 
rOups . 
Ss SURE Groups (E) 69 43.53 
Vines (By 3 32.73 4,63** ; 
Times X Groups (AB) 6 6.16 $1687) 
Times X Ss (BE) 207 7.06 
Within Groups 
p 8 
T: C) 2 2264.85 146.90 
‘Type x Groups (AC) 4 39,12 2,54* 8 
Types X Ss (CE) 138 15.42 
Within Groups 
5 1 
Order (D) 1 251.32 46.61** 1 
Order X Groups (AD) 2 8.85 1.64 1 
Order X Ss (DE) 69 5.39 
Time X T: (BC) 6 7.61 <1 (.51) 14 
Time x Type X Groups (ABC) 12 6.80 «1 (.46) 14 
Time X Type X Ss 144 14.90 
Within Groups (BCE) 
Time X Order (BD) 3 6.22 «1 (.36 17 
Time X Order X Groups (ABD) 2 1.68 «1 rt 17 
Time X Order X Ss 72 17.45 
Within Groups (BDE) 
Order X Type (CD) 2 83.25 4,90* 20 
Order X Type X Groups (ACD) 4 2,87 «1 (47) 20 
Order X Type X Ss 18 17.00 
Within Groups (CDE) 
Order X Time X Type 6 4.24 <1 (.54) 20 
Residual 921 7.84 
*p <.05, 
mp So 
tional levels of significance. These are not so low as to raise questions 


were, at the .01 level, retention inter- 
val, type of item, and first vs. second 
half of the experiment, The interac- 
tions between groups and type of item, 
and between type of item and halves 
of the experiment, were significant at 
the .05 level. Of the possible remain- 
ing tests, only the difference between 
experimental groups approached, but 
did not reach, the .10 level of signifi- 
cance, In particular, all interactions 
involving length of retention interval 
and any other variable have F ratios 
of less than one, but, on the other hand, 


about their validity. Since retention 
interval does not interact with other 
variables, a fair picture of the reten- 
tion-interval effect can be obtained by 
averaging over all other conditions. 
This is done in Fig. 1, in which the 
average number of letters correctly re- 
called is plotted against retention in- 
terval. The figure shows that "inverse 
forgetting" does, indeed, occur. The 
number of items correctly recalled 
shows a small, but consistent, increase 
with increase in retention interval 
from 1 to 10 sec, 


1 
k 
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Although the averaging of Fig. 1 
can be justified on statistical grounds, 
it is of interest to examine the data 
in somewhat finer detail. This is im- 
portant because insignificant interac- 
tion effects might be obtained because, 
in a literal rather than a statistical 
sense, they did not exist, or because 
the interaction effects were small rela- 
tive to the variability of the data. The 
highest of the (nonsignificant) F ratios 
for interactions involving retention in- 
terval is .87, for the Retention Interval 
X Group effect. However, from Table 
l it can be seen that the relationship 
between retention interval and ac- 
curacy of recall is nearly identical for 
all groups. Of somewhat more inter- 
est is a possible difference in the effect 
of retention interval depending upon 
the type of stimulus presented, since 
if one suspects that "chunking" is 
somehow involved in memory, the size 
of the chunks might well influence the 
time required for information organiza- 
tion and thus, have the potential for 
influencing any other time-dependent 
short-term memory process. To make 
a positive statement that this did oc- 
cur, one would have to find a signifi- 
cant Type X Retention Interval 
interaction. This was not found. 
Nevertheless, an examination of mean 
accuracy of recall as a joint function 
of type and recall interval, presented 
in Table 2, shows that the effect of 
retention interval on accuracy of recall 
is least for the “letters” condition, but 
nearly parallel for “words” and 
“sentences.” This leads us to suspect 
that in a larger experiment a signifi- 
cant Type X Retention Interval inter- 
action might be obtained, but we are 
not justified in making a stronger 
statement. 

Table 3 depicts recall as a function 
of type of item and half of the experi- 
ment. In terms of absolute effect, the 


MEAN NUMBER OF LETTERS 
Sess. 


+ —-— —-——. 
1 TNT 7 10 
PRESENTATION-RECALL INTERVAL IN SECONDS 


Fic. 1. Mean number of letters recalled as a 
function of presentation-recall interval. 


greatest single influence on recall is 
that of type of item. One would ex- 
pect this to reflect the well-known fact 
that accuracy of perception is a func- 
tion of the degree of organization and 
the familiarity of the stimulus to be 
perceived. It is of some interest to 
note that the most highly organized 
stimuli, sentences, do not lead to the 
greatest accuracy of recall. This ob- 
servation is difficult to interpret, how- 
ever, since the constraint that all 
sentences be composed of three-letter 
words leads to a confounding of stimu- 
lus familiarity and organization, fa- 
miliar words being formed into un- 
usual sentences. This may account for 
the interaction with phase of the ex- 
periment. One interpretation of Table 
3 is that Ss were able to “warm up” 
to the task recognizing familiar words 
in the second half of the experiment. 
Again, this is a tangential point to the 
main purpose of the experiment. 

The discussion thus far has been 
based on an analysis of the number 
of letters correctly recalled in nor- 
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mal reading order. It can be objected 
that this score throws away informa- 
tion, since S may follow an error with 
a string of correct responses. This is 
particularly true in the recall of or- 
ganized material. For instance, if 
the stimulus sequence is ASKTHE- 
OLDMANFORTHEWAYOUT, we 
would score the responses ASKTHE- 
ODDMANFORTHEWAYOUT and 
ASKTHEODYZCYXYGHHGYTF- 
RWS as equivalent, since both are cor- 
rect up to the first seven letters, but 
are incorrect on the eighth. Clearly, 
the two responses in the example are 
not equivalent, the first example is, 
with the exception of one letter, a 
perfect response. To determine 
whether logically possible combina- 
tions of errors and correct responses, 
not reflected in our cruder scoring 
method, did actually occur in our 
data in a way that would affect the 
interpretation of our results, we used 
an alternate method of scoring in this 
experiment and two pilot studies which 
preceded it? In the alternate method, 
S’s responses were scored by a “tem- 
plate matching" technique, in which 
the actual response was measured 
against the correct response, with shift- 
ing of the strings where necessary, in 
order to maximize the number of se- 
quences of correct responses which 
occurred in the response string. To 
continue our example, the response 
ASKTHEXYURSTWAYOUT 


3 The effect was first noticed in a BA 
(Honors) thesis by S. Stumbris in 1963, 
done under Hunt’s direction, Hunt and 
Peak confirmed Stumbris’ results in a subse- 
quent study. Since questions concerning 
apparatus and procedure could be raised 
about both the earlier studies, reports will 
not be published. It should be noted, how- 
ever, that in both cases Statistically significant 
results were obtained, so that the objection 
that selective reporting of this experiment 
involves capitalizing on chance is not 
relevant. 
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would be given credit for 12 correct 
responses, the first and last six letters. 
This is a difficult and necessarily sub- 
jective scoring procedure, since by far 
the commonest error is for $ to omit 
a large number of letters, particu- 
larly toward the end of the string. 
In scoring, then, E must estimate the 
number of letters which S has omitted. 
In spite of this lack of reliability we 
hoped that the alternate scoring 
method would reveal the presence of 
any gross trends which could not be 
detected by the more straightforward 
analysis. In point of fact, no such 
trends were suggested, either in the 
pilot studies (which were analyzed 
completely by both methods) or in the 
current experiment (in which checks 
of the key comparisons were made). 
Retention interval was always a sig- 
nificant variable, in the expected di- 
rection, and none of the interactions 
involving retention interval and other 
variables approached significance. 
Within the context of this study, 
analyses of serial-position curves and 
differences between intrusion and 
omission errors, both traditional meas- 
ures of forgetting, are difficult to in- 
terpret. In two conditions (words and 
letters) S is dealing with highly or- 
ganized material. Thus, if he has 
partial recall of a sequence (e.g, THE 
CAT A—— RAT) he will be 
able to reconstruct a “sensible” ( iex 
highly dominant) sequence of re- 
sponses from the context of the re- 
sponses recalled directly. Whether the 
sequence will be completed, and 
whether it will be correct when com- 
pleted, depends very much on the 
meaningfulness and familiarity of the 
items recalled in context, While we 
have no direct evidence that such be- 
havior occurred in our experiment, 
common sense and a variety of sup- 
porting studies (Attneave, 1959; 
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Garner, 1962) suggest that it did. In 
the absence of detailed independent 
knowledge of S’s speech habits, we did 
not feel that analyses of serial-position 
errors or types of errors were war- 
ranted. This is particularly true since 
our Ss were Australians, whose speech 
habits are somewhat different from 
those of the American students who 
have provided the data for most pub- 
lished tables of word frequency and 
use, 


Discussion 


Since the inverse forgetting phenome- 
non observed here is directly opposite 
to the loss of information over time which 
is usually observed in experimental and 
naturalistic studies of memory, it is im- 
portant to consider both the differences 
between our setting and that of other 
experiments, and various possible theo- 
retical explanations to account for our 
results. Is the phenomenon of general 
interest or is it due to a theoretically 
trivial aspect of our experimental pro- 
cedure? 

In this experiment the list of letters 
to be learned was so long that perfect 
performance was virtually impossible. 
Since our measure of retention was based 
on the absolute number of items recalled, 
rather than the number relative to a per- 
fect score, we do not see how this-could 
have made any „difference. Probably 
more important, particularly when this 
study is compared to those using the 
Peterson and Peterson (1959) paradigm, 
4s the fact that the retention interval is 
unfilled. The S was left free to rehearse 
the stimulus items. This could account 
for our failing to observe a decline in 
information retention, but cannot account 
for our observing a rise, since S cannot 
tehearse what he does not obtain at some 
earlier point. Some additional process 
of information fixation or enhancement 
is required. 

The original motivation for the experi- 
ment came from our conceptualization of 
memory as a series of buffers regulating 
the flow of information into a central 
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storage area. Such an idea is generally 
consistent with the informal models pre- 
sented by Broadbent (1958, 1963) and 
Hunt (1963, 1964), and suggested by 
some of the experimental work of Yntema 
and Trask (1963). Waugh and Norman 
(1965)* have published a detailed two- 
stage model within which the inverse 
forgetting phenomenon can be studied. 
In Waugh and Norman’s model there is 
a single buffer (primary memory) and a 
single central storage area (secondary 
memory). They assume that the proba- 
bility of retention of item i in memory 
[R(i)] is a function of the probability 
that the item is retained in primary mem- 
ory [P(i)] and the probability that it 
is retained in secondary memory [S(i)]. 
The relationship is 


R(i) = PG) + S(i) — Pi) - SC). [1] 


Waugh and Norman present evidence 
showing that in situations in which S(7) 
can be assumed to remain constant, 
R(i) (and by implication P(i)) is in- 
fluenced primarily by the number of 
items presented since item i, regardless 
of the similarity of these items to the 
ith item, Specifically, Waugh and Nor- 
man (1965) account for the data of 
studies using the Peterson and Peterson 
paradigm by assuming that the act of 
counting backwards after stimulus pres- 
entation interferes with retention in pri- 
mary memory. Waugh and Norman also 
suggest that a major function of rehear- 
sal is to aid in retaining items in primary 
memory until they can be transferred to 
secondary memory. 

To fit inverse forgetting into the 
Waugh and Norman model, we must 
make two coordinating assumptions. 
These are (a) that since a fairly com- 
plex response (writing a string of let- 
ters) is required, access to information 
is largely through secondary memory, 
and(b) that the act of responding inter- 
rupts the transfer of information from 


* We completed our work before learning 
of Waugh and Norman's (1965) model. It 
seems to us a good statement of our own 
less formalized ideas. 
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primary to secondary memory. We as- 
sume, then, that when information is pre- 
sented it is maintained in primary mem- 
ory (perhaps by rehearsal) until it is 
either transferred to secondary memory, 
or lost, or until primary memory is dis- 
rupted by the request for a response. 
Unless this request occurs aíter the trans- 
fer is complete, inverse forgetting will 
be observed since the amount of informa- 
tion available in secondary memory will 
be an increasing function of retention 
interval. After the transfer is completed, 
however, any causes of forgetting which 
affect secondary memory will take effect. 
Presumably, these processes act over 
larger time intervals, and account for the 
data obtained in the study of retention 
intervals of a minute or more. 

There is a second interpretation of 
inverse forgetting which, although it also 
postulates a primary memory buffer 
which is maintained by rehearsal, de- 
scribes a quite different process.’ During 
the retention interval S could “fill out" 
items based on contextual information, 
much in the manner we suggested earlier. 
For instance, if S directly recalled (i.e., 
at O-sec. interval) the sequence THECA 
he could progressively expand it to 
THECATATE. If, as Miller (1956) 
has suggested, information is eventually 
stored in "chunks," we could think of 
one of the roles of primary memory 
being to form the chunks from imper- 
fectly received stimuli. If our results 
are, indeed, due to such a combination of 
stimulus rehearsal and expansion, then 
there should be a Retention Interval X 
Item Type interaction. We have already 
indicated that, although no such inter- 
action was found, the data were such that 


* We are indebted to the editorial referee 
for suggesting this explanation. 
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we would be unwilling to say that it was 
not there. To test the hypothesis one 
should try to relate the frequency of 
different types of errors to the speech 
habits of Ss. 
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SEARCH-DISCRIMINATION TIME AND THE APPLICABILITY 


OF INFORMATION THEORY 


ARIE M. OOSTLANDER ax» HANS we SWART 
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In this paper some objections against the application of information 
theory with regard to discrimination tasks are refuted by means of 
comments on and a replication of an experiment by Thomas and 
Solley (1963). In search-discrimination experiments information 
theory does not “predict” a logarithmic relationship between RT and 
stimulus uncertainty because of the instability of the rate of gain of 
information, as already pointed out by Hick (1952). Besides the 
influences of amount of uncertainty, amount of redundancy and form of 
the constraint in the stimulus field were considered. The results of 
our corrected replication turned out to be wholly congruent with 


expectations based on information theoretical research. 


The use of information theoretical 
concepts in psychology has led to 
interesting research, However, some- 
times derivations from this “theory” 
are applied with only superficial ap- 
preciation of the underlying suppo- 
sitions. Hick’s (1952) paper “On the 
rate of gain of information” can be 
a warning against rapid conclusions 
about the relationship between RT and 
task construction. In the analysis of 
choice situations he refers to two ex- 
treme types of decision trees, which 
represent different strategies : 


1. The completely symmetrical bi- 
nary-branched decision tree represent- 
ing the strategy applied in several 
choice tasks. These situations assume 
the existence of an amount of response 
uncertainty, » different responses be- 
ing possible. The response attainment 
requires a sequence of decisions. 
Throughout this process each decision 
(passing a “node” in the "tree") re- 
duces the uncertainty with equal 
amounts (1 bit). The ith decision 
reduces the number of remaining pos- 
sible responses K, to Kui — à Ke. If 
the number of decisions per time unit 
remains the same there will be a linear 


relationship between RT and the 
amount of uncertainty in the situation. 
(RT — a log n t c). 

2. The completely asymmetrical bi- 
nary-branched decision tree represent- 
ing the strategy in systematic-search 
tasks. Again there is some response 
uncertainty and a sequence of decisions 
is required. Now the tth decision re- 
duces the number of remaining possible 
K: to Ki. —K,— 1. So the rate of 
gain of information per decision is not 
the same throughout the process. Only 
in the last node (K,—2) the un- 
certainty reduction will attain 1 bit; 
before that progress will be much 
slower. Each decision reduces the 
number of possible responses by one. 
RT will be a linear function of n. 


So the log function cannot be ex- 
pected to fit data of experimental tasks 
which do not conform to the require- 
ments of (a) the existence of a priori 
response uncertainty, (b) a sequence 
of decisions that give equal rates of 
gain of information. The decision tree 
mentioned meets these requirements 
but the asymmetrical tree consists of 
a sequence of decisions which give un- 
equal rates of gain of information. The 
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applicability of the log function is 
demonstrated in experiments by Hick 
(1952), Archer (1954), Gregg 
(1954), Archer, Bourne, and Brown 
(1955) ; the linear function of RT on 
can be found in search experiments by 
Solley and Snyder (1958) and by 
Thomas and Solley (1963). They 
erroneously consider their results as 
“contradicting predictions from in- 
formation theory," but actually these 
results agree with the expectations 
deduced from information theory by 
Hick. With regard to the experi- 
mental findings that Thomas and 
Solley consider as falsifications of 
earlier results found within the scope 
of research using the information 
Measure, it is necessary to replicate 
their investigations, 

Description of Thomas and Solley's 
experiment.—On sheets of white paper 
figures are mounted that can be charac- 
terized by 1, 2, or 3 three-valued vari- 
ables (form, color, size). Total num- 
bers of stimuli used are 9-1, 18-1, 27-1, 
36-1, 54-1. The fields with randomly 
ordered figures contain one or more 
complete sets of stimuli minus one. 
The S has to search for the missing 
stimulus. Referring to a search-dis- 
crimination experiment by Eriksen 
(1952) which was in fact inconclusive, 
Thomas and Solley predict an increase 
in RT with increasing number of 
target-defining variables, This predic- 
tion is further founded in results of 
Archer's (1954) classification experi- 
ment. However, the structure of this 
classification process is rather different 
from that of a search-discrimination 
task. Thomas and Solley’s task re- 
quires mere counting of stimuli belong- 
ing to a homogeneous class and check- 
ing the total of this group against other 
totals until the group containing less 
figures than the others is found. In 
this process an increase of diversity in 
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the stimulus field obstructs the search 
for the following figure to be counted: 
there is a smaller probability that the 
next figure is of the required class (in 
information theoretical terms: there is 
more uncertainty in the search field). 
The nonsystematic relationship of num- 
ber of defining variables to RT in 
Thomas and Solley’s data is therefore 
an unexpected result. Possibly this 
effect is due to a lack of appreciation of 
the concept of “redundancy” (Eriksen, 
1957). Simple redundancy reduces 
the diversity of the stimulus field so 
that the search process can be repre- 
sented by a simple decision tree. If 
higher-order constraints exist, then the 
search tree becomes more complicated. 
This difference is found in the triple 
classification problem. In the nonre- 
dundant triple-classification problem 
sometimes a higher-order constraint is 
necessary: when figures vary inde- 
pendently along three dimensions over 
three values the number of different 
possibilities is 27. A “complete” set 
of 9, 18, or 36 figures can then be con- 
structed by use of interactional con- 
straint of the third degree if one does 
not want to introduce simple redun- 
dancy. For example, when the number 
of figures on the card is 18 minus 1, 
the problem consists of 6 blue figures : 
a small and medium diamond, a small 
and large square, and a medium and 
large circle; 6 red figures: a small and 
large diamond, a medium and large 
Square, and a small and medium circle ; 
6 green figures: a large diamond, a 
small and medium Square, and a small 
and large circle. In this problem the 
medium green diamond is the missing 
one. This “constrained” three-way 
classification was used in Problem III’; 
the completely nonredundant classifica- 
tion is denoted ITI, Simple redun- 
dancy not only reduces the diversity in 
the stimulus set but also heightens the 
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signal to noise ratio and so facilitates 
identification. 

In our experiment we will vary: (a) 
the number of nonredundantly used 
target-defining variables and we predict 
a positive relationship of RT and 
amount of uncertainty in the search 
field, (b) the number of redundantly 
used target-defining variables and we 
predict a negative relationship of RT 
and simple redundancy, (c) the form of 
the constraint in the stimulus set and 
we predict increase in RT when higher- 
order interactions are introduced. 


METHOD 


Stimulus material.—This is comparable to 
the material used by Thomas and Solley. 
On sheets of white cardboard (50 X 35 cm.) 
instances, cut out of colored paper, were 
mounted, which could be characterized by 
three three-valued variables—color (C): 
red, green, blue; form (F): circle, square, 
diamond ; size (S) : 9, 4, 1 cm’, called large, 
medium, small, respectively. The number 
of variables held constant was 2, 1, or 0 
resulting in single- (I), double- (II), or 
triple- (III) classification problem. This 
corresponds to the generation of 3, 9 and, 
for some triple-classification tasks, 27 differ- 
ent categories of stimuli if the variables are 
nonredundantly used. Each category ex- 
cept one was represented by the same number 
of instances. In the incomplete group one 
stimulus was missing. Total number of 
stimuli (N) used was 8, 17, 26, 35, or 53 for 
all nonredundant problems, and 8, 17, 26, or 
35 for redundant tasks. The set of problems 
used is shown in Table 1. 

Subjects—The group consisted of 40 
freshmen (15 females, 25 males). They 
were randomly divided into two groups. 
Each group worked on half of the stimulus 
material. Both parts of the material were 
equal in difficulty (underlined ys. not-under- 
lined tasks in Table 1). Only the nonredun- 
dant triple-classification tasks were per- 
formed by all Ss (in Table 1 these tasks are 
double underlined). 

Procedure—The Ss were tested individ- 
ually. They were seated at approximately 
1.50 m. from the stimulus field. Before the 
experiment S's were trained in correct label- 
ing of the value of the three defining vari- 
ables. The following instructions were 
given; 
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TABLE 1 
SCHEME OF THE S MATERIAL 
Total Number of Stimuli (N) 
Class. | Variables 
8 |17 | 26 | 35 | ss 
I [el £L oig Ur an e e 
F zu bri He eel br 
S EM oR BEIDE] ERR RT 
Il CR ia te a dl eh 
cs fed p PA reyes eel) 
F-S dud alot A T 
Ill | C-F-S H H 
HP | CFS |1]|1 il 
Ih CF 1 jl 
cs 1 
FS 1 
n CF-S 1 
CS-F 1 1 
FS-C 1 
A EESC 212381 Vest opt] 


Note.—C-F means the dimensions color and form are 
nonredundantly used, i.e, varied independently. 
CF-S means the dimensions color and form are re- 
dundantly used but do not correlate with size. 


I'll show you several cards, with a num- 
ber of figures on each of them. The set 
is formed according to a certain principle. 
Every figure can differ in form, color, and 
size. On every card there is only one 
figure missing. I ask you to define the 
missing one as soon as possible. If your 
answer is wrong or incomplete, I'Il tell 
you, and you continue until you have 
given the correct answer. Do you under- 
stand this, or is there still any question? 


(If there were questions, these were an- 
swered.) After this E continued: “Here is 
an example; which one is missing?” No 
matter whether the answer was right or 
wrong, E explained the problem (no more 
practice was given). After this E said: 
*We will start; try to find the missing fig- 
ure as soon as possible. Please don't guess." 

(If S did not meet this requirement, 
this datum was eliminated.) The order of 
presentation of the sequence of search tasks 
was different for each S so as to counter- 
balance the effects of training. Presentation 
of the stimulus field was continued until a 
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complete, right answer was given. RT was 
measured to the nearest whole second with 
an ordinary stopwatch. 


RESULTS AND Discussion 


The heterogeneity of variance was 
effectively reduced by use of the 
transformation rule y = 10* (1 — x3) ; 
(x = measured RT, y= transformed 
RT). The two experimental groups 
did not differ significantly in perform- 
ance on the common search tasks (the 
nonredundant triple-classification prob- 
lems), which can be seen as a check on 
the comparability of the two groups. 
Table 2 gives in seconds the means 
derived from the means of the trans- 


TABLE 2 
MEAN RTs IN THE SEARCH PROBLEMS 


Number of Stimuli 


Tasks 
8 17 26 35 53 

I 3.54 | 10.01 | 19.61 | 25.62 | 38.07 

I 6.40 | 16.21 | 16.50 | 23.64 | 54.41 
II 34,88 66.11 
HI | 17,61 | 92.10 140.43 

Ih 3.81 | 7.68 | 13.79 | 20.65 
Itl; 6.55 | 12.28 | 20.58 | 24.72 
Hn 3,66 | 8.37 | 13.04 | 16.55 


formed RTs on the different search 
tasks. Within problem types RT can 
be considered as a linear function of N, 
which is in accordance with Hick’s 
predictions and the results of Thomas 
and Solley’s experiment, Analysis of 
variance demonstrated significant dif- 
ferences between comparable tasks of 
the three nonredundant problem types 
in the predicted direction (see Table 
3). The significant interactions are 
probably due to the scale used. Fig- 
ure 1, however, shows a divergence of 
the lines fitted to the RTs on the three 
problem. types, indicating an increas- 


1P. G. Pelle kindly advised us in the 
construction of this transformation rule, 


TABLE 3 
ANALYSIS OF VARIANCE OF NONREDUNDANT 
PROBLEMS 
Tasks | Source of Variation | df F 
-II |N (8, 17, 26, 35, 53: 4 | 429.55 
Tt K i ; 1 46.429 
NXK 225.469 
Error (Mean Square) | 584* | (335118) 


1 55.60** 
1 38.88% 
1 


N XK 
Error (Mean Square) | 1924 


I-III |N (26, 53) 
K 


(411401) 
II-III |N (26, 53) 1 99.23 %4* 
K 1 25.06 
NXK 1 9,17** 
Error (Mean Square) | 193^ (514569) 


^ These numbers of df's should be, respectively, 
590, 196, and 196. The difference is due to the elimina- 
tion of the yay few guessing data, 
<.01. 


*5 <.001. 


ing difference in difficulty with increase 
of N. Analysis of variance of the RTs 
on search tasks containing the same 
amount of diversity among the stimuli 
indicates a facilitation of the task with 
increasing redundancy (see Table 4). 
This tendency reaches significance for 
the comparisons I-II, and I-III, 
Again the interactions are a result of 
the scale used. Figure 2 shows, how- 
ever, that for the I-II, and I-III, com- 
parisons the facilitation effect of re- 
dundant forms increases with the num- 
ber of stimuli in the search field. This 
is in accordance with similar findings 
as to the effect of redundancy (Garner, 
1962). As to the form of constraint in 


NEAN RT (IN SEC) 


NUMBER OF STIMULI 


Fic. 1. RT asa function of N for the 
nonredundant problems. 
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TABLE 4 


ANALYSIS OF VARIANCE OF NONREDUNDANT 


vs. REDUNDANT PROBLEMS 


Tasks Factors df F 

I-II: |N (8, 17, 26, 35) 3 248.178 
K 1 4.11* 
N XK 3 8.4354 
Error (Mean Square) | 3118 (470979) 


I-III: | N (8, 17, 26, 35) 
K 


i 
N XK 3 
Error (Mean Square) | 310: 


241.36 
8.69%" 


3.13* 
a | (482458) 


Ih-IIs 2 (8, 17, 26, 35) 3 93.61*** 
1 
NXK 3 1.13 
Error (Mean Square) | 151% (459289) 
II-III: | N (8, 17, 26, 35) 3 288.51 *** 
K 1 0.63 
NXK 3 11,85 
Error (Mean Square) | 309» | (157309) 


* These numbers of df's should be, respectively, 
312, 312, 152, and 312. The difference is due to the elimi- 
nation of the very few guessing data. 


the stimulus set it can easily be seen 
from Table 2 that the III’ problems 
belong to a more difficult type than the 
III problems. Analysis of the search 
process shows that for these problems 
a different strategy is required. While 
in general counting within homogene- 
ous stimulus groups in which the search 
field can be divided demands the least 
effort (least mean number of stimuli 
to be counted and least memory load), 
the tasks of Problem Type III’ can 
best be attacked by counting the num- 
ber of figures that are characterized by 


MEAN RT(IN SEC) 


NUMBER OF STIMULI 


Fic. 2. RT as a function of N for the 
problems consisting of three groups. 


MEAN RT (IN SEC) 


NUMBER OF STIMULI 


Fic. 3. RT as a function of N for the 
problems consisting of nine groups. 


each single dimensional value. This 
results necessarily in a relatively 
greater number of stimuli to be counted 
than in other cases. So, generally, 
each figure occurs in only one place 
in the search tree, but in the cases 
mentioned above each figure can be 
found in three places in the search tree, 
which results in longer RTs. Whereas 
for N=8, RT is not yet very high, 
RTs for N = 17 and 35 are even sig- 
nificantly greater than those for N 
= 53 in the Simple Type III task. 
Again this is in accordance with similar 
findings as to the difficulties of tasks 
with higher-order constraints (Gar- 
ner, 1962). 


In the analysis of this search experi- 
ment the findings about the relation be- 
tween RT and the construction of the 
task are in accordance with predictions 
from research which uses the information 
measure. RT is positively related to 
amount of uncertainty in the stimulus 
set. RT is negatively related to the 
amount of simple redundancy introduced 
whereas constraints of higher order ob- 
struct the search process. Besides the 
uncertainty analysis, search trees can be 
constructed for the respective types of 
problems. Our experimental results are 
also in accordance with expectations 
(Hick, 1952), based on the structure of 
these trees. It is presumed that the 
search tree is a good approximation of 
the behavior processes in a search task. 
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INTRADIMENSIONAL AND EXTRADIMENSIONAL SHIFTS 
AS A FUNCTION OF AMOUNT OF TRAINING AND 
SIMILARITY BETWEEN TRAINING AND 

x SHIFT STIMULI * 


NORMAN UHL 
Emory University 


A 2X22 factorial design was employed to investigate whether 
similarity between training and shift stimuli affects the ease of solving 
IN and EX shifts under 2 different levels of training. The results 
using 172 college students as Ss were: (a) the IN shifts took fewer 
trials to reach criterion than the EX shifts; (b) the IN shift using 
identical stimuli to those used in training took fewer trials to reach 
criterion than the IN shift using different stimuli; (c) the EX shift 
using identical stimuli on the previously relevant dimension took more 
trials to reach criterion than the EX shift using different stimuli; and 
(d) overtraining facilitated the learning of both IN and EX shifts. 
These results were explained by mediational theory and stimulus 


generalization. 


According to a mediational theory 
such as presented by Kendler and 
Kendler (1962), adult Ss acquire a 
mediational response during training 
which is appropriate to all stimuli along 
the relevant dimension. In an intra- 
dimensional (IN) shift, for correct re- 
sponding on the transfer task (shift), 
S must respond to the same dimension 
that was relevant during training but 
reverse his response along this dimen- 
sion. Thus, the stronger the media- 
tional response, the easier will be an 
IN shift. In contrast, an extradimen- 
sional (EX) shift is hindered by a 
strong mediational response formed 
during training because the previously 
relevant dimension is no longer rele- 
vant while a previously irrelevant di- 
mension becomes relevant. Therefore, 
for predicting the ease of solving IN 
and EX shifts, the strength of the 
mediational response is an important 
variable to be considered. 


iThe author is especially indebted to 

Matthew Yarezower for his advice and en- 
couragement through the course of this 
investigation, 


Kendler and Woerner (1964) found 
that by weakening the mediational re- 
sponse formed during training through 
nonreinforcement, the IN shift did be- 
come more difficult while the EX shift 
became easier. Perhaps another way 
of affecting the mediational response is 
through generalization decrements. 
The effect of stimulus changes has not 
been considered in experiments com- 
paring IN and EX shifts. Most of 
these experiments use the same stimuli 
on the shift as in training. Some 
studies (Furth & Youniss, 1964; Shepp 
& Eimas, 1964) that have used differ- 
ent stimuli did not include EX and IN 
shifts employing the same stimuli dur- 
ing the training and shift. They, 
therefore, do not permit the compari- 
son of the ease of solving IN and EX 
shifts when the similarity between 
training and shift stimuli is system- 
atically varied. 

This experiment attempts to investi- 
gate whether similarity between train- 
ing and shift stimuli affects the ease of 
solving IN and EX shifts under two 
different levels of training. 
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METHOD 


Subjects —The Ss were 172 undergraduate 
students enrolled in an introductory psychol- 
ogy course. Twenty-four of these Ss were 
eliminated, 9 for failure to reach the training 
criterion and 15 for failure to follow 
instructions. 

Material—A 35-mm. slide projector was 
used to project the stimuli on a screen. The 
training stimuli were slides of an “F” and 
a "V" that varied in size (small or large), 
color (green, brown, or orange), and posi- 
tion (left or right). The transfer (shift) 
stimuli were slides of an ^F" and a "V" or a 
"G" and a “W” that varied in size (small or 
large), color (brown or orange), and posi- 
tion (left or right). All slides with the 
exception of those used on nonoutcome trials 
(trials in which S' was not told whether his 


TABLE 1 


EXPERIMENTAL DESIGN 


Transfer Task 


IN Shift 
(Form Continues 
to be Relevant) 


Amount 
of Train- 
ing on 
Task 1a 


EX Shift 
(Color Becomes 
Relevant) 


Fv 


Group 1 
Group 2 


G-W F-V G-W 


Group 3 | Group 5 | Group 7 
Group4 | Group 6 | Group 8 


16 Trials 
48 Trials 


Note.—For all groups during training the irrelevant 
dimensions were color, position, and size. The relevant 
dimension was form using the specific stimuli F-V. 


response was correct) were constructed so 
that during training and testing all four 
dimensions were mutually orthogonal. A 
standard two-choice answer sheet was used 
to record Ss’ responses, 

Procedure—A 2X 2X2. factorial design 
was employed to investigate the effects and 
interactions of two different amounts of 
training and two different degrees of similar- 
ity between training and shift stimuli on the 
ease of solving both IN and EX shifts, 

A pretraining sample problem consisting 
of the presentation of 12 slides was given 
all Ss.. The Ss were told that one of four 
dimensions (color, size, form, or position) 
would determine the correct response at any 
one time. An answer sheet consisting of 
two empty blocks per number was given to 
each S with the instructions to make an “X” 
in the left box next to the number of the 
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stimulus presentation if he thought the stimu- 
lus on the left side of the screen was correct, 
or an “X” in the right box if he thought the 
stimulus on the right side of the screen was 
correct, The Ss were told that after they 
recorded their responses, the slide would 
again be presented and a beam of light would 
be directed on the correct stimulus. The 
training discrimination was then given. 

As illustrated in Table 1, four groups 
received 16 training trials which consisted 
of 12 outcome trials in which the form 
dimension was rewarded (specifically F) 
and four nonoutcome trials in which no feed- 
back was given to S. Before presenting the 
nonoutcome trials the following instructions 
were given: 


There will now be a test of how much you 
have learned thus far. During the next 
four presentations, I will not indicate 
which figure is correct. Try to get 100% 
correct on these next four trials. 


After presenting the nonoutcome trials, the 
statement, “The opportunity for additional 
training will now be resumed,” was given. 
Two of these four groups (1, 3) were 
then given an IN shift for 16 trials, Group 1 
receiving on the form dimension the same 
stimuli as were used in training (F-V), 
Group 3 receiving different stimuli (G-W). 
Two other groups (5, 7) were given an EX 
shift for 16 trials, again, Group 5 receiving 
on the form dimension the same stimuli as 
in training (F-V), Group 7 receiving differ- 
ent stimuli (G-W). These same shifts were 
given to four additional groups (2, 4, 6, 8) 
Which differed from the above four only in 
that they had received 48 training trials. 
These 48 trials consisted of the 16 training 
trials repeated three times. Two additional 
groups (9, 10) received 16 training trials. 
After training, these two groups were in- 
structed to try to obtain 100% correct on the 
next eight trials. They received eight non- 
outcome trials which varied on the dimen- 
sions, color, size, form, and position. Three 
of the dimensions, color, size, and position, 
were the same for both groups. However, 
for Group 9 the specific form stimuli were 
E and *V" and for Group 10, they were 
G" and ^W." These two groups were used 
to test whether Ss would generalize to the 
new form stimuli without receiving any feed- 
back as to the correctness of their responses. 
The experiment was designed in such a 
Way that the stimuli on each irrelevant di- 
mension of the EX and IN shift received an 
equal amount of partial reinforcement. 
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TABLE 2 
MEAN TRIALS TO CRITERION ON TAsK 1 


Transfer Task 
IN Shift EX Shift 
Amount of (Form Continues to be Relevant) (Color Becomes Relevant) 
Training on 
Task 1 
F-V G-W F-V G-W 

M SD M SD M SD M SD 
16 Trials 7.2 2.61 7.3 2.53 8.4 4.78 7.5 2.07 
48 Trials 6.9 1.77 84 3.44 7.3 1.42 8.1 3.10 


RESULTS AND Discussion 


The data for all Ss who were not 
correct on the four nonoutcome trials 
preceding the shift were eliminated 
from the analysis since the shift for 
these Ss would not be the same as for 
the other Ss in the same group. Only 
nine of 172 Ss were eliminated because 
of a failure to meet this training re- 
quirement. The Ss making up each 
group were randomly assigned to the 
treatments. 

In order to ascertain whether any of 
the groups differed on the training task, 
the number of trials to obtain four cor- 
rect consecutive responses was deter- 
mined. Table 2 reports the means and 
standard deviations of the different 
groups. An analysis of variance indi- 
cated nonsignificant differences (F <1) 
among the groups. 


The means and standard deviations 
of the number of trials to criterion 
(four correct consecutive responses) 
on the different types of shifts are 
given in Table 3. A 2X22 fac- 
torial design was employed to deter- 
mine differences in ease of learning the 
transfer task. In order to obtain 
homogeneity of variance, the data were 
subjected to a logarithmic transforma- 
tion. These results indicated that: (a) 
the groups receiving 48 training trials 
took fewer trials to criterion than those 
receiving 16 training trials, F (1, 111) 
=5.14, p<.05; (b) the IN shift 
groups took fewer trials to criterion 
than the EX shift groups, F (1, 111) 
= 65.71, p < .001; and (c) there was 
an interaction between type of shift 
and shift stimuli, F (1, 111) = 16.21, 
p <.001. 


TABLE 3 
MEAN TRIALS TO CRITERION ON TASK 2 


Transfer Task 
ii EX Shift 
Amount of (Form Cobtiesto be Relevant) (Color Becomes Relevant) 
Cun on 
hut F-V G-W FV G-W 

M SD M SD M SD M SD 
16 Trial y 2.90 7.3 2.33 11.1 3.73 9.0 2.33 
48 dati $5 1.09 6.1 1.10 10.9 3.16 7.5 1.70 


432 


Previous studies in which adult Ss 
have been used have consistently found 
IN shifts easier than EX shifts. The 
data of this experiment agree with 
these results. According to mediational 
theory, during training (Task 1) the 
reinforcement of the form dimension 
elicits cue-producing form responses 
and resultant cues, which then mediate 
the instrumental response of selecting 
“F?” An IN shift would be easier 
than an EX shift since the cue-pro- 
ducing responses and resultant cues 
would remain correct; only a shift in 
the instrumental response is neces- 
sary. In an EX shift, in addition to 
learning a different instrumental re- 
sponse, different cue-producing re- 
Sponses and resultant cues must be 
learned. Thus an EX shift can be 
considered more difficult for either of 
two reasons: (a) two, rather than 
one, new associations must be formed, 
and/or (b) the mediating response 
stimulus is more difficult to extinguish 
than is an overt instrumental response. 
The possibility of explaining the above 
result in terms of the color concept 
being more difficult than the form con- 
cept is minimized, since a pilot study 
indicated that there was no difference 
in the speed of learning the form or 
color concept used in this experiment. 

The interaction between type of 
shift and shift stimuli is of major in- 
terest. Figure 1 shows this interaction. 
A t test to compare the two means of 
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the EX and IN shift groups indicated 
that the groups receiving an EX shift 
using the G-W shift stimuli reached 
the criterion in fewer trials than the 
groups receiving the F-V shift stimuli 
($ «.005). For the IN shift, the 
opposite trend is noted in Fig. 1 and is 
significant (p < .05). 

It was hypothesized that the medi- 
ating form response developed during 
training might be weakened by gen- 
eralization decrement when similar but 
not identical form stimuli were used 
in the transfer task. If different but 
similar stimuli on the form continuum 
were presented in the transfer task 
(Task 2) as was done for Groups 3, 
4, 7, and 8, it would be expected that 
the new stimuli would also elicit the 
mediating form response, but at a re- - 
duced strength. Since the mediating 
form response developed during train- 
ing aids in the IN shift and must be 
discarded in an EX shift, the weaker 
this response, the easier is an EX shift 
and the more difficult an IN shift. 
Figure 1 shows that when the same 
training and shift stimuli are used, 
an IN shift is much easier than an 
EX shift. However, when “G” and 
<W” replaced the original “F” and 
“V” stimuli on the shifts, the IN shift 
was still easier than the EX shift, but 
the difference was not as great. This 
Supports the previously stated predic- 
tion and is explained by considering 
the “F” and “V” as generalizing to 
the stimuli “G” and “W” which would 
elicit the mediating form response, but 
at a lesser strength characteristic of 
stimulus generalization, thus aiding the 
solution of an EX shift and hindering 
the solution of an IN shift. 

Kendler and Woerner (1964) used 
nonreinforcement to reduce the strength 
of the mediating response, while in the 
present experiment the mediating re- 
Sponse was weakened by generalization 
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decrement. The data from both these 
experiments illustrate that weakening 
the mediating response makes the IN 
shift more difficult and an EX shift 
easier. 

The results of the two groups that 
received the eight nonoutcome trials 
also illustrate the effect of stimulus 
generalization. On the nonoutcome 
trials, the groups that received the same 
stimuli on the relevant dimensions as 
were used during training, made all 
responses to “F.” In the group that 
was given eight nonoutcome trials 
using different stimuli along the rele- 
vant dimension (G-W instead of F-V), 
the form dimension remained as the 
relevant dimension for 10 of the 15 Ss. 
Eight of these 10 Ss who responded 
to a form dimension chose the specific 
stimulus “G” rather than “W.” Using 
the binomial, the probability of eight 
or more Ss choosing the “G” stimulus 
by chance is .05. This supports the 
view that Ss trained to respond to 
"F" generalize their responses to the 
stimulus, “G.” 

The data from the experiment indicate 
that the effect of the strength of the 
mediating response on IN and EX shifts, 
as influenced by stimulus generalization 
between the training and shift stimuli, 
should be considered when comparing the 
ease of solving IN and EX shifts, a fac- 
tor which has not been considered in 
prior experiments. The vast majority 
of studies used the same stimuli on the 
shift as in training. Those few studies 
that did use different stimuli did not in- 
clude EX and IN shifts employing the 
same stimuli during the training and 
shift. 
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It should also be noted that in some 
prior experiments (Stevenson & Weir, 
1959; Youniss & Furth, 1964) only one 
dimension is used on the training and 
shift problems. For example, Youniss 
and Furth's Ss were required to associ- 
ate blue, yellow, and black with three 
different responses and then were given 
what was referred to as an EX shift. 
The children were to associate three dif- 
ferent colors (red, silver, green) with 
the same three responses learned during 
training. Since no irrelevant dimension 
was included during training, a true EX 
shift was impossible. 

The results of this study indicate that 
more refined analysis in terms of stimu- 
lus attributes should be part of experi- 
ments that compare IN and EX shifts. 
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Vanderbilt University 


The effects of threat of shock on verbal conditioning were investigated 


for 60 undergraduate males who 


were randomly assigned to Threat 


and Nonthreat groups. Ss in the Threat group were told they would 


receive strong electric shocks; shock was never given. 


All Ss were 


reinforced with “Good” for sentences beginning with HE or THEY; 


shock by getting E to say “Good,” 


According to Lazarus, Deese, and 
Osler (1952), psychological stress can- 
not be defined in terms of stimulus and 


Tesponse operations alone. Lazarus et 
al. (1952) state, 


because people differ in motivations and in 
the ways they deal with them, it is never 
teally possible to define a general stress 
Situation. The situation will be more or 
less stressful for the individual members of 
the group, and it is likely that these differ- 
ences in the meaning of the situation will ap- 
pear in terms of performance [p. 294, italics 
added]. 


Although Lazarus et al's review of 
the literature on psychological stress is 
widely cited, most investigations of the 
effects of stressor stimuli on behavior 
still fail to consider individual differ- 
ences in Ss’ interpretation of the mean- 
ing of such stimuli, 


1 This research was Supported in part by a 
grant (HD 947) to the first author from the 
National Institute of Child Health and 
Human Development, United States Public 
Health Service. We are grateful to Richard 
C. Ratliff and Adele Spielberger for serving 
as judges. 

2? Now at the National Institute of Mental 
Health, Bethesda, Maryland. 


The use of electric shock for in- 
ducing psychological stress is widely 
accepted, but the results of investiga- 
tions of the effects of shock on verbal 
behavior are contradictory and difficult 
to interpret. After reviewing a num- 
ber of experiments involving shock, 
Spence and Spence (1966) have re- 
cently concluded that, 


no simple empirical or theoretical statement 
about the influence of shock can be made and 


in Which shock or anticipation of shock is 


In terms of their emphasis upon in- 
structions and other factors which in- 
fluence Ss’ interpretation of the ex- 
perimental situation, Spence and 
Spence appear to be in essential agree- 
ment with Lazarus et al, (1952). 

The purpose of this experiment was 
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to evaluate the effects of threat of 
electric shock on verbal conditioning. 
While the effective modification of 
verbal behavior by operant condition- 
ing procedures has been convincingly 
demonstrated (see reviews by Krasner, 
1958; Williams, 1964), it has been 
shown in recent studies that perform- 
ance gains in verbal conditioning are 
limited to Ss who verbalize a correct 
response-reinforcement contingency and 
who are motivated to act on such 
knowledge (DeNike, 1964; Dulany, 
1962; Spielberger, 1962, 1965; Spiel- 
berger & DeNike, 1966). Therefore, 
in the present study it was expected 
that threat of shock would influence 
performance only for those Ss who 
verbalized a correct contingency and 
who also indicated that they believed 
acting on their awareness would help 
them to avoid shock. 


METHOD 


The Ss were 60 male undergraduates in 
introductory psychology at Vanderbilt Uni- 
versity who volunteered for a “sentences” 
experiment. They were randomly assigned 
to Threat (N = 40) and Nonthreat (N = 20) 
groups. Conditioning procedures were 
carried out in a small room containing two 
tables, two chairs, and a Grass multichannel 
polygraph. The S and E occupied the 
chairs and faced each other across a small 
table. A transformer and other electrical 
equipment rested on the second table located 
behind S. After a few moments of casual 
conversation, E, a male graduate student, 
read the following instructions : 


We are doing a study on verbal behavior 
as reflected in how people make up 
sentences. I will show you some cards 


3 Half of the Ss in each group scored 8 
or lower on the Taylor (1953) Manifest 
Anxiety scale and half scored 21 or higher 
on the MA scale. Continuous measures of 
heart rate were also recorded throughout 
the conditioning trials. However, since no 
statistically significant relationships were 
found between the MA scale, heart rate, and 
conditioning, the MA scale and heart-rate 
data will not be reported. 
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like this one (E displayed a 3X5 in. 
white card on which the pronouns, 1, WE, 
HE, and THEY, and a past tense verb were 
typed). Each card will have a word in 
the center and four words on a line above 
it. Your task is to make up a sentence 
containing the word in the center and be- 
ginning with any of the words on the line 
above. After you have made up a group 
of sentences, we will pause for a moment 
and I want you to write down your 
thoughts about the experiment in the space 
provided on the recording sheet in front 
of you. Do not state your thoughts aloud, 
write them down. We are also interested 
in the relationship between your perform- 
ance in making up sentences and your 
blood pressure. In order to measure your 
blood pressure, we will put this little 
gadget on your ring finger. Place your 
hand in a comfortable position and try not 
to move it. (Second E attached a ring 
plethysmograph. ) 


The E then presented 100 cards, one at a 
time, in the same order. Each sentence 
constructed by S defined a trial. Following 
each block of 10 trials, E paused and, if 
necessary, reminded S to record his 
“thoughts.” ^ No reinforcement was given 
during the first two trial blocks which served 
to determine operant rates for the pro- 
nouns. For Trials 21-100, Ss were rein- 
forced with “Good” for sentences beginning 
with either HE or THEY, depending upon 
which was nearest the chance level of 2.5 
responses per trial block in the operant 
period. After the operant trials, Ss in the 
Threat group were given essentially the fol- 
lowing instructions: 


That's fine. Now we want to see how a 
strong electric shock will affect the rela- 
tionship between a verbal task and your 
blood pressure. During the remainder of 
the experiment you will receive several 
strong shocks. Although the shocks you 
receive may be painful, they will not harm 
you. After we attach the shock electrodes 
to your leg, you will perform the same task 
as before. . . . Incidentally, it may be 
possible for you to avoid the shocks 
altogether. 


The first E tallied S’s operant rate for HE 
and THEY while the second E attached the 
“shock” electrodes and explained how the 
various pieces of electrical equipment pro- 
duced a “painful but safe” shock. For the 
Nonthreat group, the first E paused for a 
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period of time approximately equivalent to 
that required to give the threat instructions. 
Shock was never administered to either 
group. At the end of the conditioning 
period, all Ss were interviewed by the second 
E according to an interview schedule adapted 
from previous verbal conditioning research 
(Spielberger, 1962). For the Threat group, 
the interview served to evaluate Ss’ reactions 
to threat of shock. Upon completion of the 
interview, each S was asked if he had heard 
about the experiment before Participating 
in it; none reported prior knowledge. 


RESULTS 


Ratings of awareness—Awareness 
was inferred from (defined by) the 
notes which Ss recorded during con- 
ditioning. These were rated inde- 
pendently by two judges who had no 
contact with Ss. The judges were in- 
structed to rate Ss aware if a correct 
pronoun-reinforcement contingency was 
recorded, and to indicate for each 
aware S the trial block on which this 
contingency was first recorded, There 
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was perfect agreement between judges 
in rating 39 Ss aware and 21 unaware, 
and agreement for 35 of the aware Ss 
as to the specific trial block on which 
the contingency was recorded. Dis- 
agreements between judges were rec- 
onciled by a third judge. Of 40 Ss 
in the Threat group, 26 (65%) were 
rated aware; 13 (65%) of 20 Ss in the 
Nonthreat group were rated aware. 
Thus, threat of shock did not influence 
the percentage of Ss reporting the cor- 
rect contingency, 

Effects of threat of shock and aware- 
ness on conditioning.—The condition- 
ing data were evaluated by an analysis 
of variance in which Awareness, 
Threat, and Trial Blocks were the in- 
dependent variables in a 2x 2 x 10 
mixed design (Lindquist, 1953). This 
analysis yielded a significant Aware- 
ness X Trial Blocks interaction, F 
(9, 504) =5.82, p< 001, and main 
effects of Awareness, F (1, 56) = 
1227, p < .001, and Trial Blocks, F 
(9, 504) — 7.59, b «.001. None of 
the F tests involving the Threat 
variable were significant. The Aware- 
ness X Trial Blocks interaction is 
presented in Fig. 1A in which the 
conditioning data for successive blocks 
were combined to give more stable 
Curves. Aware Ss showed a marked 
Increase in mean percentage of rein- 
forced-pronoun sentences over trials in 
Contrast to unaware Ss who continued 
to respond at about the same level as 
during the operant period. 

The possibilty that aware Ss showed 
performance gains before they recorded 
a correct contingency in their notes is 
examined in Fig, 1B which presents 
the mean percentage of reinforced- 
Pronoun sentences per trial block given 
during operant, preawareness, aware- 
ness, and postawareness trials. For 
this analysis, 11 aware Ss who re- 
corded the correct contingency on 
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Block 3 were excluded because these 
Ss had no preawareness data. It may 
be noted in Fig. 1B that: (a) no per- 
formance gains occurred during pre- 
awareness trials; and (b) a marked in- 
crease in performance occurred on the 
trial block on which Ss first recorded 
a correct contingency, and remained 
at a high level on postawareness trials. 
These data were evaluated by an analy- 
sis of variance in which Trial Blocks 
and Threat were the independent vari- 
ables. Only the F test for Trial Blocks 
was statistically significant, F (3, 78) 
= 20.43, p < .001, providing further 
evidence that threat of shock per se 
did not systematically influence per- 
formance. The possibility that the 
threat manipulation differentially af- 
fected the performance of aware Ss 
who thought they might avoid shock by 
acting on their awareness was evalu- 
ated next. 

Performance of aware subjects with 
shock-avoidance hypotheses.—For this 
analysis, the postconditioning interview 
protocols of the aware Ss in the Threat 
group were submitted to three judges. 
The judges agreed that 7 Ss spontane- 
ously verbalized shock-avoidance hy- 
potheses, e.g., they reported that they 
thought getting E to say “Good” or 
beginning sentences with the reinforced 
pronoun would help them to avoid 
shocks. These Ss were designated the 
Threat-Shock Avoidance Contingency 
(Threat-SAC) group, and the 17 Ss 
in the Threat group who did not 
verbalize a shock-avoidance contin- 
gency were designated the Threat- 
NSAC group. The postawareness 
conditioning data for the Threat-SAC 


4One S on whom the judges could not 
agree was eliminated; a second S, who did 
not verbalize a correct contingency until 
Trial Block 9, was also excluded because his 
postawareness data could not be readily 
Vincentized, 
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F:6.2. Vincentized conditioning curves on 
postawareness trial blocks for aware Ss in 
the Threat group who verbalized a shock- 
avoidance contingency (Threat-SAC), aware 
Ss in the Threat group who did not verbalize 
a shock-avoidance contingency (Threat- 
NSAC), and aware Ss in the Nonthreat 


group. 


and Threat-NSAC groups were then 
Vincentized into equal thirds and 
evaluated by an analysis, of variance 
which yielded a significant Groups x 
Trial Blocks interaction, F (4, 88) = 
12.01, p < .001, and main effects of 
Trial Blocks, F (4, 88) = 14.37, p < 
001, and Groups, F (1, 22) = 26.79, 
p<.001. The Vincentized condition- 
ing curves presented in Fig. 2 indicate 
that Ss in the Threat-SAC group gave 
increasingly more reinforced-pronoun 
sentences after reporting the correct 
contingency whereas the performance 
of Ss in the Threat-NSAC group was 
quite comparable to that of aware Ss 
in the Nonthreat Group on post- 
awareness trials, as may be noted in 
Fig. 2. 
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Discussion 


In this study, only Ss who reported 
a correct pronoun-reinforcement contin- 
gency during conditioning showed per- 
formance gains, and these first occurred 
on the trial block on which the con- 
tingency was first recorded. These find- 
ings were consistent with results obtained 
in previous verbal conditioning experi- 
ments in which awareness was either 
defined in terms of notes recorded during 
conditioning, as in the present study 
(DeNike, 1964; Spielberger, Bernstein, 
& Ratliff, 1966), or inferred from verbal 
reports obtained in postconditioning in- 
terviews (Dulany, 1962; Spielberger, 
1962). c 

"Threat of shock per se in the present 
experiment did not systematically influ- 
ence performance on the conditioning 
task. However, the shock-threat instruc- 
tion markedly affected the performance of 
those aware Ss who reported that they 
thought giving reinforced-pronoun sen- 
tences would help them avoid shock. 
These findings were generally consistent 
with results recently reported by Martin 
and Dean (1965) who showed that the 
effect of shock on behavior depended 
upon the strategies developed by Ss to 
avoid it, and with those of Bardach 
(1960) who found that the effects of 
shock on learning depended upon Ss' 
attitudes toward the shock as revealed 
in a  postexperimental questionnaire. 
"Thus, it would appear reasonable to con- 
clude that a comprehensive understanding 
of the effects of shock or threat of shock 
on complex human behavior requires 
careful questioning of Ss concerning 
their cognitive and motivational states. 
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Mediated learning was studied in the A-B, B-C, C-D, A-D paradigm 
using a modified short-term memory technique. Pairs were arranged 
into 24 blocks. Within each block, each of the 4 stages of the 
paradigm appeared successively. Each stage was a 3-pair list pre- 
sented for study trials followed by a test trial. 1 or 2 of the pairs in 
a stage were part of a mediated sequence, the remainder were part of 
a control sequence A-X, B-C, C-D, A-D. Each block was completed 
in less than 2 min. The dependent variable was the proportion of test 
pairs (A-D) correct given the previous 3 pairs in a sequence were 


correct. The difference between conditional proportions significantly 


favored mediated sequences. 


According to the mediation hy- 
pothesis a sequence of related stimulus- 
response events (e.g, A-B, B-C . . .) 
within a short time interval should 
raise the probability of the first stimu- 
lus event in the sequence evoking the 
last response. If the objective stimulus 
characteristics of a seen verbal item 
are similar to the assumed stimulus 
characteristics of the same item as a 
spoken response then the four-stage 
verbal-chaining paradigm A-B, B-C, 
C-D, A-D (Jenkins, 1963) used here 
satisfies the operational requirements 
for a test of the hypothesis. 

The present experiment differs -in 
three respects from those in some other 
published reports of the verbal-chain- 
ing phenomenon. Unlike the study by 
Russell and Storms (1955) the pre- 
sumed: mediating links in this experi- 
ment were acquired explicitly, and un- 


like Horton and Kjeldergaard’s (1961) ^ 


Stage 1 Stage 2 
(m) A-B (m) BC: 
im) ALB. (c) Bs-Cs 
(c) Ag-Xs (m) Bi-Ci 

ne B. 
yet Be 
Ar Br 


three-stage chaining experiment facili- 
tated learning depends upon not one 
but two mediating responses succes- 
sively and discretely acquired. Unlike 
both, a modified short-term memory 
technique rather than the conventional 
paired-associate learning method was 


used. 
METHOD 


Design and materials—The pairs on a 
learning list were arranged into 24 blocks on 
memory-drum tape. Within each block, 
each of the four stages of the paradigm ap- 
peared successively. Each stage consisted of 
a three-pair list presented for a number of 
study trials (stimulus-response items to- 
gether) followed by one test trial (stimulus 
item alone). One or two of the three pairs 
in a stage were part of a mediation (M) 
sequence A-B, B-C, C-D, A-D. The re- 
maining pairs were part of a control (C) 
sequence A-X, B-C, C-D, A-D. The fol- 
lowing diagram represents the sequence of 
materials within a typical block, except that 
only one study trial in each stage is shown 
for clarity : 


Stage 3 Stage 4 
(m) C-D: (c) As-Ds 
(m) C-D: (m) Ai-Di 
(c) CD; (m) AD: 

Cr Ar 
Cr Ar 
Cr- Ar 
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A, B, C, and X were four-letter AA words 
from Thorndike and Lorge (1944); D was 
a single digit. No word appeared in more 
than one block. The 3 digits for each block 
were drawn at random without replacement 
until the 10 digits were used up; then the 
process was repeated. Although only one 
study trial is shown at each stage in the 
diagram, in the experiment there were three 
study trials in Stage 1, three in Stage 2, 
two in Stage 3, and one in Stage 4. The 
three pairs within a stage appeared in ran- 
dom order on each trial, except that the 
pairs did not appear in the same order on 
successive trials including the test trial, and 
no pair followed itself. A red line ap- 
peared on the space between study and test 
trials as a signal. Presentation rate was 2 
sec, With 2 sec. between study and test 
trials and a 4-sec. interstage interval the 
complete block appeared in less than 2 min. 
The next block began 6 sec. later, except 
after the twelfth, when about 30 sec. were 
taken to change tapes. 

Every other block contained two M and 
one C sequences; remaining blocks con- 
tained two C and one M sequences. With 24 
blocks there were thus 36 M and 36 G 
sequences. 

There were four counterbalanced forms of 
the 24-block learning lists. Form II was 
the same as Form I except that an X word 
was substituted for the corresponding B 
word in each Stage 1 and vice versa, thus 
reversing the function (mediated or control) 
of the sequences. Then, for example, on 
Form I the first block had two M se- 
quences; on Form II in the first block the 
same two sequences now functioned as (č 
sequences. Forms III and IV were the same 
as Forms I and II except that, by substitut- 
ing B for X words and vice versa in Stage 
2, now each word functioning as a B word 
on Forms I and II functioned as an X word 
on Forms III and IV, and vice versa. Thus 
each word participated equally in both M 
and C sequences. With this counterbalanc- 
ing procedure each Stage 1 was identical 
in Forms I and IV as was each Stage 1 on 
Forms II and III. Each Stage 2 was 
identical on Forms I and II as was each 
Stage 2 on Forms III and IV. Each of 
Stages 3 and 4 were identical across all four 
forms. 

Subjects and brocedure—There were 120 
college Ss, naive to psychological experi- 
ments. They were assigned randomly in 
blocks of four by E to one of the forms ex- 
cept when the assigned form was in use by 


JAMES G. MARTIN AND GEORGE L. PARROTT 


another E. Then the next form on the 
random schedule was used. Each of the six 
Es (mostly naive about the design) ran 5 Ss 
on each form. 

Each S was given a practice block. His 
instructions were to say aloud everything 
presented and to supply the missing words or 
numbers on test trials if he could. All Ss 
met the practice-block criterion of two at- 
tempted responses on Stage 4. The experi- 
mental list presented next was visible 
through the illuminated slot of a memory 
drum mounted behind a black screen. 

Aíter the experiment each S filled out a 
questionnaire containing three items: Item 1. 
"Did you use any particular strategy in 
learning the pairs? If so, what was it?” 
Item 2. “Was there anything you noticed ' 
about the relationship between pairs of words 
or words and numbers that helped you to re- 
member them?” Item 3. “If you think you 
noticed some relationship can you give any 
examples?” 


RESULTS AND Discussion 


Figure 1 shows the stage-by-stage 
scores. The mediation hypothesis pre- 
dicts faster A-D learning in the M se- 
quences. But since B serves as both 
stimulus and response in the M se- 
quences greater interference from 
backward B-A association should oc- 
cur during B-C pair learning in those 
Sequences (Young & Jennings, 1964). 
This difference, seen in Fig. 1, was 
significant, 4(119) = 3.14, p< 0l. 
Thus the availability of the link for 
facilitating learning in the last stage is 
decreased. Even so there was a slight 
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Fig. 1. Responses correct in each stage for 


both halves of the experiment. 
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TABLE 1 


PROPORTIONS AND CONDITIONAL PROPORTIONS Correct IN MEDpiATED (M) 
AND CONTROL (C) SEQUENCES 


Blocks 1-12 Blocks 13-24 All Blocks 
M c M C M c 
b(123) 
Stage 4 correct .341 .353 423 416 .382 .384 
Stage 4 incorrect 112 124 090 110 101 117 
Total 453 ATI .513 .526 .483 .501 
5(4|123) .154 NES .825 113 -791 -167 


but insignificant difference in favor of 
the M sequences in the last stage. 

A fairer test of mediated learning is 
given by an analysis or responses cor- 
rect in Stage 4 given the responses in 
Stages 1, 2, and 3 in a sequence were 
correct, p(4|123). This comparison 
over all S's is shown in Table 1. Note 
first in Table 1 the higher proportion 
of first three stages correct, p (123), in 
C and M sequences. The difference 
between the number of sequences with 
the first three stages correct was re- 
liable, £(119) = 2.13, p < .05, and fol- 
lows from greater interference in the 
B-C stage of the M sequences. But 
p(4|123) was greater for M sequences 
as expected by the mediation hy- 
pothesis. The four proportions P 
(4|123) for each S were transformed 
to radians for a Mediation (M vs. C) 
X Blocks x E X S analysis of vari- 
ance, with E a between-Ss effect and 
Mediation and Blocks as within-Ss ef- 
fects. The result of major interest was 
a significant Mediation effect, F(1, 
114) 2 7.31, p < 01. The Blocks ef- 
fect was also significant, F(1, 114) 
—14.58, p < 01, but the Mediation 
X Blocks interaction was not, F(1, 
114) = 1.96. No E effect approached 
significance; Fs (5, 114 < 1.31. 


The present finding that facilitated 
A-D learning occurs in the A-B, B-C, 
C-D, A-D paradigm when the prior three 


links are available can be taken as sup- 
port for the mediation hypothesis. In 
the only recent paper using this paradigm 
with laboratory associations (Jenkins, 
1963) there was no evidence for a facili- 
tative effect. In Jenkins' experiment each 
stage was a paired-associate list learned 
to criterion. This traditional procedure 
probably allows S more opportunity for 
strategies than do those in the present 
experiment. The usual verbal-mediation 
experiment (using three paired-associate 
stages) typically shows a facilitative ef- 
fect only in the first few trials, a fact 
suggesting that traces from previous ex- 
perimental stages play a transient role 
compared to other resources available 
to adult Ss learning paired associates. 
Such resources may have been minimized 
by the fast presentation rate and rapidly 
changing materials used here, procedures 
which apparently also masked Ss’ re- 
ported postexperimental awareness of the 
relationship between word pairs. These 
considerations suggest that a mediation 
construct sufficiently subtle and general 
to account for thought (Berlyne, 1965), 
language (Jenkins & Palermo, 1964), 
and other complex phenomena must be 
studied in tasks providing the experi- 
mental control and stimulus variation 
possible in short-term memory. 


3 Only one S made a statement on the ques- 
tionnaire even equivocally suggesting knowl- 
edge of pair relationships. 
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AROUSAL, ADAPTATION LEVEL, AND ACCENTUATION 


OF JUDGMENT? 


WOLFGANG SCHÓNPFLUG ? 
University of Frankfurt, Germany 


4 experimental groups estimated the duration of a constant time 
interval presented repeatedly. Ss were either occupied with a count- 
ing task during the estimation interval or were listening to the sound 
of a buzzer. In addition, level of arousal was manipulated by the 
amount of physical work Ss were engaged in before the ratings. 
Intervals were classified as short under the counting condition, and as 
long under the listening condition. Estimates of time intervals classi- 
fied as long increased, estimates of intervals classified as short 
decreased as a function of arousal. With repeated presentations, all 
judgments approached an intermediate, neutral point on the rating 
scale under conditions of lower, but not under conditions of higher 
arousal. The results are interpreted as evidence for a positive relation 
between extent of energy mobilization and deviation from a hypothetical 


adaptation level. 


Absolute psychophysical thresholds 
in one sense modality undergo a change 
when another sense is simultaneously 
stimulated (Duffy, 1962; London, 
1954). Comparable effects appear in 
supraliminal sensations (Schónpflug, 
1965; Zajonc & Dorfman, 1964). In 
general, sensory interaction has re- 
sulted in lower threshold values and 
increases in the intensity of supra- 
liminal sensations. Students of arousal 
have mot hesitated to assume the 
physiological basis of such phenomena 
to be the facilitating influence of the 
reticular activating system, the excita- 
tion of which in turn may depend on 
the extent of sensory input. This 
hypothesis has been supported by posi- 
tive correlations between reported 
stimulus intensities and measures re- 
lated to reticular activation such as 


. iProblem and experimental arrangement 
of this study were conceptualized during 
discussions which the author was fortunate 
to have had with Josefa Zoltobrocki. The 
author is also indebted to Barbara A. Brooks 
and Walter Kintsch, who kindly read and 
greatly improved the manuscript. 

?Now at the University of Bochum, 
Germany. 


the amplitude of the visually evoked 
potential in the electroencephalogram 
(Hernández-Peón & Donoso, 1957), 
blood pressure (Diamond, 1960), and 
electrical skin resistance (Schónpflug, 
1965). 

The effect of arousal upon judgment 
of supraliminal stimulus magnitudes 
may be of two sorts. First, a tendency 
to select higher scale values may in- 
crease as arousal increases. Second, 
arousal may result in an accentuation 
of judgments, i.c, an increased tend- 
ency to depart from the zero point of 
a scale, The first hypothesis implies 
that with an increment in arousal 
stimulus ratings would shift unidi- 
rectionally towards increased values. 
If the second holds true this effect 
should only be observed with ratings 
of stimulus magnitudes located above 
such a zero point. With scales on 
which the zero point holds an inter- 
mediate position, ie, with scales on 
which stimulus magnitudes can be 
judged as large, small, and neutral, 
ratings below the neutral point would 
shift towards lesser values. Further- 
more, if an increment of arousal results 
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in a departure from the neutral point 
of a scale, the process of adaptation, 
le, an approach of ratings towards 
the neutral point with repeated presen- 
tations (Helson, 1964), should be 
slower under conditions of higher as 
opposed to lower arousal. 


METHOD 


Subjects—Sixty Ss, mainly students of 
introductory courses in psychology served in 
this experiment. They were randomly as- 
signed to four conditions, forming groups of 
15 Ss each. 

Apparatus —Flectrical skin resistance was 
measured and registered by a modified model 
of the polygraph, constructed by the Stoelt- 
ing Company, Chicago, Illinois; Velcro elec- 
trodes with a platinum surface were used. 
The ergometer was the standard model built 
by the Jaquet Company, Basel, Switzerland. 
` Procedure.—The experiment required re- 
peated ratings of the duration of a fixed time 
interval, judged to lie above and below the 
neutral point of a rating scale, under condi- 
tions of higher and lower arousal. The Ss 
were asked to rate the duration of a constant 
15-sec. interval. The location of the interval 
in relation to the neutral point of the scale 
was varied by manipulating Ss' activity dur- 
ing this interval. Under one condition they 
had to listen to the sound of a buzzer. In 
the second condition they were acoustically 
presented a sequence of 20 letters and nu- 
merals, and had to count the numerals, The 
Proportion of numerals varied randomly, and 
was on the average .25. Immediately after 
the presentation of the interval Ss rated it 
verbally on an 11-point scale ranging from 
0 to 10, the third point of which (Value 2) 
was designated as "very short," the ninth 
(Value 8) as "very long,” and the sixth 
(Value 5) as "neither short nor long." 
Ratings performed in the counting condi- 
tion are predominantly located below the 
sixth point, those obtained in the listening 
condition above the sixth point (Zolto- 
brocki, 1965). Two conditions of arousal 
were factorially combined with the previous 
conditions. Level of arousal was varied by 
the extent of physical work Ss had to per- 
form. Under one condition Ss were allowed 
to sit in a comfortable chair, under the other 
they had to sit on and peddle an ergometer 
with an effort expenditure of 40 w. 

The Ss were tested individually. After 
receiving instructions they were allowed to 


rest for approximately 10 min. Then they 
received 20 successive presentations of the 
time interval. Between each rating and the 
next presentation were pauses of 90 sec. after 
the fifth, the tenth, and the fifteenth rating, 
and of 30 sec. after the other judgments. 
The Ss performing physical work spent 
the rest period in a comfortable chair, fol- 
lowing which they mounted the ergometer, 
peddling for 1 min. before the first presenta- 
tion and thereafter for 1 min. during the 
three larger pauses described above. The 
other Ss remained seated in the chair 
throughout the whole experiment, During 
each experimental session electrical skin re- 
sistance between the tips of the second and 
the fourth finger of the left hand was con- 
tinuously monitored in order to obtain an 
Objective indicator of the level of arousal. 
Maximal basal resistance in ohms during 
the resting phase (B) and resistance values 
in ohms (4) preceding each of the j presen- 
tations of the interval to be judged were 
determined for each of the i Ss. Then, 
differences (B. — 4,,) were calculated, The 
amount of resistance change (Bi — 4,,) was 
correlated with the resistance level during 
the resting phase (Bi). The resulting func- 
tion was log (10?[B, — Ay, j]) 212 X 107 B, 
+1.06. In order to avoid effects of the cor- 
relation between B, and (B, — Aj) trans- 
formed values X.,—1001og (107[B,— 4,,]) 
(12 X 10* B, + 1.06)? were computed. 


RrsurTs 


The mean changes of electrical skin 
resistance for all experimental groups 
are depicted in Fig. 1. Figure 2 shows 
the ratings of the time interval. 
Muscular activity evidently produced 
an increase in the level of arousal, 
F (1, 56) 2945, p< 01. The ac. 
tivity during the interval to be rated 
created a similar difference, F (1, 56) 
= 9.18, p < .01, but there was no inter- 
action between the two effects, F < 
1.0. Resistance changes increased 
with the number of presentations, F 
(19, 1064) = 1945, p < 01. This in- 
crement was larger in the conditions 
with muscular activity than in the con- 
ditions without muscular activity, F 
(19, 1064) = 2.60, p <.01. None of 
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the other interactions were significant. 
The ratings of the time intervals had 
consistently different positions on the 
scale according to the activity during 
the presentations, being located below 
the neutral point of the scale under 
the counting condition and above it 
under the listening condition, F (1, 
56) = 43.55, p < .01. Muscular per- 
formance (associated with higher 
arousal) led to an increment of ratings 
located above the neutral point and a 
decrease of ratings located below it, 
F (1, 56) = 9.14, p< 01. With re- 
peated presentations, ratings ap- 
proached the neutral point of the scale 
if arousal was not enhanced by muscu- 
lar activity. In the conditions with 
added muscular activity judgments 
tended to depart from rather than ap- 
proach the neutral point of the scale, 
F (19, 1064) = 2.79, p< 01. The 
other effects were not significant. 


e—e MUSCULAR WORK, COUNTING 
e—o NO MUSCULAR WORK, COUNTING 
Aaa MUSCULAR WORK, LISTENING 
aa NO MUSCULAR WORK, LISTENING 
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o 
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Fic. 1. Mean changes of electrical skin 
resistance in ohms (transformed values) 
measured immediately before each of 20 pres- 
entations of a time interval to be rated for 
duration. 
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MUSCULAR WORK, LISTENING 
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Fic. 2. Mean ratings of the duration of a 
fixed time interval for 20 successive presen- 
tations. 


DISCUSSION 


The results provide evidence that en- 
hancement of arousal through an addi- 
tional stimulation leads to a departure 
of judgments away from a neutral value 
of a rating scale, thus accentuating per- 
ceived stimulus properties. This finding 
may be best interpreted in the light of 
Helson's (1964) adaptation-level theory. 
According to this theory objective stimu- 
lus intensities are averaged for each 
discriminable quality to form a level of 
adaptation; any stimulus intensity per- 
ceived is graded in relation to this level. 
With repeated presentation of a constant 
stimulus differing in intensity from the 
original adaptation level, the level shifts 
towards the magnitude of this stimulus. 
The neutral point of a rating scale repre- 
sents the subjective equivalent of the 
objectively determined adaptation level. 
Thus, with repeated presentations of 
stimuli originally not regarded as neu- 
tral, ratings of the magnitudes of these 
stimuli tend to approach the neutral 
value. The latter hypothesis also holds 
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for the estimation of time intervals, as 
has recently been demonstrated by Zolto- 
brocki (1965). Discrepancies between 
a perceived stimulus magnitude and the 
adaptation level, or judgments on a rat- 
ing scale and its neutral point, respec- 
tively, are obviously related to the amount 
of energy released in the organism. 
Establishing and maintaining a differ- 
ence between perceived stimulus inten- 
sity and adaptation level, or rating value 
and neutral point requires energy; re- 
ducing it conserves energy. With the 
extent of energy mobilization, the need of 
saving energy decreases, and departures 
from the adaptation level (and hence 
from its subjective equivalent, the neu- 
tral point of the scale) become more 
stable. This explains why, in situations 
in which degree of arousal varies, ratings 
show differences as observed in this 
study: if magnitudes deviate from adap- 
tation level, deviations are larger under 
conditions of higher arousal. Under the 
same conditions, a shift of level of adap- 
tation is at least delayed. Such an ex- 
planation implies the hypothesis of an 
interaction between cognitive and ener- 
getic processes, which may be of some 
significance for the theory of arousal as 


well as for further elaboration of the 
adaptation-level theory. 
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OCULOMOTOR PATTERNS DURING THE SOLUTION 


d 


OF VISUALLY DISPLAYED ANAGRAMS* 


IRA T. KAPLAN 
Ophthalmology Department, New York University Mi edical Center 
AND. W. N. SCHOENFELD 
Columbia University 


The S was shown a series of 40 5-letter anagrams. 


The first 20 


anagrams could all be solved by rearranging their letters in the same 
order, the next 10 followed a different order, and the last 10 followed. 


a 3rd rule. 


graphed. After the whole serie: 


whether he had noticed any pattern in the anagrams. 
discovered the rules developed distinctive 


As S solved the anagrams, his eye movements were photo- 
s had been presented, S was asked 


Those Ss who 
eye-movement patterns: 


they looked at the letters of the anagram jn the order that they ap- 
peared in the solution word, and the solution was achieved with just 5 


fixations, 1 on each letter. 


When the rule that solved the anagrams 


was changed, the fixation pattern also changed to follow the new rule. 


This oculomotor response may be regarded as 


the behavioral counter- 


part of a “mental set” to perceive the letters in the order of the rule. 


As a person solves a series of prob- 
lems which can all be solved by the 
same method, he develops a tendency 
to apply that method to subsequent 
problems. Rees and Israel (1935) 
demonstrated this phenomenon in the 
solution of anagrams. In one experi- 
ment, they first presented visually a 
training series of five-letter anagrams 
that could all be solved by rearranging 
their letters in the same order. Each 
anagram had only one solution. These 
single-solution anagrams were then fol- 
lowed by a test series of ambiguous 
anagrams, each of which had at least 
two solutions. In one solution, the 
letters were rearranged in the same 
order as the preceding series; in the 
other solutions, the letters Were ar- 
ranged in different orders. The effect 
of the training series was to establish 


1 This research was supported by Public 
Health Service Research Grant MH08164 
from the National Institute of Mental 
Health. 

We are grateful 
Melinda Myers for 
work, 


to Carol Wright and 
their assistance in this 


a bias toward those solutions of the 
ambiguous anagrams whose letters 
were arranged in the same order as 
the training series. The authors at- 
tributed this effect to the operation of 
a “mental set” that was established 
during the training series. 

The use of ambiguous anagrams as 
a test for “‘set” suffers from the limita- 
tion that it is inconclusive when ap- 
plied to single items. Rees and Israel 
inferred the establishment of the “set” 
by comparing the performance of their 
experimental Ss with the performance 
of a control group which did not re- 
ceive the training series. The experi- 
mental group solved 94.6% of the 
ambiguous anagrams according to the 
rule, while the percentage for the con- 
trol group was 47.3. Clearly, the oc- 
currence of any single solution that 
fits the rule does not signify the opera- 
tion of the “set” on that particular 
trial. 

Subsequent investigators viewed this 
anagram-solving behavior in the con- 
text of learning theory. Maltzman and 
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his associates (Maltzman, Eisman, 
Brooks, & Smith, 1956; Maltzman & 
Morrisett, 1952, 1953) related “set” 
to Hull’s concept of habit strength, 
and Adamson (1959) emphasized the 
role of reinforcement history. But 
these workers, too, compared the per- 
formance of groups of 5s summed over 
blocks of anagrams, using the measures 
of solution time, number of solutions 
within a limited time, and, again, the 
distribution of solutions to ambiguous 
anagrams. The operation of the "set" 
was still not observed in the single 
solutions of individual Ss. 

As an alternative to the unobservable 
“mental set,” a basis for this type of 
performance might be sought in con- 
ditioned eye-movement patterns. In 
the present experiment, S’s eye move- 
ments were photographed on every 
trial as he tried to solve a series of 
anagrams that were all formed accord- 
ing to the same transposition rule. 
Our purpose was to discover whether 
a distinctive pattern of oculomotor be- 
havior emerged in the course of suc- 
cessive solutions, in particular whether 
S learned to fixate the letters of the 
anagram in the order that they ap- 
peared in the solution word. 


METHOD 


Subjects.—Seven male college students 
served as S's. Solution times were recorded 
for all of them, and eye movements were 
photographed for five of them. 

Apparatus—The anagrams were presented 
one after the other by a Kodak Carousel 
slide projector. When a slide appeared on 
the screen, the light activated a photocell 
which started an electric timer calibrated in 


Rule: 34521 

1. richa 11. tnpla 
2. kebri 12. knthi 
F eshou 13. lecam 

. retig 14. evpri 
5. elwha 15. EN 
6. elunc 16. nedoz 
T. ydcan 17. dichi 
5 macre 18. nitra 

. rewat 19. yt 
10, helun 20) himon 
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01 sec. As soon as S solved the anagram, 
he pressed a telegraph key that stopped the - 
timer. 

An electrically operated Bolex 16-mm. 
movie camera, running at a calibrated speed 
of 13.6 frames per sec., photographed S’s eye 
movements as he looked at the anagram. 
The S was aware that his eye movements 
were being photographed. Head movement 
was restricted by a bite bar. The letters 
were widely separated, so that when the films 
were analyzed frame by frame, the position 
of S’s eye indicated which letter he was 
fixating. 

The five letters of each anagram were 
displayed in a rectangular pattern, with a 
letter at each corner and one letter in the 
center. The letters were typed on the slides 
in lower case IBM “Modern” type. At the 
viewing distance of 50 in., the taller letters 
were 4.6° high, and the shorter letters were 
3.4°. The horizontal separation between the 
corner letters was 41.6°, and the vertical 
separation was 28.0?. These separations 
were chosen to foster clearly distinguishable 
eye movements, since our preliminary obser- 
vations showed that, with closer spacing of 
the letters, S occasionally used his visual 
span to dispense with a separate fixation on 
each letter, ie, he was observed to fixate 
between adjacent letters. The positions of 
the letters are numbered for reference in the 
left half of Fig. 1. The right half of the 
figure shows the anagram for "brick," the 
transposition rule being 34521, ie., the first 
letter of the solution is in Position 3 of the 
anagram, the second letter is in Position 4, 
and so on. 

The anagrams were intended to have 
unique solutions. No letter appeared twice 
in the same anagram, so that only one se- 
quence of letter positions spelled the solution 
word. Most of the words were taken from 
the lists of Rees and Israel. We discovered 
later that two of the anagrams had alterna- 
tive solutions: the intended solutions being 
"whale" and "dozen," and the alternatives 
“wheal” and "zoned"— however, no Ss found 
the latter two solutions. The following 40 
anagrams were used in the experiment: 


Rule: 31245 Rule: 52143 
21. odmel 31. 

22. lecrk 2 inele 
23. iagnt 33. acfrs 
24. auflt 34. lupit 
25. tosrm 35. iaysd 
26. rufit 36. nasdh 
27. ridnk 37, rasdc 
28. njeoy 38. iwsnt 
29. wasmp 39. kasec 


30. rupne 40. roskf 


OCULOMOTOR ACTIVITY IN ANAGRAM SOLUTION 


1 2 k c 


4 2j r i 


Fic. 1. Five positions in which letters 
were placed are numbered on the left. On 
the right is an anagram that spells brick 
when the letters are read in the order 34521. 


Procedure.—The anagrams were shown to 
each S in the order that they are listed 
above. The first 20 anagrams could all be 
solved by rearranging the letters in the 
order 34521. "Then the rule was changed to 
31245 for the next 10 anagrams. For the 
last 10 anagrams, the rule was 52143. The 
S was not told of the existence of these rules. 

While waiting for each anagram to appear, 
S fixated the center of the screen. As soon 
as he found a solution, he simultaneously 
closed his eyes and pressed the telegraph key 
that stopped the timer. Releasing the bite 
bar, he spoke the solution word. The S 
kept his eyes closed while E recorded the 
solution time. Then, at a spoken signal 
from E, he resumed his position and fixated 
the center letter of the last anagram until 
the slide changed. If S did not find a 
solution within 20 sec., the trial was termi- 
nated and the next anagram was presented. 
After the last anagram, S was asked whether 
he had noticed any pattern in the solutions. 


RESULTS 


Solution time—The solution times 
of three Ss were consistently short, 
except when the rule was changed. 
These Ss were SJ, DO, and TC, whose 
median solution times were 1.59, 1.68, 
and 2.00 sec., respectively. The times 
of four other Ss varied widely from 
item to item through all 40 anagrams. 
When asked whether they had noticed 
any pattern in the solutions, the three 
quick Ss reported correctly that there 
was a pattern and that it changed 
twice. Two variable Ss also reported 
the rules: GE and PS, whose median 
solution times were 2.45 and 3.93 sec., 
respectively. The two other variable 
Ss were unaware of any pattern. 
"Their median solution times were 4.20 
sec. for RD and over 20 sec. for VG. 

Solution time on each successive 
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anagram is plotted for three repre- 
sentative Ss in Fig. 2. The top graph 
illustrates the variable performance of 
RD, who did not recognize the pattern. 
The middle graph shows the similarly 
variable performance of PS, who did, 
however, report that there Was a pat- 
tern which changed twice. The con- 
sistently fast solutions of DO, who 
also discovered the patterns, are shown 
in the bottom graph. After the fifth 
anagram, most of DO's solutions were 
found in less than 2 sec. On both 
occasions when the rule was changed, 
on Items 21 and 31, there was an in- 
crease in DO's solution time. The 
median solution times based on all 
anagrams for RD, PS, and DO were 
4.20, 3.93, and 1.68 sec., respectively. 

Sometimes an S pressed the key ac- 
cidentally or reported an incorrect so- 
lution. This happened once with RD 
(Anagram 6) and four times with PS 
(Anagrams 5, 13, 28, and 33). No 
times are shown for these trials in Fig. 
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Fic. 2. Solution time on successive ana- 
grams for three Ss; ns indicates no solution. 
(The first 20 anagrams were solvable by the 
rule 34521, the next 10 by the rule 31245, and 
the last 10 by the rule 52143.) 


10 po 


[jJ 10 20 30 40 
Anagram Number 


Fic. 3. Number of fixations per anagram 
for the same three Ss as in Fig. 2. (The 
open circles indicate solutions reached after 
only five fixations in the order of the rule.) 


Fixation patterns—The number of 
fixations per solution is shown for the 
same three Ss in Fig. 3. The open 
circles represent solutions character- 
ized by two features: only five fixa- 
tions were made, and the letters were 
fixated in the order of the rule. The 
fewest fixations that ever occurred 
during any solution was five. Subject 
RD found only one solution with five 
fixations, but they did not follow the 
rule. On Trial 2 he looked at the 
letters of the anagram in the order 
31452, fixating the letters "bkric," and 
immediately announced the solution. 
In none of RD's solutions did any 
sequence of five fixations occur in the 
order that the letters appeared in the 
solution word, The only obvious pat- 
tern was a tendency to move in a 
counterclockwise direction. Thus, on 
11 anagrams, RD's first three fixations 
were on Positions 3, 1, and 4, 

Subject PS solved 11 different ana- 
grams with only five fixations each. 
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On 10 of these, the letters were fixated 
in the order that they appeared in the 
solution. These 10 anagrams are 
indicated by the open circles in Fig. 3. 
There are 7 such items among the first 
20, where the fixations followed the 
transposition rule 34521. When the 
rule was changed on Anagram 21, the 
first four fixations were on Positions 
3452. It was not until Anagrams 29 
and 30 that PS gave solutions with 
only five fixations which followed the 
new rule. During the last 10 anagrams, 
only the last solution consisted of five 
fixations that followed the third rule. 

PS consistently made fewer fixations 
per solution than RD, although his 
solutions were not much faster because 
his fixations lasted almost 50% longer 
than RD’s. In terms of the mean 
number of fixations per second, the 
rates were 2.16 for PS, 3.06 for RD, 
and 2.91 for DO. 

Eye movements were recorded for 
two other Ss: GE, whose solution 
times were variable, and TC, whose 
times were consistently short. These 
tecords were not taken on every trial, 
but they showed that each S solved 
several anagrams with five fixations 
in the order of the rule, and both Ss 
reported the rules correctly. GE aver- 
aged 2.71 fixations per sec., and TC 
averaged 3.00 fixations per sec. 

The influence of the transposition 
rules upon the pattern of fixations is 
shown most clearly in the graph for 
DO. As the open circles indicate, he 
solved 28 anagrams with five fixations 
according to the rule. On the fourth 
anagram, and on the 15 consecutive 
ones from the sixth to the twentieth, 
he fixated the letters in the order 
34521. When the rule was changed 
on Anagram 21, DO's first five fixa- 
tions followed the old rule. But the 
next anagram was solved with five 
fixations in the order of the new rule: 


OCULOMOTOR ACTIVITY IN ANAGRAM SOLUTION 


31245. Seven of the next eight ana- 
grams were solved in the same way. 
When the rule was changed again on 
Anagram 31, the first five fixations 
still followed the second rule. Not 
until the last four anagrams was the 
third rule, 52143, followed exactly. At 
the start of each of the last eight items, 
while DO waited for the slide to 
change, he fixated Position 5 instead 
of the standard Position 3 which he and 
the other Ss usually fixated in keeping 
with the instructions. 


DISCUSSION 


Our results show that presenting a 
series of similarly formed anagrams tends 
to condition an oculomotor pattern in 
which the letters of the anagrams are 
fixated in the order that they appear in 
the solution words. It seems reasonable 
to suppose that such a pattern is selec- 
tively reinforced by its correlated rapidity 
of solution. That is, a sequence of fixa- 
tions that follows the rule usually achieves 
a solution sooner than any other fixation 
pattern. However, solution time alone 
does not reveal the behavior that goes 
on during the process of solution, nor is 
it necessarily related to “awareness” of 
the rule. Thus, PS and RD had similar 
solution times, although PS was able to 
report the rules while RD was not, and 
the eye-movement record showed the 
influence of the rules in PS's fixations 
but not in RD’s. 

Our observations indicate that eye 
movements are not essential to the solu- 
tion of anagrams. Solutions often occur 
in which the letters are not fixated in 
the order that they appear in the solution 
word, and unless the letters are widely 
separated they do not receive individual 
fixations. But when the letters are 
widely separated, as in the present ex- 
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periment, then eye movements become a 
sensitive index of performance, Of the 
four performance measures employed in 
anagram studies of the present type— 
namely, solution time, verbal report, rela- 
tive frequency of solutions to ambiguous 
anagrams, and eye movements—the ocu- 
lomotor sequence is the most precise in 
terms of applicability to single trials, and 
the most informative in terms of activity 
during the solution process. When the 
perceptual conditions are appropriately 
arranged, the conditioned fixation pattern 
emerges as the behavioral substance of 
the hypothetical “mental set” to perceive 
the letters of an anagram in the order 
of the rule. When, on the other hand, 
perceptual conditions are such as to make 
a fixation pattern of the eye unnecessary 
for anagram solution, the argument of 
motor theory would require behavior 
elsewhere in the musculature as the basis 
of the “set” and of the solution, The 
validity of this argument is, of course, 
a classic issue in general behavior theory. 
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Brackbill (1964) has found that 
acquisition results in better retenti 
reinforcement is used. 


which was given in addition to the r 


occur when the feedback was eliminated and traditional delay of 


reinforcement procedures were used. 


In a recent series of experiments, 
Brackbill and her collaborators (Brack- 
bill, 1964; Brackbill, Boblitt, Davlin, 
& Wagner, 1963; Brackbill, Bravos, & 
Starr, 1962; Brackbill, Isaacs, & 
Smelkinson, 1962; Brackbill & Kappy, 
1962; Brackbill, Wagner, & Wilson, 
1964; Lintz & Brackbill, 1966) have 
consistently demonstrated that delayed 
reinforcement during acquisition re- 
sults in better retention than does im- 
mediate reinforcement. This result has 
been called the delay-retention effect 
(DRE). With one exception (Brack- 
bill, Bravos, & Starr, 1962, Exp. II), 
the experiments using her stimulus dis- 
crimination task failed to show that 
acquisition was impaired by delay of 
reinforcement, contrary to the results 
of numerous experiments with animals, 
and some verbal learning experiments 
on delay of knowledge of results (see 
Renner, 1964), 

Brackbill suggested that Ss used re- 
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The present experiment, using her stimulus 
discrimination procedure, indicated that the effect was due to feedback 


K. EDWARD RENNER? 


delay of reinforcement during 
ion than occurs when immediate 


einforcement. The effect did not 


sponse-produced cues to bridge the 
delay interval (Brackbill & Kappy, 
1962); however, contrary to her ex- 
pectations, an interpolated delay in- 
terval task had no effect on speed of 
acquisition, but the same task impaire 
acquisition when it followed immedi 
ate reinforcement (Brackbill, 1964). 
Her findings, then, present two pr 
lems: the absence of a gradient ol 
delay of reinforcement and the diffi- 
culty of understanding the DRE. 
An analysis of Brackbill’s procedure 
shows that her studies are not delay 
of reinforcement or delay of knowle 
of results (KR) experiments in th 
usual sense of these terms. In her 
stimulus discrimination task, marbles, 
which later could be exchanged for 
a small toy, provided reinforcement. 
In addition, an unusual kind of feed 
back was given; the stimuli w. 


sounded, depending on whether the 
sponse had been correct or incorrect. 
Also, the marbles used for reinforce- 
ment were delivered from a chute into 
à receptacle under the correct stimulus. | 
Thus, Ss receiving delayed reinforce- 
ment did not have to remember their 
response over the delay interval, for 
they were given full information of 
which response they had made and 
which response they should have made. | 


FEEDBACK AND THE DELAY-RETENTION EFFECT 


In the delay condition, reinforcement 
was contiguous with information 
(feedback) on which was the correct 
stimulus. The longer reinforcement 
(or knowledge that a response was 
correct or incorrect) is delayed after 
a response, the less information it 
provides because S forgets his re- 
sponse. However, with Brackbill’s 
procedure, the amount of information 
was the same regardless of the length 
of the delay interval as long as S paid 
attention to the feedback. One pur- 
pose of the present study was to estab- 
lish the existence of a delay of rein- 
forcement gradient when the procedure 
does not provide the extra feedback 
used by Brackbill. 

The confounding of reinforcement 
(providing a marble for a correct re- 
sponse) with the feedback (presenting 
a light over the correct stimulus, the 
auditory signal, and having the marble 
dispensed into a receptacle under the 
correct stimulus) makes it impossible 
to determine whether the DRE is due 
to delay of reinforcement or to the 
delayed feedback. The second purpose 
of the present experiment was to sepa- 
rate the effects of these two variables. 
It was expected that S would pay more 
attention to the feedback when it oc- 
curred at the end of a delay interval 
than when it immediately followed the 
response and was redundant with re- 
inforcement, and that this additional 
attention to feedback by the delay Ss 
would prove to be the crucial factor 
producing the DRE. 


METHOD 


Experimental design—There were five 
groups in the experiment. The first two 
groups corresponded to Brackbill’s immedi- 
ate and delay groups; thus, her two principal 
conditions were present in our experiment. 
Group 1 received immediate reinforcement 
and immediate feedback (0-sec. delay), 
whereas, for Group 2, reinforcement and 
feedback were delayed by 10 sec. If immedi- 
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ate feedback does not add any information 
to the learning situation when reinforcement 
is also immediate, eliminating feedback 
should have no effect on learning or reten- 
tion. If feedback does add information when 
delayed, eliminating it should produce slower 
learning and no improvement in retention. 
Groups 3 and 4, which received no feedback, 
but only immediate or delayed reinforce- 
ment, respectively, were included to test 
these predictions. Finally, if feedback is 
crucial for producing the DRE, then Group 
5, receiving immediate reinforcement and 
delayed feedback, should show improved 
retention. 

Subjects—The Ss were 23 girls and 23 
boys from the third-grade classes of two 
elementary schools. All Ss were white. 
They were assigned randomly to the experi- 
mental groups. The final distribution of Ss 
was: 5 boys and 5 girls in Group 1, 4 boys 
and 4 girls in Group 2, 5 boys and 6 girls 
in Group 3, 5 boys and 4 girls in Group 4, 
and 4 boys and 4 girls in Group 5. 

Apparatus and experimental procedure.— 
The apparatus and experimental procedure 
closely resembled Exp. II of Brackbill, 
Bravos, and Starr (1962). The apparatus was 
modified so that a single delivery chute was 
located between the two stimulus windows, 
making marble delivery independent of feed- 
back on what was the correct stimulus. The 
procedure was modified by giving five ac- 
quisition and five relearning trials instead of 
Brackbill’s dropout and criterion method. 
Other changes in procedure were: no recog- 
nition measure was used; retention was 
measured after 7 days rather than 8 days; 
the pretraining item was not used in relearn- 
ing; a buzz sounded for an incorrect re- 
sponse, and a click for a correct response, 
rather than vice versa; relearning was car- 
ried out under the same conditions of feed- 
back and reinforcement used in learning ; and 
the intertrial interval varied from 10 to 16 
sec. Otherwise, the procedure, stimulus 
material, instructions, and apparatus were 
essentially the same as used by Brackbill. 


RESULTS AND DISCUSSION 


Delay of reinforcement gradient.— 
Table 1 shows the means and SDs of 
the acquisition performance of the five 
groups. Analysis of variance of this 
performance shows a significant treat- 
ments effect for errors on Trials 2-5, 
F (4,41) = 4.16, p < .01, and for errors 
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TABLE 1 


MEANS AND SDs OF THE ACQUISITION ERRORS AND PERCENTAGE OF 
SAVINGS SCORES FOR THE EXPERIMENTAL Groups 


Acquisition Errors 


Percent e of 
Savings Score 
Group N Trials 2-5 Trial 5 
X SD X SD X 

FB 10 12.20 823 1.50 2.50 44.25 43.72 
mre, | | 7.00 4.14 0:38 0.74 80.61 19.61 
Rr 11 12,91 11.27 1.91 2.81 57.96 ee 
Rp 9 23.78 9.60 5.56 3.50 42.71 30. F 
RiFBp 8 11.25 9.15 0.50 1.07 82.71 17.1 


on Trial 5, F (4, 41) = 6.45, p < .001. 
Individual £ tests indicate that the sig- 
nificant F resulted from the fact that 
Group 4, receiving delayed reinforce- 
ment and no feedback, performed sig- 
nificantly poorer than all the other 
groups. There were no significant dif- 
ferences among the other four groups. 
Thus, when feedback is eliminated and 
reinforcement is delayed, a typical 
temporal gradient may be observed. 
Delay-retention effect.—As in Brack- 
bill’s work, a Percentage of savings 
score was computed (number of errors 
on Trials 1-5 minus number of errors 
on Trials 6-10, divided by the number 
of errors on Trials 1-5, multiplied by 
100). The means and SDs of the 
percentage of savings scores are given 
in Table 1. Analysis of variance shows 
a significant treatments effect, F (4,41) 
= 3.52, p < .05, resulting from signifi- 
cantly greater savings by the two de- 
layed feedback conditions, Groups 2 


from each other, Thus, by separating 
the effects of reinforcement and feed- 
back, it has been shown that the DRE 
is a function of the unusual feedback 
used by Brackbill The question of 
theoretical interest is why delayed feed- 


back facilitates retention, while immedi: 


ate feedback does not. 


increase the Probability of S’s forgetting 
his response during the delay interval 
and thus increase his attention to the 


O pay more attention to” 
the marble, When à delay is imposed 
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ticipates. This second reason may be the 
primary one for Ss in Group 5, who learn 
about the correctness of their responses 
prior to feedback. 

We are left with the problem of why 
looking at the light should facilitate re- 
tention, if this is indeed the crucial factor 
in producing the DRE, Delayed feed- 
back may produce several relevant effects, 
For example, the delay Ss have an op- 
portunity to practice remembering and 
to receive confirmation from the feedback 
which should aid retention. The failure 
of an interpolated task (Brackbill, 1964) 
to disrupt the acquisition performance in 
the delay condition suggests that the im- 
portant factors occur in the postfeedback 
interval. Specifically, the memory proc- 
ess inyolved in using this delayed feed- 
back, perhaps for confirming what is re- 
membered, presumably requires verbal 
mediation which may be absent in the 
immediate condition. The memory proc- 
ess for storage and retrieval of the 
feedback becomes then, a relevant con- 
cern. According to an analysis which 
focuses on feedback, acquisition, as well 
as relearning, should be faster for the 
delayed-feedback Ss, since the problem 
is the same on each trial, whether the 
trials are separated by minutes or days. 
There is some evidence that this is in 
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fact the case in a trend for the delay Ss 
(Group 2) to learn faster than the 
immediate Ss (Group 1). 
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TRIAL-BY-TRIAL ANALYSIS OF PROCESSES IN SIMPLE 
AND DISJUNCTIVE CONCEPT-ATTAINMENT TASKS + 


STEVEN H. SCHWARTZ? 
University of Illinois 


The effects of 2 types of concepts and 3 methods of presentation 
were assessed in a concept attainment (CA) task. 11 dependent 
measures were reported, based on trial-by-trial recording of stimulus 
cards, placements, and verbalized hypotheses concerning S's estimate 
of the correct classification rule. The results indicated large concept- i 
type effects on nearly all measures, while methods and interaction 
effects were limited to a few measures and were smaller. The value 
of considering hypotheses and strategies for selecting sequences of 
stimuli was demonstrated by intraclass correlations of .883-.969 
between the number of correct placements an S made and the number 
predicted from his trial-by-trial hypotheses, and by the finding that 
about # of the variance in trials to CA for Ss in the simple-concept 
condition could be accounted for by the sequence of instances they 
inherited from yoked Ss who were free to select instances in any order 


they wished. 


The primary aim of this research 
was the identification of antecedent 
“process variables” lawfully related to 
final concept attainment (CA) be- 
havior. For CA, Ss were required to 
make eight successive statements of 
the correct classification rule, con- 
joined with eight correct placements 
of instances. Process variables con- 
sisted of seven measures derived from 
Ss’ trial-by-trial hypotheses, plus two 
measures dependent on the particular 
orders of instances Ss chose. Since 
nearly all the process measures con- 
sidered, as well as the CA criterion, 
are based on Ss’ verbal hypotheses, 
the validity of these hypotheses, or 
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more specifically, their relation to 
covert hypotheses and to subsequent 
overt behavior, becomes a question of 
prime concern. This problem has re- 
cently been the subject of some con- 
troversy. 

In a CA task, Verplanck (1962) re- 
ported dissociation of card-placement 
behavior and corresponding verbal hy- 
potheses offered on each trial, thereby 
challenging the utility of verbal reports 
in general and their central position 
in Dulany’s (1962, 1964) theory of 
propositional verbal control in particu- 
lar. Dulany and O'Connell (1963) 
in a replication of the Verplanck and 
Oskamp ^ procedures — (Verplanck, 
1962), demonstrated that those results 
could be accounted for by two artifacts. 
Some of the stimuli used were s0 
ambiguous with respect to the classifi- 
cation rule that nonreinforced Ss in- 
structed to sort by the same rule made 
almost as many errors as uninstructed 
Ss in the original experiment. Sec- 
ondly, Verplanck and Oskamp failed 
to consider the chance level of correct 
placements associated with uncorre- 
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lated hypotheses. When these short- 
comings were corrected, the divergence 
of observed placements from place- 
ments predicted from hypotheses was 
nonsignificant. 

More recently, O'Connell (1965) re- 


ported no significant differences be- . 


tween expected and observed place- 
ments, under 60% and 100% 
reinforcement conditions, as well as 
with subsequent reversal and nonre- 
versal shifts, when correlated hy- 
potheses were considered, thereby con- 
firming and extending the Dulany and 
O’Connell (1963) results. The pres- 
ent study further investigated this 
relationship using two types of con- 
cepts and three methods of presenta- 
tion of stimuli. 


METHOD 


Subjects.—Seventy-two Ss were drawn 
from an S pool of students in an introductory 
psychology course and randomly assigned 
(with one modification to be explained later) 
to the treatment conditions within a 3X2 
design. 

Materials—A set of 64 3X5 in. plain 
white index cards, upon which three geo- 
metric figures had been drawn with the aid 
of a stencil, served as stimuli. All cards 
contained, from left to right, a 1-in. high tri- 
angle, circle, and hexagon. Each figure 
was either red or blue, solid or dotted. All 
permutations of these values were included 
in the set yielding 4°, or 64 different stimuli. 
A large table, two wooden topless containers, 
each divided into five labeled compartments 
in which cards could be placed, a small 
wooden stand inclined at about 30° to the 
vertical (for display of the cards), and a 
stopwatch were additional materials em- 
ployed. 

Tasks-—In the simple-rule task placing 
cards with a solid hexagon to the right was 
correct, while in the disjunctive-rule task 
placing cards with either a solid red triangle 
or a solid red hexagon to the right was cor- 
rect, (The labels for the rules are some- 
what arbitrary in that they might just as 
well be classified as conjunctive and discon- 
junction of conjunctions, respectively.) 

Methods of presentation—Under the Se- 
lection-Strategy condition, Ss selected a card 


on each trial from a display of 64 cards ar- 
ranged in random orders on a table in front 
of them. A yoked Reception-Strategy group 
received the cards in the same order as their 
paired Ss in the selection group. (The ran- 
dom order for assigning Ss to conditions was 
therefore modified so that an S in the yoked 
group would be run only after his paired S 
in the selection condition had already been 
run.) Additional cards beyond those se- 
lected by the paired selection Ss were ran- 
domized from the remaining cards and placed 
after the ordered cards. Finally, a Recep- 
tion-Random group worked under the typical 
reception condition, in which they received 
the cards in random order. The six groups 
will be referred to as: Simple-Selection 
Strategy (SSS), Simple-Reception Strategy 
(SRS), Simple-Reception Random (SRR), 
Disjunctive-Selection Strategy (DSS), Dis- 
junctive-Reception Strategy (DRS), ‘and 
Disjunctive-Reception Randon (DRR). 
Procedures.2—All Ss were run individually 
by the same E. After a few minutes of 
conversation to establish rapport, all Ss read 
along while E read the following instructions 


aloud: 


The experiment you are participating in 
is basically a card sorting task. The cards 
you see in front of you can be sorted 
systematically into two stacks in a num- 
ber of ways, that is, according to many 
different rules. The correct rule for sort- 
ing the cards into two stacks in this 
particular problem may be simple, conjunc- 
tive, or disjunctive. 


There followed explanations and examples of 
these three types of concepts. 

Next Ss were instructed that the cards 
would vary "only in color, red or green, 
and form, solid or dotted, of each of 
the three figures,” and that they were “to 
pick (for the selection groups) any card on 
the table, (for the reception groups) the top 
card, and decide whether to put it onto the 
positive (right) or negative (left) pile (i.e. 
it is an example of the rule or it isn't)." 


2 The complete instructions to Ss have been 
deposited with the American Documentation 
Institute. Order Document No. 8897 from 
ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, 
Washington, D. C. 20540. Remit in advance 
$1.25 for microfilm or $1.25 for photo- 
copies and make checks payable to: Chief, 
Photoduplication Service, Library of 
Congress. 
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The Ss were then told to indicate on 5- 
point scales their confidence in their hypoth- 
eses on each trial along with their evaluation 
of the importance of each trial After 
placing the card on each trial $ was required 
to state his reason for placing the card the 
way he did. 


Tell me your reason in the following 
way: Cards with (without) go into 
the right (left) pile. The blank is to be 
filled in with the reason for placing the 
card to the right or left. This reason 
should fit every single card to be placed 
on that particular side and should not 
describe any card on the other side. The 
rule for placing cards on the negative or 
left side will just be the converse of the 
correct rule for the right side. Once you 
determine the correct rule or reason, you 
should be able to sort the cards without 
error. 

After you have done this, I will tell 
you the correct stack the card should have 
been placed in, and you are to pick the 
card up and put it on that side. It is 
possible, of course, to have the correct 
placement for the wrong reasons. 


Finally S was told that his goal should be 
to determine the correct rule and that once 
he achieved that rule he would make no 
errors in placements by following it. 

After reading the instructions, § began to 
select stimuli from the display of eight rows 
of eight cards (if he was in the selection 
condition) or began taking successive cards 
from the prearranged pile (if he was in 
either of the other two reception conditions). 
After each placement, E told S which side 
the card actually belonged on and S cor- 
rected his placement if required. The ex- 
periment continued until $ chose the correct 
placement with the correct hypothesis eight 
times in succession, the 64 trials were com- 
pleted, or the 1-hr. time limit expired. 
Each S was exposed to a minimum of 32 
trials if needed, 

‘Response measures.—On each trial E re- 
corded the following: trial number, stimulus 
card, Correct placement, actual placement, 
S's hypothesis, his confidence, and his evalu- 
ation of the importance of the trial. Eleven 
dependent variables were derived from these 
raw data. The first three indicated the num- 
ber of Ss attaining the concept, time to at- 
tainment, and trials to attainment, Six 
measures were dependent directly on the 
trial-by-trial hypotheses of Ss. These in- 
cluded the number of uniquely different hy- 
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potheses offered over the number of trials 
an $ experienced prior to the criterion trials, 
the number of objects and attribute values 
(red-green, solid-dotted) mentioned per hy- 
pothesis, and the frequency of an S's re- 
maining with or shifting to a new hypothesis 
on Trial » following a confirmed or discon- 
firmed placement on Trial n—1. 

The remaining three measures related to 
hypotheses require some elaboration. The 
first of these classified hypotheses into logi- 
cal types similar to those used by Bruner, 
Goodnow, and Austin (1956). Hypotheses 
were classified as simple, conjunctive, dis- 
junctive, and other compound and miscel- 
laneous types. 

The other treatments of the hypotheses 
depended on the relation of the attributes 
named in the verbalized hypothesis to the 
correct attributes, which was termed the con- 
ditional validity of the hypothesis, after Mc- 
Hale (1962), and is the same as the 
"theoretical value" of a hypothesis (Dulany, 
1962, 1964). The procedure used in com- 
puting the validities was a generalization of 
that used by Dulany and O'Connell (1963), 
and O'Connell (1965). The first of these 
measures, the conditional validity of a hy- 
pothesis, was the probability of a correct 
placement, on the side indicated in the hypoth- 
esis, for all cards fitting the description in the 
hypothesis. The term conditional was used 
since the value of many of the hypotheses 
differs depending on whether the rule is 
applied to the exemplar or nonexemplar side. 
For example, in the disjunctive-concept con- 
ditions, the hypothesis, “cards with all three 
figures solid and red go on the right,” has a 
conditional validity of 1/1 or 1.00 for the 
exemplar side, but its converse, that all cards 
which do not contain three solid-red figures 
go on the left, has a validity of 28/63 or 
45. Note that the computations are based 
on the total set of cards (sampling with re- 
placement) although in fact the set of cards 
diminished after each trial (sampling without 
replacement). The results indicated the 
error involved is small while the gain in 
ease of computation with the sampling with 
replacement model is enormous. y 

A composite validity for each hypothesis 
was also computed. This was identical to 
the conditional validity, except that it indi- 
cated the overall probability of a correct 
placement, over both right and left sides. 
For the sample hypothesis cited above the 
composite validity is (1 + 28/1 + 63) = 29/64 
or .46. 

The particular order or sequence of cards 
an S was exposed to was the referent of the 


PROCESSES IN CONCEPT ATTAINMENT 


next two measures. The point at which all 
the necessary and sufficient information 
(NASI point) (John, 1957) required to at- 
tain the concept has been seen by S was 
computed for each S. This is the minimum 
number of trials in which the concept could 
be completely determined, given the restric- 
tions as to type of hypothesis and number of 
objects involved stated in the instructions, 
and the particular pattern of instances 
actually chosen or seen. 

In addition the minimum number of 
changes in consecutive instances "was re- 
corded for each S (Bruner et al, 1956; 
Glanzer, Huttenlocher, & Clark, 1963). This 
value was obtained by counting the number 
of attribute values changed on each of the 
three figures (1-6) from each trial to 
the last instance in view on the right or left, 
whichever yielded the smaller number. 

Five measures derived from Ss’ placement 
behavior, confidence and importance evalua- 
tions fall outside the concern of this paper, 
but are reported elsewhere (Schwartz, 1965). 
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RESULTS 


Final concept attainment—Table 1 
indicates the considerably greater diffi- 
culty Ss had in attaining the disjunc- 
tive as compared to the simple concept. 
This was reflected in the number of Ss 
attaining the concept, with all but one 
doing so in the simple conditions com- 
pared to only 16 of 36 attaining in the 
disjunctive conditions, x*(1) = 295. 
p < 001, as well as in mean time and 
mean trials to solution where the F 
values (1, 45) for concept effects were 
45.85, p < .001, and 56.48, p < .001, 
respectively. Smaller method, F (2, 
45) = 5.07, p < 02; F (2, 45) = 3.58, 
p <.05, and interaction effects, F (2, 
45) = 6.57, p < 01; F (2, 45) = 6.08, 
p € 01 were noted. 


TABLE 1 
DEPENDENT MEASURES FOR EACH TREATMENT CONDITION 


Treatment 
Dependent Measures 
SSS SRS SRR DSS DRS DRR 
Final Concept Attainment 
No. of Ss Attaining 12 12 11 6 4 6 
Mean Time to Attainment* (minutes) 15.1 15.5 161 ]|42.5 |375 ]|214 
Mean Trials to Attainment* 16.08 |18.50 |19.82 |36.17 40.50. | 25.33 
Measures Related to Hypoth. 
Intraclass Correlation di .883 | .945 .958| .934| .953| .969 
Mean Conditional Validity 571 .584 .609 .613| .646 .618 
Mean Composite Validity .543 | .563 .591 .650| .646| .684 
Slope of Cond. Validity over Trials? "3520| .257| .4224| .140| .149| .284 
Slope of Composite Validity over Trial | 381°] .211*|  .408!| .256 L91| .259 
Proportion Simple Hypoth. .406 | .296 1437 | .199| .316| .148 
Proportion Conjunctive Hrpotbie " z fae "a T Tos ME 
No. of Ss Using Any Disjunctive Hypoth. 
Proportion Misc. Tiypoth. m .177 | 432 458| .342| .346| .473 
P: tion S te 
roportion DPA re 473 | 747 | 539| .725| 825| | 864 
No. of Objects and Values per Hypoth. | 54. | 3.2. | 2.8 four OR 
Proportion Diff. Hypoth. per Trial 698 557 615 461) .418 .380 
Meas ites Related VE Sraa c B 
ini . nsecutive 
Trials E depre 2.40 2.39 2.40 242): 2.17 2.42 
Number of Ss with Indeterminate NASI 
posue a 6 SEED ede Ao f 4 
Note.—N = 12 in each treatment. 
attaining the concept. : 


on =8, 


an = 10. 


ge cated from this analysis because they began their criterion 


ment over trials. An S had to 
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Hypotheses.—For every S an ex- 
pected number of correct placements 
prior to the eight criterion trials was 
computed by multiplying the average 
conditional validity of the hypotheses 
on these trials by the number of such 
trials. The relation between this pre- 
dicted number and the actual number 
of correct placements prior to the cri- 
terion trials was then determined for 
the 12 Ss in all six treatment conditions 
by the intraclass correlation. This 
measure of association was used be- 
cause it was desired to know how well 
the predicted and obtained values 
agreed absolutely, rather than in a 
least-squares sense, as would be shown, 
for instance, by a product-moment cor- 
relation, The results are indicated in 
Table 1, showing intraclass correla- 
tions which ranged from .883 to .969. 

Average conditional and composite 
validities also shown in Table 1 indi- 
cate, highly significant concept-type ef- 
fects, F (1, 66) = 18.38, p < .01, and 
F (1, 66) = 44.94, p < .001, respec- 
tively. Method and interaction effects 
were insignificant. 

There is an indication in Table 1 
that the validities of the hypotheses 
generally increase as the task pro- 
gresses, as signified by the positive 
normalized slopes of the validities over 
trials. 

The breakdown of hypotheses into 
logical types yielded significantly 
greater proportions of simple hy- 
potheses in the simple-concept condi- 
tions, F (1, 66) 27.33, p < 01, no 
differences in use of conjunctive hy- 
potheses, and significantly more Ss of- 
fering disjunctive hypotheses in the 
disjunctive concept conditions, x? (1) 
= 26.89, p < 01. 

. The first-order sequential proba- 
bility analysis of hypotheses behavior 
following a confirmed or disconfirmed 
placement was simplified, since on all 
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but two trials out of the over 1,900 
trials in this study, a disconfirmation 
resulted in a shift to a new hypothesis. 
Following a confirmed placement, how- 
ever, such consistency was not present. 
There was a large concept effect, F 
(1, 66) = 18.17, p < .01, and a sig- 
nificant methods effect as well, F (2, 
66) — 4.37, p « .05. 

Significant concept effects also ap- 
peared in the last two measures related 
to hypotheses shown in Table 1, with 
proportionately fewer different hy- 
potheses offered during the disjunctive- 
concept conditions, F (1, 66) = 29.33, 
p < .01, while the number of objects 
and values mentioned per hypothesis 
was greater in the disjunctive-concept 
conditions, F (1, 66) = 10.50, p «01. 
No significant method or interaction 
effects were found on either of these 
measures. ^ 

Sequence effects—The last two 
measures in Table 1 concern the par- 
ticular sequence of cards Ss selected or 
were exposed to. The NASI point, in 
terms of all possible simple, conjunc- 
tive, and disjunctive hypotheses as in- 
dicated in the instructions to $s, was 
indeterminate for half the Ss in the 
simple conditions, compared to only 
8.5% of the Ss in the disjunctive con- 
ditions, x? (1) = 15.14, p < .01. There 
were no significant differences be- 
tween the three methods of presenta- 
tion. Only two Ss in the simple con- 
ditions failed to attain the concept be- 
fore all possible hypotheses had been 
eliminated, while 75% of those attain- 
ing in the disjunctive conditions did so 
only after eliminating all possible er- 
roneous hypotheses. 

The mean minimum number of 
Changes on consecutive trials for the 
Six treatment groups is indicated in 
Table 1. Note that these means are - 
not independent. For instance, there 
can be little difference between Groups 
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TABLE 2 


RELATION BETWEEN DEPENDENT MEASURES AND TRIALS TO ATTAINMENT (SIMPLE 
CONCEPT) AND ATTAINMENT OF CONCEPT (DISJUNCTIVE CONCEPT) 


Simple Concept ( = 35)* 


Disjunctive Concept (n = 36) 


Trials to Attainment and: i i : 
ial Eu t and Corr. Attained (0) EXIT (1) and: Corr. 
Time to Attainment .922** | Conditional Validity —.460** 
Conditional Validity .207 | Composite Validity —.502** 
Composite Validity 2273 | Slope of Conditional Validity over 
rials —.482** 

Slope of Conditional Validity over Slope of Composite Validity over 

Trials —.414* Trials —.565** 
Slope of Composite Validity over Proportion Simple Hypoth. .262 

Trials —.390* 
Proportion Simple Hypoth. —.125 | Proportion Conjunc. Hypoth. —.515** 
Proportion Conjunc. Hypoth. —.130 | Proportion Disjunc. Hypoth. —.301 
Proportion Compound Hypoth. 2255 | Proportion Compound Hypoth. 516** 
Probability Remaining with Same Probability Remaining with Same 

Hypoth: after Confirmed Place- Hypoth. after Confirmed Place- 

ment .288 ment à —.093 
Proportion Different Hypoth. Proportion Different Hypoth. 

per Trial —.437** per Trial —.035 
No. of Objects and Values per No. of Objects and Values per 

Hypoth. ; 251 ypoth. —.096 
Minimum No. of Changes on Minimum No. of Changes on 

Consecutive Trials —.481**| Consecutive Trials 189 


a One S did not attain the concept in the 


SSS and SRS, or DSS and DRS 
since the —RS group in each case 
received the cards in the same order 
as the yoked Ss in the—SS group ex- 
cept for the few additional cards added 
where the orders from the SS group 
Tag out. The correspondence was as 
expected (Table 1). Since there is no 
freedom in selecting cards in the —RR 
condition, the cards being ordered 
randomly, this condition served as a 
control indicating the minimum num- 
ber of changes expected by chance. 
Again the values were almost identical 
for the two —RR groups. The differ- 
ences between concept types, SSS vs- 
DDS, and SRS vs. DRS, were in both 
Cases suggestive of fewer changes per 
trial in the disjunctive task, t£ (22) = 
170, p < .10; ¢ (22) = 185, P< 10. 
Within the disjunctive conditions à 
comparison of the DSS and DRS 


simple-concept conditions and was eliminated, 


groups with the control DRR group 
yielded significant differences in both 
cases, t (22) = 3.68, p « 01; t (22) 
= 31, p < 01. 

Relation to final attainment.—All of 
the above-mentioned measures were 
correlated with trials to attainment for 
the 35 Ss who solved in the simple- 
concept conditions. Since under the 
disjunctive concept 20 of the 36 5 s did 
not attain the concept, point biserial 
coefficients were computed for Ss in 
these groups. These results are indi- 
cated in Table 2. | 

Yoked subjects.—Since Ss in the 
—SS and —RS groups received the 
same stimulus cards in the same order 
(for as long as the choices made by 
S in the selection group lasted) it was 
of interest to determine how similar 
the groups were on various dependent 
measures. Correlation coefficients be- 
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tween the 12 pairs of yoked Ss for 
the simple- and disjuhctive-treatment 
groups were computed on 13 depend- 
ent measures. Significant correlations 
of .852 (p< .01), 795 (p < .01), .745 
(p < .01), and .560 (p < .05) on trials 
to attainment, time to attainment, mini- 
mum number of changes on consecutive 
instances, and proportion of times an 
S kept the same hypothesis after a 
confirmed placement, respectively, were 
found for the simple-treatment groups, 
while for the disjunctive Ss, the pro- 
portion of conjunctive hypotheses, and 
average conditional and composite 
validities were the only measure to 
show significant correlations, .658 (p 
«.05), .860 (p<.01), and 783 (p 
< .01), respectively. 


Discussion 


Final concept attaimment.—As ex- 
pected, the greater difficulty of the dis- 
junctive concept was apparent in all three 
final CA measures: number of Ss attain- 
ing, time, and trials to attainment. This 
is in accord with previous studies by 
Donaldson (1959), Hunt and Hovland 
(1959), Wells (1963), Braley (1963), 
Glanzer, Huttenlocher, and Clark (1963), 
and Conant and Trabasso (1964). These 
measures also reflected, but to a much 
lesser degree, the effects of variation in 
method of presentation. Under the simple- 
concept conditions the same order of ease 
in attainment, SSS < SRS < SRR, oc- 
curred in time and trials to attainment. 
With the more difficult disjunctive con- 
cept the control DRR group appeared 
superior, suggesting that the strategies 
used in selecting cards in the DSS con- 
dition were inefficient, and in fact worse 
than chance ordering of the cards. 

Hypotheses, report validity, and verbal 
control.—It would seem that the hypothe- 
ses of report validity and verbal control 
have been strongly supported within this 
paradigm. Intraclass correlations be- 
tween observed and predicted number of 
correct placements before criterion trials 
ranged from .883 to .969 for the six 
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treatment groups, with only one value 
below .934. Furthermore, on no more 
than 2 of an aggregate of 1962 trials 
did Ss place the cards other than in 
accord with their stated hypothesis. On 
one common alternative explanation, ver-— 
bal hypotheses and placements are said to 
be correlated as a result of parallel and 
independent conditioning under the same 
reinforcement contingencies. In the pres- - 
ent case, however, we are not dealing 
with a single hypothesis whose strength 
is correlated with the strength of the 
correct response. Over the conditioning 
trials, an array ‘of over 25 different 
hypotheses with different conditional va- 
lidities occurred. | As is clear from intra- 
class correlations ‘of this magnitude, 
hypotheses with conditional validities of 
40 were followed: by correct responses 
about 40% of the time, hypotheses with 
conditional validities of .60 by correct 
placements about 60% of the time, etc. - 

Thus, despite a single contingency for 
correct placements, hypotheses with an 
array of conditional validities did in fact - 
tome under a corresponding array of 
reinforcement contingencies, with no re- 
sulting dissociation of predicted and ob-- 
served placements. No theory of the 
conditioning of verbal statements has f 
been elaborated in such a way as to 
account for an “independent patterning - 
of hypothesis validities and response pet- 
centages that are so closely parallel. Still 
more seriously, it would be difficult to 
find antecedent conditions, reinforcement - 
or otherwise, that would explain how 
placements and verbal hypotheses inde- 
pendently fell into this fairly precise 
degree of trial-by-trial concordance over 
almost 2,000 trials. This high degree of 
both kinds of hypothesis-placement con- 
gruence does follow naturally, howevet, 
from a theory holding that S reports the 
hypothesis mediating the control of his 
placement on each trial. These results 
thus support and extend those of Dulany 
and O'Connell (1963) and O'Connell 
(1965). For this study, they would 
support an interpretation that these 
measures do reflect "processes" instru- 
mental in concept attainment rather than 
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parallel but independent consequences of 
reinforcement. 

The value of assessing verbalized hy- 
potheses on a trial-by-trial basis has been 
further demonstrated by ‘the indications 
that they enter into lawful relations with 
other observable variables. For exam- 
ple, the conditional and composite validi- 
ties are both influenced by the type of 
concept to be attained, With more diffi- 
cult concepts there appears to be more 
opportunity for correlated hypotheses to 
appear, yielding higher’ validities in this 
condition than with the simple concept. 
These validities apparently also rise 
over trials in all treatment combinations, 
particularly for those Ss who are more 
successful in attaining the concept, as 
pointed out by the, correlations —,890 
and —.565 between the normalized slope 
of the validities over trials and either the 
number of trials to attainment (Simple 
groups), or whether S attained the con- 
cept or not (disjunctive groups). 

The smaller number of different hy- 
potheses per trial offered in the disjunc-; 
tive conditions seems related to the 
higher validities occurring there, and the 
fact that Ss shifted consistently after a 
disconfirmation, but only half the time 
or less after a confirmed placement. Hy- 
potheses-with higher validities tend to be 
confirmed more often and therefore re- 
quire less shifting to new hypotheses. 

Results in terms of the classification 
according to logical types suggest that 
Ss first discover the type of concept in- 
volved before identifying the particular 
one required. The simple hypotheses 
were offered more frequently in. the 
simple-concept conditions, disjunctive hy- 
potheses more frequently in the disjunc- 
tive-concept conditions, with no trend for 
the conjunctive-hypothesis class. Simi- 
larly, the difference in number of objects 
and values per hypothesis between con- 
cept treatments follows since by definition 
simple hypotheses contain fewer objects 
and values than disjunctive hypotheses. 

Two aspects of the results of the se- 
quential probability analysis of changes 
in hypotheses following a confirmed or 
disconfirmed placement conclude the dis- 
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cussion of measures relating to hypothe- 
ses. First, the nearly universal behavior 
of changing a hypothesis following a 
disconfürmed placement is surprisingly 
rational. Although it is the only logical 
thing to do under the 100% reinforcement 
instructions, nonetheless we have come 
to expect Ss to conform rarely to the 
most efficient logical model. The rather 
specific and lengthy instructions, plus the 
self-correction procedure used on each 
trial are possible explanations for the 
consistent behavior. 

Perhaps of more theoretical interest is 
the finding that Ss did not consistently 
remain with the same hypothesis follow- 
ing a confirmed placement; depending on 
the conditions Ss changed hypotheses on 
15-50% of the trials following confirmed 
placements. Most mathematical models 
of CA (Bower & Trabasso, 1963; Restle, 
1962) have assumed that S does not 
change his hypothesis following a con- 
firming instance. This assumption ap- 
pears at odds with the present data as 
well as those reported recently by Suppes 
and Schlag-Rey (1965). In Suppes and 
Schlag-Rey's study the conclusions were 
based on very different procedures not 
calling for verbalized hypotheses. 

Selection | strategies.—The strategies 
used by Ss in the selection conditions 
do not easily fall under the "focusing" 
and “scanning” rubrics of Bruner et al, 
(1956), There was little evidence of 
focusing on any card, although under the 
disjunctive conditions Ss tended to select 
cards in an order which contained fewer 
changes from trial to trial than would 
be expected by chance, Furthermore 
there was no significant relation between 
selection of cards with few changes and 
attainment or nonattainment in the dis- 
junctive conditions, and a significant 
trend in the opposite direction. (better 
performance with more changes), as in- 
dicated by an r of —.481 between number 
of changes and trials to attainment, in the 
simple-concept groups. 

Likewise the NASI point did not char- 
acterize a good or poor selection strategy 
as evidenced by the lack of any relation- 
ship between this measure and success 
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in either simple- or disjunctive-concept 
conditions. It is quite apparent from the 
trial-by-trial hypotheses of Ss in the 
simple-concept conditions that they for 
the most part considered only simple or 
conjunctive hypotheses, until they hit 
upon the correct concept, almost entirely 
ignoring possible disjunctive concepts. 
In fact the breakdown into logical types 
indicates only 7 out of 36 Ss in the 
simple-concept conditions using even a 
single disjunctive hypothesis. 

Yoked subjects—Dramatic evidence 
for the importance of the particular se- 
quence of instances an S observes in 
determining his final performance, at 
least in a simple CA task, was seen in 
the .852 correlation between the yoked 
Ss under the simple-concept conditions 
on trials to attainment. The correlation 
was much lower (in fact insignificant) 
in the disjunctive conditions, due perhaps 
to the difficulty of the task and the re- 
sulting low frequency of attainment. 
That the actual orders selected in the dis- 
junctive conditions were inefficient has 
already been suggested. It is quite likely 
then that the variation in “goodness” of 
order in the DSS condition was small, 
therefore affording little opportunity for 
evidence of influence in the paired Ss 
in the DSR condition. One cannot over- 
look the additional alternative that with 
problems of considerable difficulty the 
order of the stimuli presented plays a 
decreasing role, while factors such as 
intelligence, memory, previous experi- 
ence, etc, become stronger determiners. 

In view of the influence of the sequence 
of instances observed in the simple-con- 
cept groups, a question of prime interest 
is “what are the specific variables that 
determine the ‘goodness’ or ‘poorness’ of 
a sequence?", Unfortunately no one 
variable appears as the determining fac- 
tor. Rather a combination of variables, 
involving changes in attribute values 
from one card to the last card in view 
on the exemplar or nonexemplar side 
(— 376 correlation of this variable with 
trials to attainment in the SSS group), 
and a tendency for particular Sequences 
of cards to suggest similar Sequences of 


hypotheses (indicated by the correlation 
of .560 between the conditional validities 
of hypotheses in the yoked Ss in the SSS 
group) appear to have the most influence. 
Perhaps problems of intermediate diffi- 
culty yielding more trials to attainment 
than occurred in the simple-concept, con- 
ditions, but more final solutions than in 
the disjunctive-concept conditions, would 
clarify the picture. 
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in space as a function of stereopsis alo 
and motion parallax alone using a 
motion parallax was induced by a 
training, Ss received 10 sets of 20 di 
and were asked to identif: 
and were immediately inf. 
code used was a rank order. Proce 
except knowledge of results was om 
median and the dispersion of judged 
of actual distance. With pro] 
add significantly to the accu 
equipment and techniques 
cussed relative to an inform: 


The paucity of objects in the visual 
environment man encounters in space 
eliminates all distance cues, except: 
binocular parallax, motion parallax, 
apparent size, convergence, and ac- 
commodation from any significant 
use in space. Of these, accommoda- 
tion is too weak to be useful, and 
apparent size of the target is valueless 
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ormed of the correct answer. 


e accuracy of distance estimation 
ne, stereopsis plus motion parallax, 
motion-picture stereoscope. The 
simulated head motion. During 
screte distance presentations each 


y them as to distance as they were presented 


The identifying 
dure was repeated during testing 
itted. Equations, expressing the 
distance were given as a function 


per training, a cyclical head motion can 
racy of distance and size estimation. 
are described. The moon illusion is dis- 
al observation made during the experiment. 


The 


when S has no target knowledge. 
Thus, only three relevant cues re- 
main: binocular parallax, motion 
parallax, and convergence. This 
study is divided into three experi- 
ments. The first defined the accu- 
racy of distance and size estimation 
in a static environment where neither 
S nor the target move relative to one 
another or to the target background. 
In this situation, motion parallax is 
not relevant. The second defined 
the accuracy of distance estimation 
where a side-to-side head motion was 
simulated. The third experiment was 
identical to the second except that 
one eye was closed, thus eliminating 
stereopsis. 


EXPERIMENT I: 
STEREOPSIS ALONE 


Method 
Subjects.—Twelve male engineers were se- 
lected from the available engineering popu- 


lation on the basis of an eye examination. 
Selection criteria were 20/20 vision (cor- 
rected) and the maximum score of 95% on 
the Shepherd-Fry scale of depth perception 
as administered on the Visuotester. 
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Apparatus.—ln order to present the bi- 
nocular disparity for the cue of binocular 
parallax, the target displacement for the cue 
of convergence, and the target apparent mo- 
tion for the cue of motion parallax, a motion- 
picture stereoscope was constructed. The 
stereoscope consists of three major assemblies: 
a modified 16-mm. movie projector, an image- 
splitting assembly, and a stereo-microscope 
viewing head. 

The projector was a modified 16-mm. 
Kodak analyst with the normal 750-w. pro- 
jection lamp replaced with a 12-w., 6-v. lamp. 
The projector optics were modified by re- 
placing the heat-absorbing glass with a dif- 
fuser. The. image-splitting assembly re- 
placed the normal projection lens. 

Figure 1 details the image-splitting as- 
sembly. The viewing head is from an un- 
modified Wild M-4 stereomicroscope. 

The field of view of the microscope oculars 
is approximately 40°. 


~, 
z 
4 


OA 


— — 
Fie. 1. 


SPLITTER 


The target was a ping-pong ball. The 
visual angle of the target at all distances was 
held constant at 1.9940°. The background 
was a 8 X 20 in. phototransparency made 
from a photograph of a starfield. 

The film sequences were prepared using a 
photographic animation stand equipped with 
Vernac scales which allowed the positioning 
of the target platform to .0001 in. To main- 
tain the same angular size throughout the 
series, the distance between the camera and 
the target was held at a precise value of 
43.2000 in. between the center of the sphere 
and the film plane, and between the surface 
of the phototransparency and the film plane. 

In order to achieve separate images for the 
two eyes, a multiple exposure technique was 


used which produced identical star fields in 
the top and bottom halves of each 16-mm. 
frame. The target image was then positioned 
in the top half frame in order to achieve the 
convergence and retinal disparity associated 


COLLIMATING 
LENSES 


RHOMBOID PRISMS 


LENS-PRISM RELAYS 


no 
4 


Mirror stereoscope. 
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Fic. 2. Full frame Teproduction of 
target presentation, 


with the view of the left eye and in the 
bottom half frame to achieve a similar condi- 
tion for the right-eye view, This was done 
using animation techniques which also per- 
mitted the simulation of motion. The inter- 
pupillary distance simulated was 2.5000 in. 
which is the mean interpupillary distance of 
the Hertzberg Population (Hertzberg & 
Daniels, 1950). A full frame is shown in 
Fig. 2. 

Each distance Presentation consisted of 


reel all 20 distance presentations were printed 
ina different, randomly chosen order. Be- 
tween each distance presentation there were 
5 sec. of film which was blank except for the 
printing of the succeeding rank in the middle 


The target brightness when viewed through 
e projector was approximately 11.35 
candles/ft? with a background of less than 
0.032 candle/ft?, These figures are reported 


Cult to obtain, A more detailed and com. 

plete description of the apparatus is available 

in the origina] report (Dees, 1965). 
Procedure.—During the training, Ss were 


value as each distance Was presented. The 


fourth reel, It should be noted that S$ was 
exercising absolute rather than relative judg- 


ment in giving these rankings. Had S been 
requested to respond with a distance esti. 
mate rather than a rank, the test would have 
been essentially identical, namely, the identi- 
fication of one stimulus value from a set of 
known choices. The use of ranks in this case 
provided an easily learned identification code 
for each stimulus value. As such, the stimu- 
lus values were not actually ranked, but were 
rather identified by the use of a rank-order 
code. During the 5 Sec. between each dis- 
tance presentation, S was requested to look 
away from the apparatus. The E requested 
S to return his attention to the apparatus 
when the next presentation began. At the 
end of the 10 reels, S was permitted to relax 
inany way he wished during the 1 or 2 min. re- 
quired to change reels. During training, the 
reels were presented to half of the Ss in the 
order 1 through 10, and to the other half of 
the Ss in the reverse Order. There was a 
5-10 min. break between training and testing. 

The test trials were identical to the train- 
ing trials except that the order of the presen- 
tation of the reels was randomized by S, Ss 
were not informed of the Correct distance 
rankings, and were instructed to respond 
With their most plausible ranking on each 
presentation without regard to repetition or 
exclusion of ranks, 


Results 


Table 1 gives ràw frequency data 
of all 12 Ss combined. The actual 
rank and distance in feet are given on 


top. Thus the frequency with which 
each distance was judged to be each 
other distance is Presented. 

The regression of the median judged 
on actual distance is in Fig. 3 along 
with the regressions of the upper and 
lower limits within which 95% of the 


rank was filled by the cumulative 
The cumulative percentage 
table was transposed into a table of 
Standard scores, The median and 
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TABLE 1 
BINOCULAR Vision ALONE 


ACTUAL | ACTUAL 
DISTANCE RANK 


JUDGED RANK 


Po ee 
He 


95% limits were then obtained by 


interpolation using the following 
formulas: 
Median Rank 
papas FAW TAE 
|Zz| + |Zel 
95% Limit Rank 
11.96 — |Z] 
= SHR 0.5. 
[Zi — Zal +R + 


In these formulas: 


Zz = The left standard score of the two 
which bracket the 1.96 or zero stand- 
ard score. This assumes that the scores 
were accumulated from the lowest rank 
to the highest rank and from left to 
right. 


Zr = The right standard score of the two 
which bracket the 1.96 or zero stand- 
ard score. The same assumptions as 
for Zr apply. 

Rr = The rank corresponding to the left of 
the two bracketing standard scores. 
The same assumptions apply. 

These formulas yielded a series of 
fractional ranks for the median and 
95% limits. The actual distance 
values for these fractional ranks were 
then obtained by linear interpolation 
between the actual distance values 
for the original ranks. This procedure 
provides a normalized interpolation. 
The Yate's correction of 0.5 is nec- 
essary to offset the error of accumu- 
lation. 
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JUDGED DISTANCE (FT.) 


ACTUAL DISTANCE (FT.) 


Fic. 3. Binocular vision alone. 


The following formulas were ob- 
tained from the regression of judged 
on actual distance. 


lg J = 0.09481 + 1.00417 log A 
(2-4000) [1] 
log Ju = 0.52194 + 1.04150 log A 
(2-300) [2] 
log Jz, = —0.25712 + 0.95060 log A 
(4-2000) [3] 
log A = —0.09442 + 0.99585 log J 
(2-4000) [4] 
log Av = 0.27832 + 1.05197 log J 
(4-2000) [5] 
log Az = —0.50114 + 0.96007 log J 
(2-300) L6] 


The numbers in parentheses below the 
formulas are the limits of the actual 
distances (in feet) within which the 
Tegressions were calculated. In these 
formulas “4A” is the actual distance, 
“J” is the judged distance, and the 
Oe and i772 subscripts signify the 
upper and lower 95% limits, respec- 
tively. These regressions are illus- 
trated in Fig. 3. The dotted lines 
indicate extrapolations based upon the 
values beyond those included in the 
calculation of the regression equations, 

The standard deviations were esti- 
mated for the log normal deviation 
of the arrays about the median values. 
This was done by subtracting the 
appropriate 95% regression formula 
from the appropriate median formula 


Y 
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and dividing by the 95% standard 
score of 1.96. This resulted in two 
formulas for the standard deviation 
of the array of judged. distances about 
the median judged distance. There is 
a standard deviation estimate for devi- 
ations above the median and another 
for deviations below the median for 
both the median judged distance and 
the median actual distance. They 
are given below: 


slog Jy = 0.21792 + 0.01909 log A 
E (2-300) . [7] 

clog Jz = 0.17956 + 0.02733 log A 
(4-2000) [8] 

clog Av = 0.19017 + 0.02863 log J 
(4-2000) [9] 


clog Az = 0.20751 + 0.01826 log J 
(2-300) [10] 


]t should be noted that the predictive 
power of these equations is limited to 


'the three specific cases for which sets 


of formulas are furnished. An equal 
discriminatory (ED) scale could be 
constructed for any probability of 
discrimination desired by using these 
formulas. A 95% ED scale in feet is 
given below (95% probability of one 
point being discriminated from both 
adjacent points) : 
2.00, 5.18, 14.02, 39.71, 

118.09, 369.3, 1,218 


A prior publication by the author de- 
scribes the technique used to con- 
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struct this scale (Dees, 1965). The 
amount of information transmitted 
between 1 ft. and infinity was 2.045 
bits according to the technique used 
by Garner and Hake (1951). There- 
fore, if a criterion of 100% discrimin- 
ability were used, the ED scale would 
have four points rather than seven. 


EXPERIMENT II: 
> 


STEREOPSIS AND MOTION PARALLAX 
This experiment investigated bin- 
ocular parallax, convergence, and 
motion parallax as they pertain to 
the accuracy of distance estimation 
in space. Specifically, the effect of 
stereopsis plus a simulated side-to- 
side head motio was investigated. 
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Method 


Ten previously unused male engineers were 
selected as Ss from the available engineering 
population by using the criteria of Exp. I, 
There were two changes in the apparatus 
described in Exp. I: The films were animated 
to represent the relative motion of the target 
which would occur if S moved his head from 
side-to-side with a frequency of 0.5 cps and 
an amplitude of 1 ft. as measured from the 
center position to either extreme; and the 
distances were changed and reduced to a 
total of 19 presentations because of equip- 
ment considerations. 


Results " 

Table 2 gives the raw frequency 
data of all 10 Ss combined. The 
amount of information transmitted 


between the distances of 8 ft. and | 


10,000 
D 


JUDGED DISTANCE (FT.) 


4 
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ACTUAL DISTANCE (FT.) 


Fic. 4. Sinusoidal head motion with binocular vision. 
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TABLE 3 
SinusorpAL Heap Motion wirH MONOCULAR VISION 


pera (ide lat, 1 Anca 
DISTANCE | RANK 
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infinity was 2.062 bits. The regression log Av = 0.15841 + 1.12108 log J 
(16-4000) [15] 


equations are given below and illus- 
trated in Fig. 4. The lower 95% limit 
should asymptote approximately at 
1,900 ft. judged distance. The upper 
95% limit reaches a judged distance 
of infinity at or before an actual 
distance of 2,000 ft. 


log J = —0.02832 + 1.04126 log A 
(12-2000) [11] 

0.25134 + 1.12039 log A 
(8-1000) [12] 

log J, = —0.14130 + 0.89200 log A 
(16-4000) [13] 


0.02720 + 0.96037 log J 
(12-2000) [14] 


log Ju = 


log A = 


log Ay = —0.22435 + 0.89255 log J 
(8-1000) [16] 

The formulas for estimating the 

standard deviations of the arrays 

of the log distributions are given 

below: 

clog Jy = 0.14268 + 0.04037 log A 
(8-1000) [17] 


clog Jų = 0.05764 + 0.07615 log A 
(16-4000) [18] 

clog Au = 0.06694 + 0.08199 log J 
(16-4000) [19] 


zlog Az = 0.12834 + 0.03460 log J 
(8-1000) [20] 
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EXPERIMENT IIT: 
Motion PARALLAX ALONE 
Method 


Ten previously unused male engineers were 
selected as Ss from the available engineering 
population by using the selection criteria of 
Exp. I and II, 

This experiment was identical to Exp. II 
except that the presentation was blocked 
from one eye. The Ss chose which eye they 
wished to use, but were required to use the 
same eye throughout the test. 


Results 


Table 3 gives the raw frequency 
data of all 10 Ss combined. The 
amount of information transmitted 
between 8 ft. and' infinity was 2.415 


JUDGED DISTANCE (FT.) 


ACTUAL DISTANCE (FT.) 
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bits. The regression equations are 
given below and illustrated in Fig. 5. 


log J = —0.10493 + 1.06869 log A 
(12-2000) [21] 

log Jy = —0.11991 + 1.22827 log A 
(8-1000) [22] 

.log J, = —0.07200 + 0.90546 log A 
(12-4000) [23] 

log A = 0.09818 + 0.93573 log J 
(12-2000) [24] 

log Ay = 0.07952 + 1.10441 log J 
(12-4000) [25] 

log A, = 0.09763 + 0.81415 log J 
(8-1000) [26] 


Fic. 5. Sinusoidal head motion with monocular vision. 


E: 
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The formulas for estimating the 
standard deviations of the arrays 
of the log distributions are given 
below: 


clog Ju = —0.00764+ 0.08142 log A 
(8-1000) [27] 


clog Jı = —0.01680+ 0.08328 log A 
(12-4000) [28] 


clog Ay = —0.00952+0.08606 log J 
(12-4000) [29] 


clog A,=  0.000284-0.06203 log J 
(8-1000) [30] 
DISCUSSION 


This study differs from others in the 
range of distance covered, the accuracy 
of the stimuli, the experimental tech- 
nique, and the data treatment. The 
usual objection to studies of absolute 
distance estimation is that the control- 
ling variables are so difficult to quantify 
that any experimental data are worth- 
less. Under ordinary viewing condi- 
tions this objection is justified. The 
monocular cues such as linear perspec- 
tive are difficult to quantify. However, 
when the line of sight is elevated from 
the horizon to that used from the cock- 
pits of aircraft and spacecraft, the ob- 
jection is no longer valid. When judg- 
ing distance in empty space with the 
background effectively at infinity, the 
only cues are motion parallax, binocular 
parallax, size, convergence, and accom- 
modation. These cues can be quantified. 

The present study is directly appli- 
cable to vision in space and to a lesser 
extent to vision from aircraft. Aerial 
attenuation and the juxtaposition of 
cloud formations will furnish additional 
cues from aircraft. The study ex- 
amined only one target size, one simu- 
lated head motion, one target brightness 
and contrast, and has not been cross 
validated. Cross validation. and an ex- 
amination of the validity of the simu- 
lated head motions are the next steps 


in the research program. After this, the 
variables previously held constant will 
be examined. However, the results of 
this study should, where it is applicable, 
furnish the best definition of the accu- 
racy of distance estimation in space and 
from aircraft presently possible. 

A comparative examination of these 
studies shows that the combination of 
motion parallax and stereopsis is better 
than stereopsis alone, but also that mo- 
tion parallax alone was easily the most 
accurate when S was trained. Using 
both eyes simultaneously requires that 
either the target in the foreground or the 
stars in the background will appear as 
double images. The background is the 
field against which the magnitude of the 
target oscillation is judged. When that 
field is seen as a double image, the refer- 
encing of the target oscillation against 
the background seems to be less accurate. 

Size was eliminated as a distance cué 
in this experiment. However, the size- 
distance invariance (SDI) hypothesis 
states that size and distance judgments 
are inseparable, the one obviating the 
other. This hypothesis has been the 
subject of a great deal of contradiction 
in the literature (Epstein, Park, & 
Casey, 1961). However, assuming this 
hypothesis to be an accepted principle, 
the formulas for the relationship of 
judged to actual distance should be 
readily translatable to the relationship 
of judged to actual size. The following 
digression illustrates this: 


64 = Actual visual angle of target 
6, = Judged visual angle of target 
AD = Actual distance of target 
JD = Judged distance of target 
AS = Actual size of target (diameter) 
JS = Judged size of target (diameter) 
tan ĝa = AS/AD by definition 
tan 0; = JS/JD by definition 
Or = 0; 
AS/AD = JS/JD 
and: AS/JS = AD/JD. 


Perceptually : 


Taking Formula 1 as an example with 
adaptations in the nomenclature: 
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antilog (0.09481 + 1.00417 log AD) 


JS/AS = 


JS 


AD 
_ AS [antilog (0.09481 + 1.00417 log AD)] 


The other distance formulas are also 
translatable in a similar manner. 

The author noted a peculiar inter- 
pretive error during the training for 
Exp. I which may cast some light upon 
the moon illusion. Many Ss completely 
reversed the order of judged distance on 
the first set of presentations and were 
confused by E's insistence upon their 
correct ordering. Their line of reason- 
ing was this: (a) E said that the target 
was of a constant size, (b) the target 
appears to change size as a function of 
distance, (c) therefore, when the target 
is larger it must be closer. This is nor- 
mally a logical argument. However, a 
more correct argument based upon a 
slightly different premise would be: (a) 
the angular size of the target is con- 
stant, (b) the target appears to change 
in size as a function of distance, (c) 
therefore, where the target appears 
larger the stereopsis cues must be yield- 
ing the automatic interpretation. that 
the target is farther away. This second 
argument is the correct line of reason- 
ing. The Ss used the other line of rea- 
soning at first because the situation 
where an object changes its apparent 


AD 


distance without changing its angular 
size is seldom experienced in everyday 
life. However, it was experienced in 
this experiment, and it is experienced in 
the moon illusion. The cues which 
would shift the perceived distance of the 
moon farther away when it is seen at 
the horizon are monocular, but the 
faulty reasoning automatically applied 
to the target in Exp. I applies with 
equal facility to the moon illusion. 
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LIST DIFFERENTIATION AND DISTRIBUTED PRACTICE? 


JOHN P. HOUSTON 
University of California, Los Angeles 


In a 2X 2 design, Ss learned 3 paired-associate lists by either massed 
(MP) or distributed (DP) practice and then learned a 4th list by 


either the same or the alternative 
tested after 24 hr. 
retention following 4th-list MP. 


method.  4th-list retention was 


Retention following 4th-list DP was better than 


Retention in those conditions 


involving a switch in practice procedure between Lists 3 and 4 did 
not differ from the retention of Ss who learned all 4 lists by the 


same procedure. 


In an investigation of the effects of list 
differentiation and distributed practice upon 
retention, Houston and Reynolds (1965) 
tested first-list recall in an A-B, A-C situa- 
tion. A-C learning was either by massed 
(MP) or distributed (DP) practice while 
A-B and A-C were learned either by the 
same or by different practice methods. The 
results indicated that first-list recall was 
better when A-B and A-C were learned 
by different procedures than when they were 
learned by the same procedure. Recall fol- 
lowing distributed A-C learning did not dif- 
fer from recall following massed A-C 
learning. 

In the present study S's learned four paired- 
associate lists conforming to an A-B, A-C, 
A-D, A-E situation. Fourth-list retention 
was tested after 24 hr. The fourth list was 
learned either by MP or DP. The first 
three lists were all learned by either DP 
or MP. Thus, the experiment involved a 
2x 2 design with the main effects referring 
to massed vs. distributed fourth-list learning 
and same vs. different practice procedures 
(MP-DP, DP-MP, MP-MP, DP-DP). 

According to the successive extinction- 
recovery hypothesis discussed by Underwood, 
Keppel, and Schulz (1962) the two condi- 
tions involving distributed fourth-list practice 
(MP-DP, DP-DP) should yield better 
fourth-list retention than the two conditions 
involving massed fourth-list practice (DP- 
MP, MP-MP). The list differentiation 
interpretation predicts better fourth-list re- 
tention in the two conditions containing dif- 
ferent practice procedures (MP-DP, DP- 
MP) than in the conditions containing only 
one practice method (MP-MP, DP-DP). 


1 This work was supported by research grants (MH 
10612-01, MH 11199-01) from the National Institute 
of Mental Health, Public Health Service. 
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There are at least three important differ- 
ences between this design and the one used 
by Houston and Reynolds (1965). First, in 
the hope of maximizing the kind of associa- 
tive interference which seems to be neces- 
sary for DP effects to appear, Ss in this 
experiment were required to learn three 
rather than one list prior to learning the 
critical list. Second, a 24-hr. rather than a 
1-min. retention interval was employed in an 
attempt to increase the opportunity for dif- 
ferential recovery of the responses from the 
earlier lists. Finally, so that the situation 
would be more comparable to previous 
studies, a proactive rather than a retroactive 
design was used. 

Method—With some exceptions the 
method closely paralleled that of Exp. III 
reported by Underwood, Keppel, and Schulz 
(1962). The Ss were 96 University of 
California undergraduates whose participa- 
tion was in fulfillment of a course require- 
ment. The four conditions were arranged in 
24 blocks such that each occurred once in 
each block. The Ss, 24 to a condition, were 
assigned to this arrangement as they ap- 
peared in the testing situation. The lists 
were those used and described by Under- 
wood, Keppel, and Schulz (1962). The 
eight stimuli were nonsense syllables and the 
four sets of eight responses were two-syllable 
adjectives. Only one pairing of the stimuli 
and responses was used. Lists were not 
counterbalanced across list positions. "Three 
random orders of the lists were developed 
to minimize serial learning. Each of these 
orders was used as the starting order equally 
frequently. Fourth-list recall and relearn- 
ing was begun at that order which S would 
have received had he had one more learning 
trial. All learning was by the anticipation 
method at a 2:2 sec. rate on a Stowe drum. 
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The MP intertrial interval was 4 sec. and 
the DP interval 1 min. The DP interval 
was filled with a digit cancellation task. A 
1-min. interlist interval was employed. Each 
list was presented for five anticipation trials. 
Twenty-four hours after fourth-list learning 
the fourth list was presented for a recall and 
four additional MP relearning trials. 

Results and discussion—The mean num- 
bers of correct responses given by the MP- 
MP group on Lists 1, 2, and 3 were 15.71, 
13.38, and 14.67, respectively. The means 
for the DP-DP condition were 18.29, 17.54, 
and 1842. The DP-MP and MP-DP means 
were 18.58, 14.04, 1942 and 17.33, 14.95, 
17.04, respectively. A repeated-measures 
analysis of variance indicated that the groups 
which later received massed and distributed 
fourth-list practice did not differ in terms of 
List 1-3 learning, F <1. Similarly, the 
groups which later received the same or dif- 
ferent practice procedures did not differ, 
F<1. The list main effect was significant, 
F (92, 256) =7.57, p « 01. 

The mean numbers of correct responses 
given by the MP-MP, DP-DP, DP-MP, 
and MP-DP groups during fourth-list learn- 
ing were 17.67, 20.63, 2158, and 18.04. 
Neither the same vs. different nor the MP 
vs. DP main effect was significant, Fs < 1. 
The interaction barely missed significance 
at the .05 level, F = 3,90, 

The mean numbers of fourth-list responses 
recalled correctly after 24 hr. by the MP- 
MP, DP-DP, DP-MP, and MP-DP groups 
were 1.58, 1.58, 1.25, and 225. As was 
true in the Underwood, Keppel, and Schulz 
Exp. III, an analysis of variance applied 
to these raw recall data did not reveal a 
significant fourth-list practice effect, F 
—204. Neither the same vs. different, 
F<1, nor the interaction effect, F= 2.04, 
was significant. Because of the probable dif- 
ferences in fourth-list learning these recall 
data were adjusted by the single-entry suc- 
cessive probability analysis discussed by Un- 
derwood (1964). The resulting mean loss 
scores for the MP-MP, DP-DP, DP-MP, 
and MP-DP conditions were 4.33, 4.88, 5.84, 
and 3.96. Following this adjustment the 
MP vs. DP main effect was significant, 
F (1, 92) = 4.03, p<.05. The same ys, 
diferent main effect was not significant, 


SUPPLEMENTARY REPORTS 


F<1. Thus, the results of this experiment 
replicate the Underwood, Keppel, and Schulz 
finding of superior fourth-list retention fol- 
lowing distributed fourth-list practice but 
fail to support the prediction based upon 
the list differentiation interpretation of dis- 
tributed practice effects. 

Following the correction for differences in 
learning, the interaction effect was also sig- 
nificant, F (1, 92) = 13.28, p<.01. The Ss 
learning the first three lists by DP showed 
greater loss scores than did Ss who learned 
the first three lists by MP. This suggests 
that materials learned by DP may in some 
sense possess greater interfering properties 
than materials learned by MP. 

The effects of DP and list differentiation 
upon fourth-list relearning were evaluated by 
subtracting, for each S, the number of 
correct fourth-list learning responses from 
the number of correct fourth-list relearning 
responses. The mean values were 5.00, 
4.17, 7.29, and 3.67 for the MP-M P, DP-DP, 
MP-DP, and DP-MP conditions. Neither 
the practice effect, F — 1.18, nor the list dif- 
ferentiation effect, F «1, was significant. 
The interaction was not significant, F — 3.02. 

Responses from the first three lists given 
as responses during fourth-list learning pre- 
sented no consistent pattern and were too 
few to be useful. The numbers of these 
responses given during fourth-list relearn- 
ing were 53, 32, 46, and 44 for the MP-MP, 
MP-DP, DP-DP, and DP-MP conditions. 
Although these numbers are also small, they 
do conform to the predictions of both in- 
terpretations. — That is, the fourth-list DP 
groups produced fewer intrusions (78) than 
did the MP groups (97), and the conditions 
involving a change in practice procedures 
yielded fewer (76) than did the groups who 
learned all lists by the same procedure (99). 
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VISUAL HEMIFIELD DIFFERENCES AS CONFLICTS 
IN DIRECTION OF READING? 


B 


E. RAE HARCUM 
College of William and Mary 


This study verifies and extends the c: 


onclusions of Harcum and Finkel 


(1963), who attributed right-left hemifield differences in the visual 
perception of words to a conflict in the directions for scanning the 


visual patterns. Orientation and s 


equence of letters in meaningful 


English words, presented to the right or left of fixation, were varied 
together or separately. Generally, when the directional characteristics 
of the words did not agree with the normal direction for reading 


English, perception was less accurate. 


Harcum and Finkel (1963) demonstrated 
that the usual greater perceptual accuracy 
for English words presented to the right of 
fixation is reversed if mirror images of the 
words are shown instead. This was at- 
tributed to differences in the directions for 
perceptually scanning these stimuli. Direc- 
tions of scanning can follow O's normal di- 
rection of reading (Heron, 1957), as well 
as the intrinsic characteristics of the stimulus 
(Harcum & Smith, 1963). For words 
printed normally in the right visual field, 
scanning tendencies from both sources agree 
in direction. For such words to the left 
of fixation, the scanning tendencies con- 
flict. Assuming a primacy effect on the 
accuracy of the scanning, there should be a 
larger difference in accuracy for left and 
right halves of words when scanning ten- 
dencies agree in direction. - Harcum and 
Finkel (1963) obtained this result for both 
normal and mirrored words, when Os knew 
before exposure whether the word was nor- 
mal or reversed. The present study in- 
creases the generality of these findings by 
withholding preexposure knowledge of stim- 
ulus conditions, maintaining a constant pre- 
exposure set. Also, stimulus directionality 
produced by orientation and by sequence of 
letters is varied independently. 

Method.—Eighteen Os observed 120 eight- 
letter English words, equally distributed 
randomly to left or right sides of fixation 
under one of four conditions. In Cond. BC, 
both the orientation and the sequence of 
letters were correct with respect to written 
English. That is, the words were printed 
normally. For the BR condition both the 

1 This investigation was supported by a Public 
Health Service research grant (HD 00207-06) from 


the Institute of Child Health and Human Develop- 
ment. The author thanks Dorothy a Dodd and 
Glaus: Hines for collecting and ulating these 
lata. 
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orientation of individual letters and the se- 
quence of the letters were reversed. — Thus, 
this condition was a complete left-right mir- 
ror image of the BC condition. The SR 
condition employed letters in the correct 
orientation for English, but the sequence of 
letters was reversed so that the words were 
spelled backward when read from left to 
right. The LR words involved correct letter 
sequence, but the left-right orientation of the 
individual letters was reversed. Under each 
of the above conditions the normal upright 
orientation of the letters was not affected. 

Apparatus and methods were essentially 
identical to those of Harcum and Finkel 
(1963). Prior to exposure O did not know the 
condition or hemifield of the stimulus. When 
he was properly fixated and ready, he pressed 
a telegraph key which initiated the exposure. 
Immediately after the exposure O attempted 
to reproduce on a prepared data sheet the 
stimulus exactly as it appeared, with correct 
hemifield and with correct orientation and 
position of the letters. Letters were scored 
correct, according to actual position within 
the stimulus, if they were reproduced within 
one position of the correct location, regard- 
less of the reproduced orientation. 

Results and discussion. —Figure 1 shows 
for all Os the means of errors at each letter 
position. Seventeen of the Os reproduced 
BC stimuli in the right hemifield more ac- 
curately, with 1 O showing no difference. 
Eleven Os revealed superior BR reproduc- 
tion in the left field. All of the 10 Os who 
showed a change of hemifield superiority 
changed from right superiority with normal 
words to left superiority with mirrored 
words. Although the LR condition is a 
mirror image of the SR condition, 14 Os 
exhibited superiority of the right hemifield 
under the LR condition, with 2 equal, and 
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Fic. 1. Means of errors for each letter position under 


the four conditions of letter orientation. 


17 Os exhibited right superiority under the 
SR condition. 

The left and right halves of BC words 
show about equal accuracy when these words 
appear at the left, but there is superior re- 
production for the letters closer to fixation 
for words on the right of fixation. The 
BR condition shows the reverse of these 
results. The SR results essentially dupli- 
cate those for the BC condition, except for 
a greater number of errors. When O does 
not have prior knowledge of condition, the 
SR stimuli are essentially nonsense words, 
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printed normally. The LR condition repre- 
sents the lone exception to the conclusion 
that the more symmetrical distribution of 
errors is related to more errors, probably 
because such words on the left are not 
strongly scanned in a direction which would 
produce meaningful words. For these words 
on the right, one direction of scan produces 
meaning, which would increase the per- ' 
ceptual accuracy. 

The present findings corroborate and ex- 
tend those of Harcum and Finkel (1963). 
Even when specific stimulus conditions are 
unknown, the perceptual accuracy is greater 
when scanning tendencies produced by stim- 
ulus characteristics and by reading experi- 
ence agree in direction. This interpretation 
of hemifield differences is offered as an 
alternative to the notion of a simple hemi- 
spheric dominance. 
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LATENCY COMPONENTS IN TWO-CHOICE RESPONDING * 


D. H. TAYLOR 
Reading University 


Donders’ (1868) classical b- and c-reactions were compared with 2 
similar conditions in which stimulus discrimination was reduced to 
the detection of perfectly detectable stimuli, so enabling the latencies 
associated with stimulus discrimination and response choice to be 
studied separately. An additive hypothesis of RT components would 
predict that these latency distributions should add together in the full 
2-choice situation. In each of the 4 conditions, 8 Ss each gave 32 
RTs. Latency distributions were described by their minima and 1st 
3 moments. The data were consistent with the additive hypothesis. 
The component latency distributions could be fitted by a negative 


binomial function. 


The century-old work of Donders 
(1868), in which he showed that inter- 
mediate between the simple RT (a- 
reaction) and the two-choice RT (b- 
reaction) was a third situation, the 
c-reaction, in which a selective re- 
sponse was required to one of two 
stimuli, is one of the best-known ex- 
periments in psychology. However, 
in spite of its classic status, interpreta- 
tion of the result has led to little be- 
yond the original assertion by Wundt 
and his. associates that component la- 
tencies in RT are additive and sepa- 
rable—a notion which was strongly 
attacked soon after its inception, but 


1 This work was supported by a grant 
from the Science Research Council for a 
project entitled “Presentation of information 
in skill situations.” 


which today is implicit in most quan- 
titative theories of RT. The difficulty 
of interpretation of the Donders ex- 
periment is perhaps its simplicity: 
only three average RTs are involved, 
and the measured values although re- 
peatable offer little scope for theoreti- 
cal development. The main interest 
lies in the order of the three values, 
but this could have been expected even 
in Donders’ time. The main devel- 
opments of the original experiment 
have involved generating more data 
by introduction of other variables into 
the selective response situation, such 
as number of stimulus alternatives 
(Brebner & Gordon, 1964; Fitts, Pe- 
terson, & Wolpe, 1963), or S-R com- 
patibility (Broadbent & Gregory, 
1962). 
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The revival of interest in RT theory 
due to the introduction of stochastic 
latency models (discussed and re- 
viewed by McGill, 1963), returns at- 
tention to the possibility that additive 
models of basic RT situations may be 
profitably used to investigate the na- 
ture of the underlying processes. The 
Donders experiment is well suited to 
an examination of additivity of com- 
ponent latency distributions since it is 
already widely regarded as a frac- 
tionation of the basic choice situation. 
The experiment to be described repeats 
the Donders situations and introduces 
another method of fractionating the 
two-choice response. Distributional 
measures of RT (Taylor, 1965a, 
1965c) are used to investigate the na- 
ture of possible component latency 
distributions in choice responding. 

The c-reaction, in which a response 
is required to only one of two possible 
stimuli, is usually regarded as a frac- 
tion of the two-choice situation (b- 
reaction), which includes stimulus dis- 
crimination, but not response choice. 
Another possibility, less often consid- 
ered, is a fraction including response 
choice but excluding stimulus discrimi- 
nation. Since any forced-choice re- 
sponse must be based on some kind of 
discrimination, such a situation is in a 
strict sense impossible. However, a 
signal-detection situation in which the 
detection is made very easy can be 
made to serve as a good approxima- 
tion. This may be done by allowing 
an event to occur (such as an auditory 
warning signal) on each of a series 
of trials, and adding or not adding a 
visual stimulus to this event, requiring 
one response on trials when the stimu- 
lus appears, and another when it does 
not. The resulting detection task in- 
volves as little as is practically possible 
of what is ordinarily meant by stimu- 
lus discrimination. If such an ap- 
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. when the stimulus appears, and to. 


, response is not required on every trial - 


proximation is acceptable, then a third 
fractionation becomes possible: in - 
which neither stimulus discrimination 
nor response choice is present. In a 
situation similar to that just described, 
S is required to respond only on trials 


withhold his response when it does. 
not. The situation differs from the 
simple RT (a-reaction) in that the 


‘(equivalent to adding a large number | 
of catch trials to the a-reaction). 1 
An additive model of component la- 
tencies would predict that the increase 
in latency of the full two-choice b- 
reaction over the last described situ- 
ation should equal the sum of the 
component latencies obtained in a simi- 
lar way from the other two fractions— 
corresponding to stimulus discrimina- 
tion and response choice. The addi- - 
tivity should moreover extend to all 
measurable properties of the compo- 
nent latency distributions, and not E 
merely to their means. Some confir- 
mation of an additive hypothesis was 
obtained in an exploratory study ~ 
(Taylor, 1965b), and the present ex- ! 
periment was designed as a test of this 
a priori hypothesis. 


. METHOD 


In a basic two-choice situation, the stimuli 
were the colors red or green produced as” 
lin. diameter disks by a digital-display 
indicator. Responses were given by pressing — 
microswitch pushbuttons with the forefingers. 
A warning signal consisting of a 500-cps - 
tone began at a variable interval before the. 
stimulus and lasted until it appeared. The 
interval was either 0.8, 1.0, 1.3, or 1.5 Seca 
in random order with equal frequency. The 
stimulus remained illuminated until a re- 
sponse was made. Four main experimental. 
conditions were derived from this situation, 
as follows: b—the two colors red or green. 


one response to each color (i.e, 
b-reaction) ; c—stimuli as above, 
but response to one color only, with th 
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preferred hand (i.e, a Donders’ c-reaction)?; ` 


b'—one color on half the trials, no light on 
the remainder, in random order, one-response 
to each eventuality; c/—stimuli as for b’, 
but response only to the color, with the pre- 
ferred hand.  Thirty-two Ss were used, all 
student volunteers aged 18-26 yr., 14 female. 
Eight Ss were randomly assigned to each of 
the above four treatments, and each per- 
formed one run of 64 trials. The basic 
experimental data were the 32 RTs given 
by the preferred hand in each run, measured 


with a resolution of 2 msec. by a suecally 


constructed automatic timer. 

In addition, each $ performed two E 
runs, one before and one after the experi- 
mental run. These were: a;—the preferred- 
hand stimulus as in the experimental run, 
given on every trial; a—the same, but with 
the warning tone continuing until the re- 
sponse was given (to check on the possibility 
that Cond. a: could be performed using the 
auditory cue alone). Thirty-two trials were 
given in each of these conditions. The order 
of the two control runs, and the allocation of 
the colors red or green to the preferred hand, 
were balanced randomly within each group 
of eight Ss. Within runs, trials were given 
at intervals of approximately 6 sec. A 
dozen or so practice trials were allowed 
before each run. The S’s errors (i.e., selec- 
tion of wrong response or responding when 
not required) were signaled to him instan- 
taneously by a buzzer. Trials on which 
errors were made were repeated later in the 
run. Instructions to all Ss were to respond 
as quickly as possible consistent with reason- 
able accuracy. 


RESULTS 


Each experimental run provided an 
RT distribution, of nominal size 32, 
made up of correct preferred-hand re- 
sponses. The following measures were 
computed for each distribution: mini- 
mum, mean, SD, and the cube root of 
the third k-statistic, here called CD, 
for “cube deviation” (by analogy with 
SD which is the square root of the 
second k-statistic). 

The subsequent treatment of the data 
is based on the following arguments. 


2 The stimulus to which no response was 
required was extinguished automatically 
after .5 sec. 


483 


TABLE 1 


SUMMARY OF DISTRIBUTION PARAMETER 
ESTIMATES (MSECc.) 


Averages Medians 
Cond. Errors* 
Minimum| M SD CD 
e 224 273 | 41 40 0 
c 249 331 59 52 3.9 
b' 246 338 | 63 57 3.3 
b 267 376 74 63 3.1 
a 187 239 39 52 S 
az 207 255 36 39 — 


* Expressed as percentage of all trials on which there 
was a possibility of an incorrect manual response. 


Experience with mean RTs often shows 
that consistency within runs is much 
greater than the repeatability of runs 
within experimental conditions. This 
suggests that individual RTs from dif- 
ferent runs within the same experi- 
mental conditions should not generally 
be regarded as samples from the same 
population, and hence that the error 
variance appropriate for testing the ef- 
fects of experimental conditions should 
be estimated from the within-condition 
run means, and not from the within- 
run variances, The same argument 
applied to the other measures avoids 
the difficult task of estimating their 
sampling variances. To obtain popu- 
lation estimates, the averages of the 
run minima and run means were used ; 
for SD and CD, the medians were 
found to give reasonably consistent re- 
sults, less dependent on occasional out- 
lying values. The estimates obtained 
from the four experimental conditions 
and the two control runs are sum- 
marized in Table 1. 

The results from the two Control 
Cond. a, and a, resemble each other 
closely except that the a, distribution 
is delayed by some 20 msec. relative 
to the a, distribution. For both min- 
ima and means, analysis of variance 
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TABLE 2 


ANALYSES OF VARIANCE/COVARIANCE 
FOR MiNIMA AND MEANS IN 
EXPERIMENTAL CONDITIONS 


Source SS df MS F 
Minima 
Variance 
Conditions 7301 3 2434) 3.4* 
Residual 19905| 28 7u 
Covariance (b — 0.7) 
Conditions^ 8307 3 2769) 6.4** 
Residual’ 11670] 27 432 
Means 
Variance 
Conditions 43840 3 | 14613 | 17.2% 
Residual 23787 | 28 850 
Covariance (6 = 0.8) 
Conditions’ 42909 3 | 14303 | 31.2%" 
Residual’ 12373 | 27 458 
* 5 «.05 
**5c.01 


showed that the condition effect was 
significant ($< 05), and also that 
no effect of the order of the two control 
conditions existed, 

The distribution obtained in the e 
condition is similar to those obtained 
in the control conditions, but delayed 
a further 20 msec. relative to the a, 
condition. All measures of the c and 
b' distributions are larger than those 
obtained in the œ condition, and the 
measures of the b distribution are 
larger still. The structure of the ap- 
parent differences was examined by 
analysis of variance, Because of the 
Separate groups of Ss used for each 
condition, the residuals in ordinary 
analyses of variance of the minima 
and means of the four experimental 
conditions were inflated by S differ- 


TABLE 3 


ADJUSTED Estimates OF MINIMA AND MEans 
IN EXPERIMENTAL CoNprTIONS, WiTH 
5% CONFIDENCE Limits (Msec.) 


Minimum] CL+ M 


CcL+ 
oH) alee 
c 3 330 1 
b’ 253 19 344 19 


266 | 17 | 373 | 99 
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ences. However, considerable reduc- 
tions in the residual variances could be 
obtained by using the control results 
as covariates in analyses of covariance. 

Table 2 summarizes the analyses of 
variance/covariance for the minima 
and for the means, using as covariates 
S averages of the corresponding meas- 
ures obtained from the two control 
conditions. 

Slight differences in the S-group 
values of the covariates led to adjusted 
condition estimates, based on the 
analyses of covariance. These are 
shown in Table 3, together with 5% 
confidence limits derived from the 
within-condition variances reduced by 
the covariance components of the indi- 
vidual regressions. 


TABLE 4 


NowrNAL 90% MEDIAN CONFIDENCE INTER- 
VALS FOR SD AND CD ESTIMATES IN 


EXPERIMENTAL CONDITIONS (Mskc.) 
SD cD 
ç 26-51 29-54 
c 52-68 43-66 
b’ 51-69 33-77 
b 60-82 46-85 


Comparison tests on the mean esti- 
mates indicated the structure c <c 
—b'«b, p< 05 for the differences, 
for both adjusted and nonadjusted 
estimates. A similar structure is sug- 
gested by the minimum estimates, but 
the statistical evidence was equivocal, 
especially in the case of the adjusted 
data. 

For the SD and CD values, median 
interval estimates were obtained, using 
binomial probabilities, The nominal 
90% confidence intervals are shown 
in Table 4. 

For the SDs in Table 4, the condi- 
tion effects appear to have the same 
Structure as that shown by the means. 
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TABLE 5 
Component Latency DISTRIBUTION ESTIMATES (MsEc.) 


Latency Component Minimuma M^ k2 (msec.*) ks (msec.?) 
(c — c!) 25. (22) 58 (58) 1800 76600 
(b' — c^) 22 (31) 65 (72) 2288 121200 
(c — e^)* 
(b' — c) 47 (53) 123 (130) 4088 197800 
(b — c^) 43 (44) 103 (101) 3795 186000 


« Figures in parentheses derived from adjusted estimates. 


The CD intervals are too large to per- 
mit any firm conclusions to be drawn. 

Additivity of distribution parame- 
ters.—The additivity hypothesis would 
predict that the latency distributions 
added to the basic c' distribution by, 
respectively, stimulus discrimination 
(c) and response choice (b’), should 
sum to the distribution obtained by 
adding both elements at once (b). 
The stimulus, response, and joint la- 
tencies may be represented by (c — c’), 
(b' — €), and (b —c'), respectively, 
where the symbols represent additive 
distribution measures (in the present 
case the minima, means, variances, and 
k,). The observed values of the joint 
latency (b —c') should be approxi- 
mated by the predicted values (c — c’) 
+ (b'—c). Table 5 shows the ob- 
served and predicted values obtained 
from the summary data in Table 1 and 
the adjusted data in Table 3. 

The predicted values are in each 
case slightly higher than the observed 
values. 

Statistically, the truth of the addi- 
tivity hypothesis for the parametric es- 
timates of minima and means is equiva- 
lent to the truth of 


b—c—be-0, . [I] 


which may be tested by computing its 
variance component and comparing it 
with the residual. 

The 20% confidence limits for the 
comparisons, based on the adjusted er- 


ror variance, were about + 30 msec. 
for both minima and means. The ob- 
served values of Equation 1 are — 4 
and — 20 msec. for minima and means, 
respectively. The hypothesis of addi- 
tivity is therefore not rejected by these 
data. The extent of agreement shown 
by the predicted and observed values 
of k, and b, in Table 5 is of similar 
order, though not testable by the same 
means, 


Discussion 


It is of interest to consider what the 
"component latency distributions" ob- 
tained in the first two rows of Table 5 
represent. If the additivity hypothesis is 
correct, they correspond to the latencies 
characteristic of the processes of stimu- 
lus discrimination and response choice. 

Garner (1962) suggested that a dis- 
criminative process is involved in re- 
sponse choice, and the close similarity in 
the present data between the two compo- 
nent distributions might therefore sug- 
gest that each represents a basic discrimi- 
native process. When the two processes 
are combined in the b condition, the addi- 
tivity hypothesis implies that they are 
stochastically independent. The observed 
departures from exact additivity are in 
a direction consistent with a small co- 
variance between the two components. 

The skew of a frequency distribution 
may be expressed by a parameter g,, con- 
veniently computed as (CD/SD)?. Its 
value for each of the two component dis- 
tributions is about 1.1. This is close to 
the g, that would be obtained from a 
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gamma distribution of order three; thus, 
following Restle’s (1961) “neural series 
hypothesis,” each process might consist 
of three independent neural units with 
exponentially distributed latencies. This 
conjecture does not, however, explain the 
observed minimum values of the com- 
ponent distributions (about 25 msec.), 
since the basic gamma distribution has 
zero minimum. A slight modification 
to the argument may be made to account 
for the observed minima. 

Consider a delay mechanism with a 
geometric latency distribution 


P(kt) —pqU-0, k—12.... [2] 


This has a minimum latency equal to 
t. An example of such a mechanism 
might be a unit which fires with latency 
t after initial excitation, and thereafter 
continues to fire with period t, having a 
constant probability p of triggering an- 
other unit on each of its discharges, A 
chain of n such units would generate a 
negative binomial latency distribution with 
minimum latency nt. Fitting the three 
parameters of a negative binomial distri- 
bution of this kind to the four observed 
parameters of the component latency dis- 
tributions (by trial and error) gives a 
remarkably good fit for the values n = 2, 
t= 12-13 msec, p=4. However, the 
fact that a simple latency model can be 
made to fit these particular data is no 
indication of its validity, but only an 
illustration of how such data might be 
explained. 

The distributions obtained in the two 
Control Cond. a, and a,, and in the c 
conditions, appear to differ by latency 
components which are almost constants. 
This is shown by the observations that 
the minima and means change by ap- 
proximately the same amounts, and the 
higher-order measures do not change ap- 
preciably. This apparent “shift” of la- 
tency distributions of RT has been ob- 
served before, in connection with varying 
stimrlus probabilities ( Taylor, 1965b). Tt 
could be represented with the model out- 
lined above by setting p= 1 in Equation 
2, but no specific reason for its appear- 
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ance in the present situation can be 
suggested. 

The latency distributions of Cond. a,, 
a,, and c', assuming them to be alike in 
respects other than their location, are 
quite similar in these other respects to. 
the component latency distributions al- 
ready discussed. Any suggestion of a 
functional similarity between the under- 
lying mechanisms would, however, imply 
that negligible variability is introduced 
into the total RT by its peripheral ele- 
ments (in this case the retina and the 
finger muscles)—a point on which fur- 
ther evidence would be desirable. 

There is no evidence in the present data 
of intermittent functioning of relatively 
long period, such as the 50 msec. sug- 
gested by Stroud (1955). In order to 
explain the observed positive skew, the 
rectangular distributions postulated by 
Stroud would have to be modified by the 
introduction of a probability (Christie & 
Luce, 1956; Taylor, 1965a), which would 
raise the predicted variances high above 
those observed. The repetition frequency 
of the presently suggested model (about 
80 cps) is more characteristic of neu- 
rone discharge frequencies (cf. Jasper, 
Ricci, & Doane, 1958) than of grossly 
intermittent functioning. One other al- 
ternative explanation of the properties 
of skewed distributions such as those ob- 
Served, as convolutions of normal and 
exponential distributions (Hohle, 1965), 
cannot be discounted at this stage. 

Although the present data contraindi- 
cate parallel operation of stimulus and 
response mechanisms, the result obtained 
may not generalize to other situations, 
eg. involving overlearned or highly 
compatible tasks (cf. Broadbent & Greg- 
ory, 1962). It remains to be seen how 
far the additive hypothesis, with its im- 
plication of serial processing, will ex- 
tend in these directions. 

The following general points emerge 
from consideration of the present data. 
The concept of additive latency compo- 
nents appears once again, 100 yr. after 
its inception, to be useful in the study of 
simple RT processes. Sufficient of the 
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properties of component latencies can be 
distinguished to make inferences possible 
at a level on which quantitative neuro- 
logical studies will be of direct relevance. 
Many possibilities exist for reexamina- 
tion of RT situations which have long 
been thought to be too simple to yield fur- 
ther information about mental processes. 
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tive height (response slope) in arbitrary 
units and latency in milliseconds of the eye- 
lid CRs of the instrumental reward, OP, 
and classically reinforced, CL, groups. 


mental reinforcement, OP, and the 
yoked classical group, CL-OP, given 
about 50% intermittent classical rein- 
forcement, has a still lower acquisition 
curve. The noncontingent reinforce- 
ment was ineffective in producing in- 
creases in responding as shown by the 
two lower curves in the figure. 

In extinction the instrumental rein- 
forcement produces more responses 
than classical reinforcement, and there 
is some indication of a partial-rein- 
forcement effect in the extinction curve 
for the CL-OF group. The statistical 
analyses, ¢ tests of arc sin transformed 
data with allowances for heterogeneity 
of variance where appropriate, showed 
that the noncontingent reinforcement 
produced significantly (p < .01) less 
responding during acquisition. During 
extinction the classical and contingent 
instrumental reinforcement produced 
significantly more (p < .05) respond- 
ing than did the 10096 noncontingent 
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instrumental reinforcement, OP-100, 
procedure ; but, curiously, they did not 
differ significantly from the OP-OP 
group. When a correction was made 
for the final level of responding during 
acquisition in the CL and OP groups 
by subtracting the number of responses 
on the first and last 10 trials of extinc- 
tion from the number of responses on 
the last 10 trials of acquisition, there 
was significantly less extinction drop 
in the OP group during the first 10 
extinction trials, #(38) = 2.24, p < .05. 
The difference between the CL and 
the CL-OP extinction curves, similarly 
corrected for acquisition, was highly 
significant, t(38) = 11.95, p< .001, 
indicating that the superior resistance 
to extinction in the instrumental rein- 
forcement group may have been an in- 
termittent reinforcement effect. It 
should be pointed out, however, that 
there was no intermittence of the re- 
inforcement of appropriate responses 
in the instrumental reward group. 


Discussion 


The instrumental reward acquisition of 
conditioned eyelid responses reported by 
Hansche and Grant (1965) is corrobo- 
rated. There was no evidence for classi- 
cal reinforcement properties of the in- 
strumental reward, as shown by the 
absence of acquisition of the response 
in the OP-OP and the OP-100 groups. 
The contingency of the reinforcement 
upon the response is clearly necessary 
for the acquisition of this eyelid response. 

The reported higher resistance to ex- 
tinction following instrumental reward 
training as compared with classical con- 
ditioning training appears also to be à 
valid finding, In the present experiment, 
however, this higher resistance to €x- 
tinction following instrumental reinforce- 
ment characterizes only the first 10 ex- 
tinction trials in contrast to the more 
persistent resistance to extinction T€- 
ported by Hansche and Grant, 


CLASSICAL AND INSTRUMENTAL EYELID CONDITIONING 


The form of the eyelid response in 
the group receiving instrumental rein- 
forcement was essentially that of the 
voluntary eyelid response (Spence & 
Ross, 1959). In Fig. 2 are presented 
the frequency distributions of derivative 
height in millimeters, a measure of the 
slope of the eyelid responses, and latency 
in milliseconds for the OP and the 18 
voluntary form responders (Hartman & 
Ross, 1961) of the CL groups. Eighteen 
of the 20 Ss in the CL group were classi- 
fied as voluntary form responders by the 
Hartman and Ross (1961) criterion, so 
that the close correspondence between 
the derivative distributions in the two 
groups indicates that the OP Ss would 
be considered voluntary responders had 
they been given classical reinforcement. 
In fact, 14 of the 20 Ss, all that could be 
classified in terms of the number of re- 
sponses available, were voluntary re- 
sponders by the Hartman-Ross criterion. 
The latency distribution shows that the 
instrumentally reinforced Ss gave more 
short-latency responses than did the 
classically reinforced Ss. The latency 
distribution closely resembles those ob- 
tained by Hartman, Grant, and Ross 
(1960) for instructed voluntary eyelid 
responses in the absence of a ready sig- 
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nal It is concluded that the instrumen- 
tally reinforced Ss in this experiment 
showed mostly the voluntary form re- 
sponse with a relatively shorter latency 
than that obtained from the classically 
reinforced Ss. 


REFERENCES 


Hawscug, W. J. A comparison of operant, 
avoidance, and classical techniques in the 
conditioning of the eyelid response. Un- 
published doctoral thesis, University of 
Wisconsin, 1959. 

Hanscue, W. J., & Grant, D. A. A com- 
parison of instrumental reward and avoid- 
ance training with classical reinforcement 
technique in conditioning the eyelid re- 
sponse. Psychon. Sci, 1965, 2, 305-306. 

Hartman, T. F., Grant, D. A, & Ross, 
L. E. An investigation of the latency of 
“instructed voluntary” eyelid responses. 
Psychol. Rep., 1960, 7, 305-311. 

Harman, T. F., & Ross, L. E. An alter- 
native criterion for the elimination of 
“voluntary” responses in eyelid condi- 
tioning. J. exp. Psychol., 1961, 61, 334- 
338. 

Spence, K. W., & Ross, L. E. A methodo- 
logical study of the form and latency of 
eyelid responses in conditioning. J. exp. 
Psychol., 1959, 58, 376-381. 


(Received July 27, 1965) 


Journal of Experimental Psychology 
1966, Vol. 72, No. 4, 492-496 


TRANSFORMATIONS OF POSITIVE AND NEGATIVE 
INFORMATION IN A MODIFIED LEARNING-SET TASK* 


ROBERT WEBER 
Kenyon College 


ADDISON WOODWARD, Jr. 
Connecticut College 


AND 


The relative effects on acquisition of positive and negative information 
were studied by using a modified learning-set task, 2 trials per 
problem, On the Trial 1’s of each problem S did not respond, but E 
gave either positive or negative information to S. On Trial 2's S 
did respond, and the correct choice, depending on the condition, could 
be either a "recurrent" stimulus item that carried over from Trial 1, 
or a "transient" stimulus item that did not carry over from Trial 1. 
When the recurrent item was correct, Positive Trial 1 information 
gave more rapid Trial 2 learning than negative information. But 
when the transient item was correct, negative Trial 1 information gave 
more rapid learning than positive information. In all cases there were 
learning-to-learn effects in the processing of positive and negative 


information. 


Since Smoke's (1933) study the role 
of positive and negative instances in 
learning concepts has attracted passing 
attention from psychologists. From 
the standpoint of methodological nicety 
Smoke's procedures left much to be 
desired. It remained for investigators 
like Hovland and Weiss (1953) to 
tighten some of the procedural loop- 
holes by designing a task that allowed 
them to assess and then equate the 
relative amounts of positive and nega- 
tive information. To do this they had 
to tell their Ss a good deal: how many 
stimulus dimensions were being em- 
ployed; how many values each dimen- 
sion had; that the concept in question 
would, for example, consist of two di- 
mensions out of a possible set of four; 
that there would be two values for 
each dimension; etc. Their findings 
indicate that Ss can learn from negative 
instances but find it easier to learn 
from positive ones. 

Recently Wason (1959) has ex- 
tended the problem. From its original 
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formulation in terms of positive and 
negative instances Wason has gone on 
to talk of positive and negative sets 
and information. He uses linguistic 
propositions that are cast in an affirma- 
tive (positive) or negative form. The 
Ss are asked to change these propo- 
sitions so that they agree or disagree 
with the objective state of conceptlike 
stimulus materials. Positive proposi- 
tions are easier for Ss to deal with 
than negative ones; this is true for 
both latency and error scores. Miller 
and Chomsky (1963) have related 
Wason’s study to the new and im- 
portant work dealing with grammati- 
cal transformations (Chomsky, 1957). 
The general notion is that negation 
is simply a linguistic transformation 
that may be applied to simpler kernel 
utterances, thereby producing more 
complex utterances. According to 
Miller an S’s speed of reacting to, and 
dealing with, various linguistic trans- 
formations ought to tell us something 
about their order of application and 
therefore their psychological primacy: 

The purpose of the present experi- 
ment is to examine certain relations 
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between positive and negative informa- 
tion, while at the same time giving Ss 
minimal instructions. To this end a 
modified discrimination learning-set 
task is employed. It allows assessment 
of learning-to-learn effects in the trans- 
formation of negative information into 
positive information and in the trans- 
formation of positive information into 
negative information. In addition, a 
base line condition is included for a 
null transformation in which Ss need 
not alter in form the information given 
them, 


METHOD 


Subjects —Forty students from introduc- 
tory psychology classes at C. W. Post Col- 
lege of Long Island University served as Ss. 
All were naive with respect to the experi- 
mental task. They were assigned at random 
to experimental conditions, 10 per group, 
and were run individually. 

Procedure and design—The stimuli used 
were highly distinctive, two-dimensional, 
visual nonsense forms (Goldstein & Weber, 
1965). In function they resemble the 
stimuli used in primate learning-set studies 
(Harlow, 1959). Tf letters of the alphabet 
are used to stand for arbitrary stimulus 
forms, then the design of a typical problem 
consisting of two trials is represented 
schematically in Table 1. 

On each trial of a problem there were two 
stimulus items, each about the size of a 50¢ 
piece and with a space of 3 in. between them. 
They were arrayed horizontally on a 4X 6 
in. card in a visible file folder (Gagné & 
Brown, 1961). Table 1 indicates that the 
stimulus display “ab” appeared on Trial 1 
and the display “ac” on Trial 2. Because 
the stimulus “a” repeats across the two trials 
of the problem it is designated as “recurrent” 
(R). Because the complementary stimuli 
“b” and “c” change from Trial 1 to Trial 2 
they are designated as “transient” (T). 
This two-trial per problem design was re- 
peated for 40 problems, and each successive 
problem had different stimuli. In brief, no 
two problems ever had the same recurrent 
stimulus, and no two trials within a problem 
ever had the same transient stimulus. Also, 
the position, left or right, of recurrent and 
transient stimuli was assigned randomly. It 
is clear that this is standard learning-set 
procedure (Harlow, 1959) with two trials 
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TABLE 1 
DESIGN OF THE EXPERIMENT 


Trial 1 Trial 2 
Group | Stimulus | Informa- | Stimulus | — y, 

„Pair tion Pair | Corect 
Displayed| Item» |Displayed| COSC! 

PR ab a+ ac a 

NR ab br ac a 

PT ab b+ ac (2 

NT ab a— ac c 


= The letter indicates the stimulus item that carried 
information, and + or — indicates whether the in- 
formation was positive or negative. 


per problem, except for modification due to 
the introduction of transient stimuli. 
Every S saw exactly the same displays in 
exactly the same order. The only differ- 
ences among experimental conditions con- 
sisted of the Trial 1 information S received 
on each problem and the way reinforcement 
was assigned to Trial 2 of each problem. 
On Trial l's S did not respond, but was 
given information, positive or negative. (To 
aid S in remembering not to respond on 
Trial 1’s the display on each such trial had 
a colored strip at the bottom.) The infor- 
mation was given by E indicating with a 
pointer one of the two stimulus forms (recur- 
rent or transient) and then giving spoken in- 
formation (“Right” or "Wrong"), the par- 
ticular combination of pointing and informa- 
tion depending on the experimental condition. 
On any given Trial 1, $ observed and then, 
without making a choice, turned to the next 
card (this was Trial 2 of the same prob- 
lem); here S made a choice by pointing to 
one or the other of the two stimulus items, 
followed by E saying "Right" or “Wrong.” 
Then S turned to the next card (Trial 1 of 
a new problem) where E gave information 
without S making a choice, and so on, for 
all 40 problems. It is emphasized that 
amount of Trial 1 information, whether nega- 
tive or positive, is equal. If negative 
information is given with respect to Stimulus 
“a” in the display “ab,” then the positive 
stimulus must have been "b. If positive 
information is given with respect to “b,” then 
the negative stimulus must have been “a.” 
Hence receiving either positive or negative 
information determines uniquely the role of 
the item on which information is not given. 
Table 1 shows that a 2X2 factorial de- 
sign was used. The kind of Trial 1 infor- 
mation, positive or negative, was one variable 
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TABLE 2 
TRIAL 2 ERRORS OVER 40 PROBLEMS 


Group M Mdn. SD Range 
PR 0.2 0 0.4 0-2 
NR 6.2 3 7.5 0-26 
BT. 13.3 15 4.7 4-19 
NT 2.0 2 1.8 0-5 


Note.—N = 10 for each group. 


of classification; and the way in which the 
Trial 2 correct choice was assigned, to the 
recurrent or transient item, was the other 
variable of classification. Hence in Group 
PR positive information was given by E 
on Trial 1’s, and S was required to choose 
the recurrent item on Trial 2’s. In Group 
NR negative information was given on Trial 
ls, and the recurrent item was correct on 
Trial 2’s. In Group PT positive information 
was given on Trial 1’s, and the Trial 2 cor- 
rect choice was always the transient item, 
Finally, for Group NT, negative informa- 
tion was provided on Trial 1’s, and transient 
stimuli were correct on Trial 2's. It is the 
successive Trial 2 response outcomes, right 
or wrong, that constitute the experimental 
data. 

Instructions.—Instructions were minimal. 
They were presented along with a mock-up 
display and served to tell S he should ob- 
serve on trials with a colored strip (Trial 
ls), he should respond on other trials 
(Trial 2s), and that he would be told 
"Right" or "Wrong" after each response. 
In addition S was told he was to learn to 
make as many correct choices as possible. 
No mention was made of recurrent or tran- 
sient stimuli, nor of concepts, transformations, 
or anything about the purpose of the ex- 
periment. 


RESULTS 


Table 2 presents the descriptive re- 
sults of Trial 2 errors. Groups PR and 
NR come out in the direction one 
would expect. Positive information on 
Trial 1’s leads to few errors, and nega- 
tive information on Trial l's leads to 
considerably more errors. However, 
in the transient groups, PT and NT, 
the results are reversed. Positive in- 
formation on Trial 1’s yields a large 
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number of errors, and negative in- 
formation yields few errors. 

When an analysis of variance is per- 
formed on Trial 2 errors, the main 
effects (Trial 1 information and Trial 
2 correct choice) are submerged by the 
interaction effect, F (1, 36) = 32.40, 
p < 001. The difference between Trial 
2 correct choice, recurrent or transient, 
is significant, F (1, 36) = 858, p< 
-01, but hard to interpret because of the 
interaction. The difference between 
positive and negative information on 
Trial 1 is not significant, but again is 
hard to interpret because of the inter- 
action. What is going on can best be 
seen by examining the group learning 
curves in Fig. 1, where the interaction 
is illustrated by the ordering of the 
different curves. 

Additional features of Fig. 1 also 
warrant comment. That all curves do 
not begin at or near the 50% chance 
responding level is explained by our 
procedure of summing over blocks of 
eight problems, For some groups (PR 
and NT) learning occurred almost im- 


PR e———e 


60) NTO-----o 
NRe-----e 
PTo——o 


PERCENT CORRECT ON TRIAL 2 


F8 9-6 (7-24 25-32 33-40 
PROBLEMS 


Fic, 1. Percentage of correct Trial 2 
choices as a function of problems and experi- 
mental group. (Each point represents 80 
events—10 Ss X 8 problems.) 
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mediately, while one group (PT) 
shows no learning until Problem Block 
17-24. That these curves are for the 
most part monotonic and increasing 
indicates that there are learning-to- 
learn effects in the processing of posi- 
tive and negative information. 

The foregoing results may be sum- 
marized in two propositions. (a) Con- 
trary to the findings of previous stud- 
ies the influence on acquisition of posi- 
tive vs. negative information depends 
on the task being learned; sometimes 
a task is learned most readily when 
negative information is given, and at 
other times when positive information 
is given. (b) Learning-to-learn phe- 
nomena occur in the processing of 
positive and negative information. 


Discussion 


Given the results obtained, in particu- 
lar the interaction effect, one may ask 
if there is a reasonable explanation to 
account for them, If the results of Table 
2 are examined, it is observed that Cond. 
PR and NT are relatively easy, while 
Cond. NR and PT are more difficult. Do 
the easy groups have anything in com- 
mon? Do the difficult groups have any- 
thing in common ? 

Yes, to both questions. Call the stimu- 
lus item to which Trial 1 information is 
attached a. carrier stimulus. The Groups 
PR and NT both receive their Trial 1 
information tied to a carrier stimulus that 
is recurrent across the two trials of a 
problem. In contrast, groups NR and 
PT both receive their Trial 1 informa- 
tion tied to a carrier stimulus that is 
transient across the two trials of a 
problem. 

The obvious conclusion is that Trial 1 
information, whether positive or negative, 
is best carried to Trial 2 by a stimulus 
that recurs in Trial 2. To account for 
this one may hypothesize that presenting 
information on Trial 1 has the important 
effect of focusing an $'s attention on the 
carrier stimulus, irrespective of whether 


TABLE 3 


TRANSFORMATION FLOW CHART FOR A TYPICAL 
PROBLEM AND MEDIAN ERRORS AS A 
Function or NUMBER OF 


‘TRANSFORMATIONS 
Group 
Trial Diore aeon ipo cu 
PR NT | NR PT 
1 BOY Se y ase 9n [aei 
+ - + 2j- 4 
l l l l 
2 x z|x—z|x 2|xs5 
* = ck]: - * 
No. of Trans- 
formations 1 2 2 3 
Mdn. Errors 0 2 3 15 


the information tied to it is positive or 
negative. Since in Groups PR and NT 
the carrier stimulus also occurs on Trial 
2, S simply matches it (Group PR), or 
he rejects it and then chooses the other 
stimulus item (Group NT). It is possi- 
ble to think of matching as the outcome 
of an identity or null transformation and 
of rejection as the outcome of a null 
transformation plus a transformation 
away from a negative stimulus. These 
transformations are shown as arrows in 
the flow chart in Table 3, where plus and 
minus signs indicate explicit or implied 
reinforcement value. Although it is pos- 
sible to define "transformation" rigor- 
ously, all that is meant here is a change 
from one specifiable state to another, not 
necessarily different, state. 

By assuming that Trial 1 information 
focuses attention on the carrier stimulus, 
it is also possible to account for the diffi- 
culty experienced in Groups NR and PT. 
The common aspect of these two groups 
is that they both received Trial 1 in- 
formation tied to the transient stimulus 
item, Therefore, according to the hy- 
pothesis, S’s attention is fixed on the 
transient item in Trial 1. Then when 
Trial 2 occurs there is a different tran- 
sient and S is faced with what are for 
him two new stimuli. That Groups NR 
and PT do eventually learn must mean 
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that S changes to a different strategy 
during the course of learning and no 
longer fixes attention exclusively on the 
carrier stimulus. Table 3 offers an ex- 
planation. In the case of Group NR the 
negative information on Trial 1 is trans- 
formed by a shift in attention and coding 
into the derived positive element, which 
will then carry over as a trace to Trial 2. 
There S need only match his choice to 
it via a null transformation. For Group 
PT the opposite information transforma- 
tion is required: Positive information on 
Trial 1 is transformed by a shift of 
attention and coding into the derived 
negative item, which is then carried over 
as a trace to Trial 2. There S applies 
yet another transformation, this time to 
the trace, and thereby rejects the stimulus 
element matching the trace. Undoubt- 
edly, transforming positive information 
into a negative form must run counter 
to an S's usual strategy. It is no acci- 
dent that Group PT was by far the most 
difficult. Finally, Table 3 shows that 
median errors increase as the number 
of assumed transformations increases. 
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LEVEL OF RISK IN PROBABILITY LEARNING: 
WITHIN- AND BETWEEN-SUBJECTS DESIGNS* 
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Human choice behavior in a 2-choice situation involving risk was (a) 
invariant over 2 risk levels when each S received only 1 risk level, 
(b) directly related to risk level when each S was exposed to both 
risk levels. The major difference among groups was depression of 
performance on low-risk trials for Ss exposed to both risk levels 


(negative contrast). 


estimates of event probabilities (zs). 


Negative contrast was also observed in Ss’ 


Choice behavior and m, were 


significantly correlated within each experimental condition. 


In a review of incentive experiments 
in animal learning, Pubols (1960) con- 
cluded that the effect of incentive mag- 
nitude on time-independent measures 
of learning varies with the method of 
presentation of the incentive levels. 
Variation in incentive magnitude pro- 
duces no performance differences when 
each S is exposed to only one nonzero 
level of incentive (between-Ss design). 
However, performance is directly re- 
lated to the amount of incentive when 
each S is exposed to more than one 
level of incentive (within-Ss design). 

Studies investigating the effects of 
levels of risk on human choice be- 
havior have yielded results which are 
consistent with Pubol's conclusions. 
Myers, Fort, Katz and Suydam 
(1963) reported that the probability 
of predicting the more frequent of two 
events (P[A,]) was not significantly 
greater for a 10¢ risk group than for 
a 1¢ risk group. Lipkin, Schnorr, 
Suydam, and Myers (1965) found that 
P(A,) was directly related to amount 
of risk when human Ss were each ex- 
posed to two levels of risk. 

While these and other studies of 
payoff effects in human learning 

1This research was supported by funds 
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(Harley, 1965a, 1965b; Offenbach, 
1964) suggest a parallel with the litera- 
ture on animal learning, no single in- 
vestigation of human learning has in- 
volved a direct comparison of between- 
Ss and within-Ss incentive effects. The 
purpose of this study was to conduct 
a basic parametric investigation of the 
role of incentive in human learning as 
a function of the type of experimental 
design employed. 


METHOD 


Subjects —The Ss were 48 male and 48 
female undergraduates recruited from the in- 
troductory psychology course at the Uni- 
versity of Massachusetts. 

Procedure—The independent variables 
were (a) level of incentive (1 and 10 chips), 
(b) method of incentive presentation (be- 
tween-Ss, within-Ss), and (c) sex. Twelve 
males and 12 females were randomly assigned 
to each of the two between-Ss conditions. 
The remaining 48 Ss were run under the 
within-Ss design. 

The Ss were required to predict whether 
each of 100 playing cards had a point or line 
drawn on it. The relative event frequencies 
were 60:40 with the point being the more 
frequent event (Ei) for one half of the Ss 
and the less frequent event (Es) for the 
other half. The cards were shuffled before 
each block of 100 trials in full view of S. 
Each S received a total of 400 trials in a 
single experimental session. y 

On each trial, an S won or lost a pre- 
viously designated number of chips depending 
upon the correctness of his prediction. For 
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PREDICTIONS: 


è 


MEAN PROPORTION OF Ej 
g 


BLOCKS OF i00 TRIALS 


Fic. 1. Probability of predicting the more 
frequent of two events over trial blocks for 
all experimental groups. 


Ss in Group 1-B, level of risk was 1 chip for 
each of the 400 trials. For Ss in Group 
10-B, level of risk was 10 chips on each of 
the 400 trials. The Ss run under the within- 
Ss design (Group W) experienced 50 trials 
at each level of risk, the sequence of risk 
levels being random within each block of 
100 trials. Level of risk was indicated on 
the face of each playing card so that Group 
W Ss knew in advance of their prediction 
the level of risk for that trial. Probability 
of E; (r) was .6 at each level of risk. 

The Ss were run individually and were 
self-paced. Each was given an initial stake 
of 200 chips and instructed to try to win as 
many chips as he could during the session. 

After the session, each S was given a 
questionnaire. Groups 1-B and 10-B Ss 
were asked to estimate + to the nearest 5%. 
Group W Ss were asked the same question 
for each level of risk. 

Statistical analysis—In order to compare 
the data of Groups 1-B and 10-B with that 
of Group W in the same analysis of vari- 
ance, a procedure similar to that suggested 
by Grice and Hunter (1964) was followed. 
Group W Ss were rank ordered within sex 
according to the magnitude of the difference 
in their P(A:) scores under the two risk 
levels for the last 100-trial block. This dif- 
ference will be referred to as “objective con- 
trast.” For each successive pair of ordered 
objective contrasts within a sex, one S’s 
high-risk score and the other S’s low-risk 
score were randomly assigned to one data 
set, while the former S's low-risk score and 
the latter S's high-risk score were assigned 
to another data set, yielding two sets of data 
for Group W. t 
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Combining either of these data sets with 
the data of Groups 1-B and 10-B results in 
eight combinations of sex, amount of incen- 
tive, and method of incentive presentation, 
each containing 12 different Ss. Each of 
the Group W data sets was, in turn, included 
in an analysis of variance with the data of 
Groups 1-B and 10-B. Consequently, two 
sets of F ratios will be reported in the 
Results section. 


RESULTS 


Figure 1 shows the mean proportion 
of E, predictions (P[A,]) for Groups 
1-B, 10-B, and W over blocks of 100 
trials. The curves for Groups 1-B 
and 10-B represent the same data 
in both panels of the graph. Each 
panel contains one half of the Group 
W data obtained by the procedure de- 
scribed in the Method section. As seen 
in both panels of Fig, 1, the curves 
for Groups 1-B, 10-B, and for Group 
W on high-risk trials are negatively 
accelerated and close together, while 
Group W performance on low-risk 
trials shows little change over trial 
blocks and is noticeably depressed. 

Analyses of variance of the P(A1) — 
data yielded a significant main effect 
of method of incentive presentation, 
F (1, 88) = 895, p < .01; F (1, 88) 
= 12.38, p< .001. Amount of in- 
centive was significant, F (1, 88) = 
6.13, p < .025; F (1, 88) = 7.67, b 
< 01, as was the interaction of method 
of incentive presentation with amount 
of incentive, F (1, 88) = 5.84, p < 
.025; F (1, 88) = 7.30, p < .01. None 
of the effects involving sex were sig- 
nificant. All significant results were 
clearly due to the lower P(A,) scores - 
of the Group W Ss on low-risk trials. 

P(A,) increased over trial blocks, - 
F (3, 264) = 1948, p «001; F (3 
264) = 21.73, p < 001. P(Ai) in 
creased more rapidly over trial blocks 
on high-risk trials than on low-risk 
trials, F (3, 264) = 4.06, p < 01; 
(3, 264) = 3.23, p «.05. Between- 
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Ss incentive presentation produced a 
more rapid increase in P(A;) than 
did within-Ss incentive presentation 
for the right-panel data, F (3, 264) 
— 487, p « 01 but not for the left- 
panel data, F (3, 264) = 0.88. 

The Ss’ estimates of z, which will 
bé referred to as s, were compiled 
from the questionnaires. Panels I and 
II of Table 1 show the mean m, for 
the same Ss whose P(Ai) data are 
depicted in the left and right panels of 
Fig. 1, respectively. The Groups 1-B 
and 10-B Ss overestimated + while the 
Group W Ss overestimated 7 for high- 
risk trials and underestimated m for 
low-risk trials, As noted in the 
Method section, m was .6 for all con- 
ditions. The Ss in Group W showed 
a consistent trend in their estimations: 
the high-risk +, minus the low-risk rs, 
a quantity which will be referred to as 
“subjective contrast,” was negative for 
only 4 of the 48 Group W Ss. 

Analyses of variance were performed 
on the r, data in the same manner as 
for the P(A,) data. These analyses 
yielded a significant method of incen- 
tive presentation main effect, F (1, 88) 
= 7.72, p < 01; F (1, 88) = 1345, p 
< .001, a significant amount of in- 
centive main effect, F (1, 88) = 4.39, 
p < 025; F (1, 88) = 10.26, 6 < 01, 
and a significant interaction of these 
two variables, F (1, 88) — 10.83, $ < 
.01 ; F (1,88) = 20.28, p < .001. None 
of the effects involving sex approached 
significance. All significant results 
were clearly due to the lower 7s for 
Group W Ss on low-risk trials. 

Since analyses of variance of the 
P(A) and 7 data yielded similar re- 
sults, Pearson product-moment correla- 
tion coefficients were determined for 
various aspects of these two data sets. 
Last-block P(A:) was significantly 
correlated with ms for Groups 1-B r 
= 586, t (23) 2339, P< .01, and 


TABLE 1 
MEAN =z, FOR ALL EXPERIMENTAL 
CONDITIONS 
Panel 
Group PETERE A 
I II 
Group 10-B 63.5% 63.5% 
Group 1-B 66.5% 66.5% 
Group W, High Risk | 65.4% 64.8% 
Group W, Low Risk | 48.1% 51.796 


10-B, r = .498, t (23) = 270, p < .02. 
For Group W Ss, objective contrast 
was significantly correlated with sub- 
jective contrast r — A07, t (47) = 3.02, 
p< Ol. 


DISCUSSION 


The findings of this study support 
Pubol’s contention that presenting levels 
of incentive as a between-Ss variable 
does not result in performance differ- 
ences as measured by time-independent 
indexes of learning, but that manipulating 
levels of incentive as a within-Ss variable 
produces performance differences that are 
directly related to amount of incentive. 
The results are also consistent with the 
findings of other human learning studies 
(eg, Harley, 1965a, 1965b; Lipkin et 
al, 1965; Myers et al, 1963) in which 
the level of reward or risk has been the 
independent variable. These findings im- 
ply that future investigators of incentive 
effects should optimally employ a within- 
Ss design. 

The results show depressed values of 
P(A,) for Group W Ss on low-risk 
trials with no concurrent enhancement of 
performance on high-risk trials. Fur- 
thermore, as seen in Table 1, the mą data 
exhibit similar negative contrast. These 
objective and subjective negative con- 
trast effects are consistent with other 
studies of incentive, both animal and 
human. With regard to the objective 
contrast, Bower (1961) found negative 
contrast of running time in rats using a 
design similar to that of the present 
study. Spence (1956) found negative 
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contrast when rats underwent incentive 
shift in a runway. Swensson (1965) 
found negative contrast in P(A,) when 
human Ss were shifted from monetary 
risk to nonmonetary risk in a three- 
choice probability learning situation. In 
accord with the subjective contrast ob- 
served in the present study, Swensson 
has reported data that also show m, for 
low-incentive trials to be lower under a 
within-Ss presentation than under a 
between-Ss presentation. 

One possible explanation of the nega- 
tive contrast observed in the present 
study is that Group W Ss think that 
there are two problems to be solved: 
one on high-risk trials and one on low- 
risk trials. Having no reason to suspect 
that the solutions to these problems are 
the same, S may try to maximize his 
winnings by attending primarily to the 
outcomes of high-risk trials. Consistent 
with this viewpoint is the fact that 
P(A,) scores in Group W for low-risk 
trials deviate little from 50% over all 
trial blocks. The hypothesis that Group 
W Ss attend differently on high- and 
low-risk trials is also consistent with the 
correlation between P(A,) and z, values. 
If a Group W S has not attended to the 
outcomes of low-risk trials, he has no 
reason to suspect that either event is 
more likely, and he will therefore esti- 
mate z for low-risk trials as .5. Under 
other experimental conditions, where Ss 
have attended, subjective probabilities 
should be closer to z, as they are. In 
summary, under between-S or high-risk 
within-S conditions, $ is more aware of 
trial outcomes than under the within-S 


low-risk condition. Consequently, the 
former set of conditions yields both 
higher P(A,) and z values than does 
the latter condition. 
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EFFECTS OF SOME SEQUENTIAL MANIPULATIONS OF 
RELEVANT AND IRRELEVANT STIMULUS 
DIMENSIONS ON CONCEPT LEARNING 
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University of Illinois 


Alternating series, in which one set of relevant cues appears at odd- 
numbered trials and another set at even-numbered trials, were 
contrasted with constant series, in which the same relevant cues 
appear at every training trial. The principal result was the interaction 


on both training trials and test trials between type of series and the 


number of stimulus dimensions 


changing from trial to trial. 


alternating series, the group in which 1 stimulus dimension changed 


For 


between each pair of adjacent training trials showed fewer errors 


(p <.01) than the group in which 3 dimensions changed. 


In the case 


of constant series, the group in which 3 dimensions changed displayed 


fewer errors (p < 01) than the group in which 


Two concept-learning studies (Kurtz 
& Hovland, 1956; Peterson, 1962) 
have manipulated the sequence of rele- 
vant cues, both comparing constant 
series, in which each of the stimuli in 
a series contains the same relevant 
cues (e.g. RED, RED, RED, RED, RED, 
RED), with mixed series, in which some 
of the stimuli in a series exhibit one 
set of relevant cues and other stimuli 
display different relevant cues in an 
irregular pattern (eg, GREEN, RED, 
RED, GREEN, RED, BLUE). Both studies 
seemed to show that constant series are 
superior to mixed series. One study 
has manipulated the irrelevant stimu- 
lus dimensions. Detambel and Stolu- 
row (1956, p. 40) proposed that to 
maximize rate of learning: 


(a) When the same relevant cue is pre- 
sented on adjacent trials, as many as possible 
of the irrelevant stimulus components should 
have different values on the two trials, (b) 
When two different relevant cues are pre- 
sented on adjacent trials, as many as possible 
of the irrelevant stimulus components should 
be kept constant. 


Detambel and Stolurow conducted a 
study in which a sequence composed 
according to the two rules mentioned 


just 1 changed. 


in the quotation was contrasted with 
a sequence composed according to the 
inverse of these rules. The result 
was a striking superiority for the se- 
quence constructed according to the 
rules on both the training series and 
a test series in which the training 
stimuli were presented randomly. 
The previous studies have not ade- 
quately disentangled the effects of se- 
quential manipulations of the revelant 
stimulus dimensions from the effects 
of manipulations of the irrelevant 
stimulus dimensions. The Detambel 
and Stolurow (1956) study employed 
a single “good” stimulus sequence con- 
structed according to two rules. There 
is no way of telling whether the 
superiority of this sequence was a 
function of both rules or only one or 
the other. With regard to the two 
studies of constant vs. mixed stimulus 
series, there is reason for believing 
that the manner of ordering stimuli 
inadvertently confounded manipulation 
of the relevant dimensions with vari- 
ation in the irrelevant dimensions. In 
these investigations, the probability 
was high that a large proportion of the 
irrelevant stimulus dimensions would 
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change between successive trials, Con- 
sequently, it appears that the mixed 
series in these studies approximated 
the “bad” series in the Detambel and 
Stolurow study. 

The study reported in this paper 
attempted to disentangle the effects of 
trial-to-trial changes in the relevant 
stimulus dimension from the effects of 
trial-to-trial changes in the irrelevant 
dimensions. The expectation was that 
within alternating series, a special case 
of a mixed series in which the stimuli 
presented on odd-numbered trials con- 
tain one relevant cue whereas stimuli 
presented on even-numbered trials con- 
tain a different relevant cue, perform- 
ance would be facilitated by decreasing 
the number of stimulus dimensions that 
change from trial to trial. On the 
other hand, within constant series, the 
expectation was that performance 
would be facilitated by increasing the 
number of stimulus dimensions that 
change from trial to trial. In other 
words, an interaction was predicted be- 
tween type of series and number of 
stimulus dimensions changing from 
trial to trial. 


METHOD 


Apparatus.—The apparatus consisted of a 
Masonite screen (23 X 4 ft.) with a centered 
window (23 X 5 in). The S sat facing the 
screen with his head approximately 20 in. 
from the window. Except when instructions 
were read, E was shielded from S's view. 

Stimulus materials.—The stimuli were 
drawn from templates on 3X 5 in. file cards 
using felt-tipped pens. The stimulus could 
be inscribed within a 13 X3 in. rectangle. 
There were five dimensions, each of which 
took on three values as follows: number— 
one, two, or three figures; color—red, blue, 
or green; form—rectangle, diamond, or oval ; 
shading—outline, solid, striped; bar (black, 
14 in. long, centered) —vertical, horizontal, 
absent. 

Procedure—The same instructions were 
read to all Ss as follows: 


In a moment, I will show you a series of 
cards. They will appear one at a time in 
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the little window in front of you. Your 
task will be to classify the cards into one 
of two categories. You should try to 
learn the basis of the correct classification 
as quickly as possible. Each time you 
see a card in the window, you are to say 
either Alpha or Bravo. I will tell you 
whether you are correct. Alpha will be 
the correct word for some of the cards 
and Bravo will be the correct word for 
some of the cards. Your task is to learn 
which word goes with which kind of card. 


Stimuli were presented one at a time, 
The S had as much time as he needed to 
respond. Upon hearing S’s response, E said 
either Correct or Incorrect. The intertrial 
interval, which E timed with a stopwatch, 
was 15 sec. The S was presented with one 
problem involving one relevant stimulus di- 
mension and four irrelevant dimensions, 
Since there were three values for each di- 
mension and just two instrumental re- 
sponses, each S was required to learn a one- 
valued concept (e.g., red—Bravo) and a two- 
valued concept (e.g., blue-or-green—AIpha). 
The S received a training series of six 
trials and a test series. There was no break 
between the two series. The test series, 
consisting of stimulus cards presented in 
random order, continued until S made 10 
consecutive correct responses or completed 
80 trials (not counting training trials). The 
test series was randomized by shuffling a 
deck containing all of the stimulus cards. 
The training series were constructed sepa- 
rately for each S and assembled prior to the 
experimental session. 

Experimental groups.—There were five 
groups of 30 Ss, each of which received a 
different training series. One group, the 
control group, received training consisting of 
series of six randomly selected stimulus 
cards. Among the remaining four groups, 
two received alternating series in which the 
relevant cue of the one-valued concept ap- 
peared on Training Trials 1, 3, and 5 and a - 
relevant cue of the two-valued concept ap- 
peared on Training Trials 2, 4, and 6. Only 
one of the two relevant cues of the two- 
valued concept was presented during the 
training series. That is to say, if red were - 
the cue for the one-valued concept, either 
green or blue but not both would appear on . 
Training Trials 2, 4, and 6. Two of the - 
groups received constant training series in - 
which the relevant cue of the one-valued 


ced appeared on each of the six training Q 
trials, 


ALTERNATION OF RELEVANT CUES IN CONCEPT LEARNING 


Within one of the groups receiving an 
alternating series (Group Al) and one of 
the groups receiving a constant series 
(Group Cl), the training sequence was ar- 
ranged in such a way that exactly one 
stimulus dimension changed between adja- 
cent trials. In the case of Group Al, 
it was the relevant dimension that changed; 
thus for this group all of the irrelevant 
dimensions were held constant. One of the 
irrelevant dimensions changed on each train- 
ing trial for Group Cl, with the added re- 
striction that each irrelevant dimension 
change at least once during the six training 
trials. 

For one group presented with an alternat- 
ing series (Group A3) and one group pre- 
sented with a constant series (Group C3) 
exactly three dimensions changed from one 
stimulus to the next. The relevant dimen- 
sion and two irrelevant dimensions changed 
for Group A3. In the case of Group C3, 
three irrelevant dimensions changed. For 
these two groups, there was the further re- 
striction that each irrelevant dimension 
change at least twice during the six training 
trials. All of the experimental groups re- 
ceived enough information to „learn the 
concept within the six training trials. Con- 
trol Ss received training series randomized 
by shuffling a deck of stimulus cards, It is 
probable that most but not all control Ss 
received sufficient information to learn the 
concept during the six training trials. 

Within each of the five treatment groups 
there were five equal-sized subgroups ac- 
cording to which of the five stimulus dimen- 
sions was relevant and a further division 
such that half of the Ss in each subgroup had 
one of the instrumental responses associated 
with the single-valued concept and half had 
the other instrumental response associated 
with the single-valued concept. , 

Except for the rules by which the treat- 
ments were defined and the other restrictions 
that have been noted, sequencing decisions 
were made randomly by flipping a die. Spe- 
cifically, the values of the first stimulus, the 
dimensions of the stimulus that changed 
between any two trials, and the new values 
of dimensions that changed were determined 
randomly for each S within the constraints 
provided by the rules. é 

Subjects.—The Ss were 150 college seniors 
enrolled in a course in educational psychol- 
ogy. Participation in the study was a 
course requirement. Three Ss were elimi- 
nated and replaced by others. In two cases 
E discovered an error in the sequence of 
stimuli that was presented. In one case S 
refused to cooperate. 
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TABLE 1 


Means AND SDs or TRAINING ERRORS 
AND TEST ERRORS 


Measure 

Group Training Errors Test Errors* 
M SD M SD 
Al 1.30 .84 1.37 1.60 
A3 2.30 1.21 2,31 248 
C1 2.97 1.33 1.79 1.87 
C3 2.37 1.10 95 1.34 
Control | 2.77 1.10 2.61 247 


* Square root of test errors, 


RESULTS AND DISCUSSION 


Table 1 contains means and stand- 
ard deviations of training errors for 
the five groups. Training errors were 
normally distributed in each of the 
groups and the variances were not 
significantly different (Fmax = 2.51, k 
—5,n230; p.05). Analysis of 
variance showed two significant effects 
with respect to training errors. Fewer, 
F (1, 100) = 17.29, p < .01, training 
errors appeared in groups receiving 
alternating series than in groups re- 
ceiving constant series. There was an 
interaction, F (1, 100) = 14.73, p < 
01, between type of series and number 
of stimulus dimensions changing from 
trial to trial. Since there was a sig- 
nificant interaction and since, in ad- 
dition, there was a desire to contrast 
each of the treatment groups with the 
control group, tests (a= .01) were 
made between pairs of means using 
the Newman-Keuls procedure. There 
were significant differences among the 
five groups, F (4, 145) = 9.81, p< 
01. Group A1 showed fewer training 
errors than any of the other groups. 
Groups A3 and C3 were not different 
from one another but both exhibited 
fewer training errors than either the 
control group or Group Cl. The lat- 
ter two groups were not different. 

Table 1 contains the means and 
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standard deviations of the square root 
of test errors. The square-root trans- 
formation reduced the skew and 
brought the distributions within the 
normal range of kurtosis. The vari- 
ances of the transformed distributions 
were not significantly different (Fmax 
222,66, k= 5, n 30/5 > 105): The 
only significant effect revealed by the 
analysis of variance was the predicted 
interaction between type of series and 
number of dimensions changing be- 
tween adjacent trials, F (1, 100) = 
7.35, p « .01. In order to examine 
the interaction more closely and also 
to contrast the treatment groups with 
the control group, the Newman-Keuls 
procedure was applied to pairs of 
group means. The differences among 
groups in the square root of test er- 
rors, ignoring the factor of relevant 
dimension, were significant, F (4, 145) 
= 3.94, p < .01. Group C3 exhibited 
fewer (a = .01 for each of the follow- 
ing comparisons) test errors than C1, 
A3, and the control group but not A1. 
Group A1 showed fewer test errors 
than A3 and the control group but was 
not different from C1. Group C1 was 
superior to the control group but not 
A3. Finally, Group A3 and the con- 
trol group were not different, 


The main result of the study was the 
interaction, on both training trials and 
test trials, between type of series and 
number of dimensions changing from 
trial to trial. As predicted, when differ- 
ent relevant cues appear on adjacent 
trials, the condition in which just one 
stimulus dimension (the relevant dimen- 
sion) changes is more efficient than the 
condition in which three dimensions 
change. When the same relevant cue is 
presented on adjacent trials, the condi- 
tion in which three stimulus dimensions 
change is more efficient than the condi- 
tion in which just one dimension changes. 
These results lend support to the two 
rules proposed by Detambel and Stolurow 
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(1956), though their study showed only 
that a training series constructed accord- 
ing to both rules was superior to a con- 
trol series. The present experiment 
seems to indicate some independent valid- 
ity for each of the two rules. 

Even the least effective of the experi- 
mental conditions, A3 in the case of 
training trials, C1 in the case of test 
trials, were superior to the control con- 
dition, suggesting that the constraints on 
randomness entailed in alternating and 
constant series facilitate performance. 
But it must be emphasized that the con- 
trol condition differed from the experi- 
mental conditions with respect to the net 
amount of information available from the 
training trials as well as with respect 
to sequential organization. 

Previous investigators have concluded 
that constant series lead to speedier learn- 
ing and/or superior transfer of training 
than mixed series. In the present study 
a special case of the mixed series, the 
alternating series, produced fewer train- 
ing errors than constant series and 
equally good performance on test (trans- 
fer) trials. These contrasting results 
can perhaps be attributed to the control 
that was maintained over trial to trial 
variations in the characteristics of the 
stimulus. The probability was high in 
the previous studies that a large propor- 
tion of the irrelevant stimulus dimensions 
would change from one trial to the next 
in both the mixed series and the constant 
series, a condition that retards learning 
in the former kind of series but promotes 
it in the latter. 
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DEVELOPMENT OF READINESS TO RESPOND 
DURING SHORT FOREPERIODS * 


LAWRENCE KARLIN 
New York University 


Simple reaction time as 


a function of foreperiod duration was 


determined for 6 foreperiod distributions characterized as leptokurtic, 


bimodal, or rectangular. 


S's inability to maintain a peak level of 


readiness geared to the most frequently occurring foreperiod suggested 


a “ballistic” type of preparation which was relatively independent of 


the conditional probability of occurrence of the stimulus during the 


foreperiod. 


Mowrer (1940) used what may be 
called a “catch” foreperiod technique to 
determine how readiness developed 
during a foreperiod of 12 sec. He 
presented a reaction stimulus for the 
great majority of trials 12 sec. after the 
warning signal. Interspersed at ran- 
dom among these trials were occasional 
trials with foreperiods shorter and 
longer than the modal (12 sec.) fore- 
period by varying amounts. The 
group readiness curve which Mowrer 
obtained showed that readiness reached 
a peak (minimum RT) at the modal 
foreperiod of 12 sec. Over the range 
of longer foreperiods (15-24 sec.) 
RT increased slightly. In a recent 
study, using a less direct technique, 
readiness curves of a sort were ob- 
tained for average foreperiod intervals 
of .5, 1, 2, and 3.5 sec. (Karlin, 1959). 
While the last three average fore- 
periods yielded results which could be 
held consistent with Mowrer’s the 
shortest average interval yielded a 
readiness curve in which the peak 
readiness reached at .5 sec. was not 
maintained, but dropped when the fore- 
period increased to 6 sec. 

When compared with those of 
Mowrer, these results suggested to the 

1 This investigation was supported by a 
Public Health Service Research Grant (MH 
07253) from the National Institute of Mental 
Health, Public Health Service. 


writer that S, for some reason, may not 
maintain a peak of readiness much 
beyond the expected length of fore- 
period when it is of relatively short 
duration, despite the fact that objec- 
tively the probability of occurrence of 
the reaction stimulus keeps increasing 
with elapsed time during a given fore- 
period. The present study was de- 
signed to examine this question more 
closely. 

A second aim of the present study 
was to explore certain possibilities sug- 
gested by an experiment of Lansing, 
Schwartz, and Lindsley (1959) on the 
relationship between alpha blocking 
time of the EEG and the foreperiod- 
RT function. They interpreted their 
results to mean that alpha blocking 
time is related to the time required to 
develop maximum readiness to respond 
during the foreperiod. The present 
study will check the possibility that 
their finding is a coincidence due to a 
range effect of the distribution of fore- 
periods which they used. Our study 
cannot be decisive, however, since we 
are not measuring EEG but it may 
indicate the need for caution in inter- 
preting their results. 


METHOD 


Apparatus.—A. 500-cycle tone was used 
both as warning signal and reaction stimulus. 
Tones were generated by an audio-oscillator 
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and presented to S through a pair of Sharpe, 
Model HA-10 headphones. These head- 
phones have water-filled cushions which pro- 
vide a seal that attenuates a considerable 
amount of ambient noise. Each distribu- 
tion of foreperiods was preprogrammed on 
magnetic tape as a series of signals of vary- 
ing frequency. When played back through 
a read-out device these signals triggered 
various relays at the appropriate time in- 
tervals which then presented the stimuli and 
started two pairs of chronoscopes (Hunter 
klockounter), one pair measuring S’s RT 
and the other pair, foreperiod duration. The 
S stopped one pair of chronoscopes by re- 
leasing a telegraph key. The entire appa- 
ratus was sound-proofed and was located 
with S in a sound-deadened, air-conditioned 
room. 

Instructions—The S was told to place his 
index and forefinger on a telegraph key 
prior to the first tone (warning signal), to 
get ready to respond when he heard the first 
tone and to release the key as quickly as 
possible when he heard the second tone (re- 
action stimulus). Except for one or two 
instances, Ss were not warned against mak- 
ing premature reactions because pilot work 
had indicated that, for most Ss, they were 
not excessive when nothing was said about 
them. On the other hand, some Ss tended to 
increase RT appreciably when they were so 
warned. In the present study, the mean 
number of premature reactions per S was 
less than .5 on any catch foreperiod and did 
not exceed 11 on any modal foreperiod. 

Foreperiod distribution —Durations of re- 
action stimulus and intertrial interval were 
constant for all foreperiod distributions at 
50 msec. and 7 sec., respectively. The warn- 
ing signal was 40 msec. in duration for 
Distribution B (see below) otherwise it was 
50 msec. Foreperiod durations were meas- 
ured from onset of warning signal to onset 
of reaction stimulus. Four different lepto- 
kurtic distributions, one bimodal distribution, 
and one rectangular distribution were used. 
These are designated, respectively, as L 
(long), M (medium), S (short), SS (very 
short), B (bimodal), and R (rectangular). 
The first 29 or 30 trials in Distributions 
L, M, and S used the modal foreperiod and 
thereafter the catch foreperiods were dis- 
tributed randomly with 5-10 modal fore- 
period trials between successive catch fore- 
periods. Seven different orders of Distribu- 
tion SS were prepared in which the modal 
foreperiod was repeated for the first 9-15 
trials and thereafter the catch foreperiods 


were presented in random order with 4-15 
modal trials between successive catches. 
Two orders of Distribution B were used in 
which both modal foreperiods appeared in the 
first 3 or 10 trials. Thereafter, catch fore- 
periods were used at intervals of from 3 to 
10 trials. The number of trials used for 
each foreperiod are shown in Table 1 in the 
results section. Session length varied from 
15 to 20 min. 

Practice.—The 22 Ss of Distributions L, 
M, and S attended one session. The six Ss 
of Distribution SS attended one session a day 
on nearly successive days. One S attended 
for 3, another for 4, two for 5, one for 7, 
and one for 8 days. On each day each S 
received a different form of Distribution SS. 
The six Ss of Distribution B had two ses- 
sions separated by a rest period of 10 min. 
but only the results of the second session are 
reported. 

Subject—A total of 34 college students 
(19 female) were distributed among the 
various conditions as shown in Table 1. 
One male S was used in Distributions L, M, 
and B. Otherwise all groups were inde- 
pendent. None had any RT experience prior 
to their participation in the present study; 
three of the Ss in Distribution B and two of 
the Ss in Distribution SS had prior experi- 
ence with other distributions in exploratory 
experiments. Four Ss in their first session 
showed excessively long RTs at all fore- 
periods or an excessive number of premature 
reactions and they were dropped from the 
study. 
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Fic. 1. Mean reaction time as a function 
of foreperiod duration for unimodal and 
bimodal frequency distributions of foreperiod. 
(See Table 1 for the number of observations 
used in determining each point.) 


DEVELOPMENT OF RESPONSE READINESS IN RT 


TABLE 1 


Mean RT In MiLLISECONDS AS A FUNCTION OF FOREPERIOD IN MILLISECONDS FOR 
Various FOREPERIOD DISTRIBUTIONS 
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Distribution L; N = 11 


Foreperiod 150 350 450 550 650 750 950 

Mean RT 365 235 191 192 193 239 285 

No. Trials per S 3 3 3 162 ER 3 3 
Distribution M; N = 4 

Foreperiod 150 250 350 450 550 

Mean RT 253 201 186 224 257 

No. Trials per S 5 5 160 5 5 
Distribution S; N = 5 

Foreperiod 100 150 200 250 300 350 400 

Mean RT 247 220 199 210 209 212 240 

No. Trials per 5 3 3 3 159 3 3 3 
Distribution SS*; N = 6 

Foreperiod 100 150 200 250 300 350 400 

Mean RT 259 221 252 271 272 251 328 

Mean RT 252 197 205 248 291 283 320 

No. Trials per 5 

for each ion 2 108/114 2 2 2 2 2 
Distribution B; N — 6 

Foreperiod 100 150 200 250 300 350 400 450 

Mean RT 221 190 181 203 199 178 188 226 

No. Trials per 5 4 48 4 4 4 48 4 4 
Distribution R; N = 5 

Fi i 100 150 200 250 300 350 400 

A RES T EU io edP! 223 cook AZT VAAG 20A 

26 26 26 26 26 26 26 


No. Trials per S 


RESULTS AND DISCUSSION 


Figure 1 shows that for all the uni- 
modal, leptokurtic foreperiod distri- 
butions given in Table 1, there is a 
marked tendency for average RT to 
reach a minimum near the modal 
foreperiod and then to increase appre- 
ciably within the next 200 msec. f 

The significance of these trends is 
indicated by an analysis of variance 


a Second set of means obtained after Ss were practiced in from four to eight repeated sessions. 


using orthogonal polynomials (Winer, 
1962) which yields a quadratic com- 
ponent significant considerably beyond 
the 1% level for Curves L, M, and S. 
On the other hand, while an overall 
variance analysis yields highly signifi- 
cant results (p < .01) for Curves SS: 
and SSm, their quadratic components 
are not significant. . All Ss in both 
curves reach a minimum RT at or near 
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150 msec. from which they show a sub- 
stantial rise as indicated by the average 
trends of Fig. 1. 

These results suggest that S’s readi- 
ness develops in accordance with the 
probability of occurrence of a fore- 
period of a given length and in relative 
independence of the conditional prob- 
ability of occurrence of the stimulus 
during any given foreperiod. It is as 
if S presets the development of his 
readiness and "tunes" it to the statisti- 
cal properties of the distribution of 
foreperiods. Readiness then runs its 
course regardless of when (at least 
within the range of foreperiods used) 
the reaction stimulus occurs. 

While Mowrer’s (1940) function 
shows an increase in RT from 4 to 
11 msec. for foreperiods greater than 
the modal foreperiod of 12 sec., the 
increase in RT in the present study was 
much greater over a much shorter 
change in foreperiod length. The dif- 
ference between the two sets of results 
may be due to the possibility that with 
the longer foreperiods used by Mowrer, 
S has more time to monitor current 
information about the occurrence of 
the stimulus and can use this infor- 
mation in maintaining relatively con- 
tinuous regulation of the degree of 
readiness. It is also possible that the 
rate of development of preparation in 
some way influences the length of time 
over which S can maintain a state of 
heightened readiness. Thus, the slower 
rate of development implied by 
Mowrer’s functions may make it pos- 
sible for S to maintain his state of 
heightened readiness more easily than 
is the case with the much more rapid 
development of readiness found in the 
present experiment. Finally, S may 
"tune" his peak readiness more broadly 
because of the greater absolute error in 
estimating longer time intervals. 

Tn general, the readiness curves ob- 
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tained with the shorter foreperiods of 
the present study suggest a “ballistic” - 
type of preparation which follows a` 
preset course of development inde- 
pendent of current information input. 

To determine if S could learn the 
statistical structure of the distribution 
and in this way modify his readiness 
curve to take greater account of the 
probability of occurrence of the stimu- 
lus during the long foreperiods, a group 
of six Ss was given repeated sessions 
with Distribution SS on successive 
days. In general, the results were 
equally negative for all Ss who prac- 
ticed on the average of 5 days. The 
group results are shown in Fig. 1 
where the curves for first (SS;) and 
last (SS) sessions are compared. If 
anything, tuning of the preparatory 
process to the modal foreperiod seems 
sharper. Also of interest is the sug- 
gestion of a recovery at 350 msec. in 
both curves, although in no case did 
latency drop back to its former level. 

Since repeated sessions did not elimi- 
nate the increase in RT found with 
Distribution SS, it was felt that a bi- 
modal leptokurtic distribution might: 
provide another means of determining 
whether training could modify this 
characteristic, Distribution B consists 
of a series of foreperiods in which two 
intervals instead of one are repeated fre- 
quently (see Table 1). If peak readiness 
were reached in this bimodal distribu- 
tion near the first mode then the greater 
probability of longer foreperiods should 
help in maintaining this peak, unless 
the increased latency obtained with the 
unimodal distribution reflects some in- 
herent characteristic of rapidly devel- 
oped states of readiness which prevents 
their being maintained at peak. Figure 
1 plots the results and shows that when 
the development of peak readiness is 
geared to a very short foreperiod, it is 
not maintained. The longer mode 
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Fic. 2. Mean reaction time as a function 
of foreperiod duration for selected fore- 
period distributions in the present study com- 
pared with the function determined by Lans- 
ing, Schwartz, and Lindsley (1959). 


clearly had an effect but a recovery 
period was required before the second 
peak could be reached. 

Every one of the six Ss displayed 
the tendency shown in Fig. 1 and 
when analyzed by orthogonal poly- 
nomials, highly significant (P <.01) 
results were found for both quadratic 
and quartic components. The former 
component reflects the increase in 
latency at both ends of the foreperiod 
distribution while the latter component 
takes account of the increase in the 
middle. 

In Fig. 2, Distributions R and SS: 
are compared with a portion of the 
foreperiod-RT function as approxi- 
mated from Fig. 1 in the study by 
Lansing et al. (1959) in which a rec- 
tangular distribution of foreperiods 
ranging from 50 to 1,000 msec. was 
used. 

Their curve shows an inflection in 
the region of 250-300 msec. and this is 


about where the percentage of trials in 
which alpha blockade was complete, 
reached 100. They suggested that this 
result might stem from a common 
process underlying the development of 
both readiness and alpha blockade. 
However, the two distributions from 
the present study show that, on the 
average, maximum or near-maximum 
readiness (minimum RT) can be 
reached by 150 msec. This finding 
casts some doubt on the interpretation 
of Lansing et al. unless it can be shown 
that the average time to complete alpha 
blockade under present conditions is 
also 150 msec. or less. On the basis 
of other reports this may not be likely 
(Stamm, 1952), although alpha-block- 
ing latencies as low as 160 msec. have 
been reported (Jasper & Cruikshank, 
1937). An extension of the present 
study adding the EEG measurements 
needed to explore this question is now 
in progress. 
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CLASSICAL CONDITIONING WITHOUT DISCRIMINATION 
TRAINING: A TEST OF THE GENERALIZATION 
THEORY OF CS INTENSITY EFFECTS * 


G, ROBERT GRICE, LARAINE MASTERS, anp DAVID L. KOHFELD 
University of Illinois 


Eyelid conditioning was studied in a situation in which the CS 
consisted of a shift in stimulus intensity from one level to another. 3 
intensity values were employed and each of them served equal time 
and equally often as the background intensity. Significant within-Ss 
effects were obtained which were dependent upon the amount of stimu- 


lus change. Direction of change had slight effect. 


These findings are 


contrary to the prediction of a theory which states that CS intensity 
effects are dependent upon the generalization of inhibition from the 


background intensity. 


Until recently, there have been two 
main theories of the effect of variation 
in stimulus intensity upon response 
evocation, Hulls (1949) dynamo- 
genic theory assumes that the stimulus 
intensity dynamism (V) is a mono- 
tonic function of stimulus energy, and 
that Y enters into a multiplicative rela- 
tion with habit strength and drive in 
the determination of reaction potential. 
Perkins (1953) and Logan (1954) 
proposed an alternative view which 
they contended was deducible from the 
remainder of Hull’s theory and which 
made the dynamism concept super- 
fluous.? Essentially, the notion was 
that during intertrial intervals, re- 
sponse to the background intensity 
(usualy zero) becomes extinguished, 
and that the inhibition generated by 
this extinction generalizes to the re- 
inforced stimuli. The more intense the 
stimulus is, the less similar it is to the 
background, and therefore, the less 
subject it is to generalized inhibition, 


1 This investigation was supported by Pub- 
lic Health Service Research Grant MH 
08033, from the National Institute of Mental 
Health. 

? Actually, this view was briefly anticipated 
by Mowrer and Lamoreaux ( 1951, p. 200, 
Footnote 6). 


In the case of response evocation by 
stimulus offset, the explanation works 
equally well in reverse. 

This theoretical picture has been 
changed by the work of Grice and 
Hunter (1964). They found that the 
size of the intensity effect was greatly 
increased if S were exposed to more 
than one intensity during the experi- 
ment. This finding is disproof of 
Hull’s hypothesis that these effects are 
a function of the absolute intensity of 
the stimulus. One aspect of the Grice 
and Hunter data also casts doubt upon 
the adequacy of the Perkins and Logan 
theory. In their eyelid conditioning 
experiment, they found that a 50% 
reduction in the number of reinforced 
presentations of a 100-db. tone, to- 
gether with the introduction of an equal 
number of reinforced presentations of 
a 50-db. tone, resulted in a significant 
increase in the level of conditioning to 
the louder tone. This is the opposite 
of the prediction of a generalization 
theory which would lead to the expec- 
tation of generalization decrement. In 
view of these findings, Grice and 
Hunter tentatively proposed a dyna- 
mogenic theory dependent upon stimu- 
lus change rather than stimulus onset, 
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and suggested the use of adaptation- 
level concepts rather than mere stimu- 
lus energy. 

The present paper is a further at- 
tempt to evaluate experimentally the 
Perkins-Logan generalized inhibition 
theory. Evidence cited in favor of this 
theory includes two bar-pressing stud- 
ies with rats (Gray, 1965; Perkins, 
1953) in which intensity effects were 
obtained only with discrimination con- 
ditioning procedures, In this same 
context, Perkins (1953) concluded that 
classical conditioning necessarily in- 
volves differential conditioning and 
cited Skinner (1938) and Mowrer and 
Lamoreaux (1951) as having pre- 
viously made the same point. By dif- 
fertial conditioning he, of course, re- 
ferred to the presence and absence of 
the CS. Now this claim may have 
been true of all previous classical 
conditioning experiments, but the ex- 
periments reported here introduce a 
design for which it is not true. This 
has been accomplished by arranging 
that there be no intensity which is con- 
sistently background or CS. This 
situation constitutes a test of the 
theory, for if there exists no back- 
ground intensity to serve as the origin 
of an inhibitory gradient, the theory 
predicts that intensity effects will not 
occur. In the present experiments, 
three stimulus intensities were used: 
Soft and Loud Tones, and Off. Each 
CS consisted of a shift from one of 
these states to one of the other two, 
the new value remaining on during the 
intertrial interval and providing the 
starting point for the next CS. Thus, 
each of the six possible stimulus 
changes constituted the CS equally 
often, and each of the three intensity 
values serves as the intertrial back- 
ground one third of the time. Eyelid 
conditioning procedures were used and 


TABLE 1 


Mean NuMBER or CRs ror EACH 
STIMULUS CHANGE 


Exp. | OS so SL Ls OL LO 
I | 4.37 | 4.93 | 9.17 | 8.07 | 8.83 | 7.53 
IL | 3.97 | 4.03 | 5.43 | 4.80 | 5.00 | 4.97 ; 


each CS was paired with the air-puff 
UCS. 


METHOD 


The general conditions of the experiment 
were similar to those employed by Grice and 
Hunter (1964). The UCS was a puff of 
air to the cornea at 1 psi. The intertrial 
interval was 15, 20, or 25 sec. in an irregular 
order. A buzzer warning signal preceded 
the CS by an average of 3 sec. Auditory 
stimuli were presented by earphones and 
consisted of 1,000-cycle tones of 50 and 100 
db. SPL for Exp. I. Experiment II dif- 
fered only in that the louder tone was 70 db. 
rather than 100. The two tones plus an Off 
condition represented the three states of 
constant stimulation. The CS consisted of 
a shift from one level to another. Switch- 
ing was between the two channels of a 
Grason-Stadler electronic switch set with 
30-msec. rise or decay time. The air puff 
followed the tone change by .5 sec. Refer- 
ring to the three states as Off (O), Soft 
(S), and Loud (L), there were six stimulus 
changes: O-S, S-O, S-L, L-S, O-L, and L-O. 
One basic, irregular order for 96 trials was 
constructed for 16 presentations of each 
change. The order was so constructed that 
each change occurred twice in each block of 
12 trials. From this basic arrangement six 
orders were derived by assigning each of the 
stimulus changes once to each of the six 
positions in the basic order. This consti- 
tutes a counterbalancing of position in the 
sequence of conditioning trials, The first 
conditioning trial was preceded by an "inter- 
trial interval" during which one of the tones 
or the Off condition was in effect. Thus, 
each of the six orders began with a different 
stimulus change. During each intertrial in- 
terval, the state persisted which resulted 
from the shift of the preceding trial, and this 
state provided the starting point for the shift 
of the next trial. 

The S’s were 60 undergraduate women, 
There were 30 in each experiment, with 5 


512 G. R. GRICE, L. MASTERS, AND D. L. KOHFELD 


TABLE 2 
ANALYSES OF NuMBER or CRs 
Exp. I |Exp.II 
Source df 
F F 
Between Ss 29 
“Order (0) 5 .92 «30 
Error» (Mean Square) | 24|(102.17) | (51.63) 
Within S; 150 
Change Q 2| 2T.55*| s.og** 
Direction (D) 1| 3.88 «51 
CXD 2| 5.49% «81 
C XO 10 250 78 
DXO 5 -20 B 
CXDXO 10 E 
Ertorw (Mean Square) |120| (6.11) | (3.27) 
Errors (C, C X O) 
(Mean Iu 48| (10.22) (3.73) 
Errory (D, D XO) 
(Mean Square) 24| (417) | (3.55) 
Errory (C X D, 
C XD X0) 
(Mean Square) 48| (2.98) | (2.67) 
"*p «.01. 
+*+ p < 001. 


assigned to each of the six presentation 
orders. 

CRs were scored if they occurred 150 
msec. or more after the stimulus change and 
before the UCS. With the gain setting used, 
responses were scored if they resulted in a 
pen deflection of 1 mm. or more. The 
same gain calibration and criterion have been 
employed in all previous reports from this 
laboratory. 


RESULTS 


The results are presented as mean 
numbers of CRs for each of the six 
stimulus changes, There were 16 pres- 
entations of each stimulus change. 
These data are presented in Table 1, 
Analyses of variance are presented in 
Table 2. The data have been analyzed 
with respect to the particular intensities 
involved in the Change and Direction, 
that is, increase or decrease in inten- 
sity. The Change effect is significant 
in both experiments, but is considerably 
greater in Exp. I, where it accounts for 
about half of the within-S variance and 
over 80% of the nonerror variance 
within S's. The Direction effect does 
not reach significance in either experi- 


ment, although the Change X Direction 
interaction is significant for Exp. I. 
An inspection of the means indicates 
that an increase in intensity leads to 
more responses than a decrease when 
the Loud tone is involved in the 
change. Neither Order, nor any inter- 
action involving Order is significant. 


Discussion 


The primary finding of these experi- 
ments is that intensity effects were ob- 
tained, even though the explanation in 
terms of generalized inhibition from 
background intensity is not applicable 
with this procedure. This finding, in 
combination with the previous results of 
Grice and Hunter (1964), casts consid- 
erable doubt upon the adequacy of the 
generalization theory as an explanation 
of stimulus intensity effects. The fact 
that the greatest effect was obtained in 
Exp. I with the 100-db. tone is to be 
expected. It is further to be noted that 
the effect obtained is limited to the dif- 
ference between those changes involving 
the loud tone and those involving only 
the soft tone. This result is sensible 
from two points of view. Considered 
with respect to loudness functions, the 
Off-Soft relation is a considerably smaller 
loudness difference than the others. This 
finding may also be regarded as in agree- 
ment with those of Chocholle (1948) for 
reaction time, in which the intensity 
effect for both onset and termination of 
auditory stimuli was found to be nega- 
tively accelerated with respect to in- 
creasing intensity, 

In general, the results indicate that 
the dynamogenic effect depends upon the 
amount of stimulus change rather than 
its direction. The exception to this in 
Exp. I is a small effect but could be 
of some interest if it holds up in further 
research. This finding was that onset 
of the 100-db. tone was more effective 
than offset. It could be that the long 
exposure to this tone during the inter- 
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trial interval produced enough adapta- 
tion that offset actually was a smaller 
sensory change than onset. 
effect was not observed in Exp. II with 
the 70-db. tone. 

One rather unexpected finding of this 
research was the substantially greater 
level of conditioning obtained in Exp. Ts 
This could be interpreted as a between-Ss 
intensity effect due to the difference 
between the 100- and 70-db. tone, but 
the Grice and Hunter findings suggest 
that this is probably not an adequate 
explanation, One possibility is that 
the extended presentation of the rather 
unpleasant 100-db. tone between trials 
served to induce tension or increase the 
overall drive level. 

It is quite clear that this research 
constitutes a definite source of embarass- 
ment to the generalized inhibition theory 
of intensity effects. On the other hand, 
the technique is new and we do not know 
very much about it. Further work must 
be done, exploring the characteristics of 
this procedure. 


Such an. 
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CHANGE OF ITEM FUNCTION IN PAIRED- 
ASSOCIATE LEARNING ' 


ROBERT K. YOUNG ann TROY R. BICKERSTAFF 
University of Texas 


The effects of change of item function from one paired-associate list 
to another were studied by having each S learn 2 lists composed of the 
same items, 3 the Ist-list stimuli became 2nd-list responses and 3 


the 1st-list responses became 2nd-list stimuli. 


The arrangement of the 


2nd list was such that for each pair either 2, 1, or 0 items changed 
function from stimulus to response or vice versa from the Ist to the 
2nd list. It was found that as the number of items in a pair changing 
function increased, difficulty of learning increased. 


It is well established that the number 
of functions an item serves in a paired- 
associate (PA) list is of considerable 
importance both in learning and in 
transfer (Young & Jennings, 1964). 
If an item serves two functions (i.e, 
both as a stimulus and as a response) 
in a PA list, the list is more difficult 
to learn than if the item serves a single 
function—as a stimulus or asa response 
but not both. This is true whether the 
item serves as a stimulus in one pair 
and as a response in another in a 
double-function PA list (Umemoto & 
Hilgard, 1961; Young, 1961) or al- 
ternates from trial to trial as stimulus 
and response within the same pair 
(Underwood & Keppel, 1963). 

Almost nothing is known about the 
effects of change of item function 
(CIF) from one PA list to another. 
Furthermore, the possible effects of 
CIF has received little attention in the 
controversy over whether forward or 
S-R associations are Stronger or 
weaker than backward or R-S associa- 


1 The research reported in the present paper 
was supported in part by National Science 
Foundation Grant GB-3629 to the senior 
author and David T. Hakes and in part by 
Public Health Research Grant HD-01062 
for the National Institute of Child Health 


and Human Development to William F. 
Battig. 


tions (Asch & Ebenholtz, 1962; Feld- 
man & Underwood, 1957). 

The results of certain studies would 
suggest that CIF does exert some in- 
fluence on the amount of transfer from 
one list to another. Murdock (1958), 
for example, compared the A-B, A-C, 
the A-B, B-C, and the A-B, C-A 
paradigms of negative transfer. He 
found that the latter two paradigms, 
the two in which CIF occurred, showed 
somewhat greater, although insignifi- 
cant, amounts of negative transfer than 
did the A-B, A-C paradigm in which 
no CIF took place from the first to 
the second list. Jennings (1963) has 
obtained similar results and has sug- 
gested that CIF would lead to a de- 
crease in performance, thereby playing 
a much more important role in transfer 
designs than has been previously recog- 
nized. 

The present experiment represents 
an attempt to determine the effects of 
CIF from one PA list to another, both 
from stimulus to response and from 
Tesponse to stimulus. This was ac- 
complished through the orthogonal 
manipulation of both stimulus and re- 
sponse changes between first and sec- 
ond lists. Thus in one paradigm both 
stimulus and response functions re- 
main unchanged from first to second 
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lists, in two others the stimuli or re- 
sponses, but not both, maintain the 
same function in both lists, and in a 
fourth neither stimuli nor responses re- 
tain the same function. These four 
paradigms thus define a CIF dimension 
along which pairs retain either 2, 1, or 
0 functions from first to second lists. 


METHOD 


General—Each S learned two PA lists 
composed of eight nonsense-syllable pairs. 
The same items were used in both lists but 
their arrangement was changed from the first 
to the second so that half the first-list 
stimuli became second-list responses and 
half the first-list responses became second- 
list stimuli. These relationships are illus- 
trated in Table 1 in which is presented the 
first and second lists from one arrangement 
of one of the several samples of items used 
in the experiment.  First-list stimuli are 
presented in lower case and first-list. re- 
sponses are presented in upper case. It 
can be seen from the table that the second 
list includes two S-Rr pairs (first-list stimuli 
serve as stimuli and first-list responses serve 
as responses in the re-paired second list), 
two S-S pairs (first-list stimuli serve both 
as stimuli and responses), two R-R pairs 
and two R-S, pairs. In no case were the 
same two items paired in both the first and 
second lists. 

Design.—The four sublists of pairs can 
be thought of as a 2X2 within-S design. 
The row variable (Stimulus) designates the 
first-list function of the second-list stimuli 
and the column variable (Response) desig- 
nates the first-list function of the second-list 
responses. Two levels of meaningfulness 
(9395 and 2096 association value) were em- 
ployed. The verbal materials were taken 
from lists prepared by Glaze (Hilgard, 
1951) and were as dissimilar as careful in- 
spection would allow. Within each level 
of meaningfulness two samples of items were 
used and within each sample two arrange- 
ments of first-list items were used with a 
common second list. Thus each paradigm 
was represented by four different pairs 
within each sample of items. The two Es 
ran equal numbers of Ss in each condition, 
the design may be summarized as a 2° fac- 
torial with the first two factors representing 
within-S variables and the last four factors 
representing between-S variables. A total 
of 48 experimentally naive Ss from introduc- 


TABLE 1 


First AND SECOND Lists UNDER ONE OF 
THE 20% CONDITIONS 


First List | Second List | Paradigm | Ndyunetlons 
zid yun |zid Fur | S-R S-R. 0 

oh TIV oh VAK 

eg vax |heg saj |S-R S-S 1 
qam LEB |qam wef 

buq xoc | DEJ xoc S-R R-R 1 

saj FUP | RUY LEB 

ciw DEJ | YUN ciw | S-R R-S, 2 

wef RUY |TIV buq 


Note.—Upper and lower case are used to facilitate 


tory psychology classes served in the experi- 
ment (N —3 per cell). 

Procedure.—Each list was learned by the 
anticipation method to a criterion of one 
perfect recitation. The lists were learned at 
a 2:2-sec. rate with a 4-sec. intertrial in- 
terval on a Lafayette memory drum and all 
items were presented in upper case. For 
each list four orders of presentation were 
used to avoid serial learning. Approximately 
1 min. elapsed between the learning of the 
first and second lists to allow time to 
change lists and give additional instructions. 
The Ss were assigned to groups according 
to a prearranged random order, Four Ss 
were dropped for failure to learn or 
mechanical difficulty ; each was replaced by 
the next S to appear at the laboratory. 


RESULTS 


First-list. learning.—Trials to learn 
the first list were analyzed. In this 
analysis meaningfulness was found to 
be an effective variable. The mean 
numbers of trials to learn the 93% and 
20% lists were 18.37 and 24.92, re- 
spectively, F (1, 32) = 4.32, p < .05. 
No other Fs approached significance 
in the analysis, and the hypothesis of 
equivalent groups and lists within 
levels of meaningfulness cannot be re- 
jected. 

Second-list learning.—The correct 
responses per trial for each of the 
second-list sublists were analyzed and 
the means for the S-R,, S-S, R-R, and 
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R-S, paradigms were found to be 1.19, 
1,06, 1.05, and .97, respectively. The 
Response variable (the S-R,, R-R vs. 
S-S, R-S, comparison) was significant, 
F (1, 32) = 5.37, p < .05. The pairs 
containing responses which served as 
responses in the first list were given 
correctly more often than those pairs 
containing responses which were em- 
ployed as first-list stimuli. Similarly, 
the Stimulus variable (the S-R,, S-S 
vs. R-R, R-S, comparison) was signifi- 
cant, F (1, 32) = 4.60, p < .05. The 
pairs containing stimuli which served 
as stimuli in the first list were learned 
more quickly than pairs containing 
stimuli which were used as first-list re- 
sponses. The results of this portion of 
the analysis can be summarized by 
noting that pairs composed of items 
changing both stimulus and response 
functions from first to second list 
(R-S, pairs) were more difficult to 
learn than those pairs which contained 
only one item changing function (S-S 
and R-R pairs) and that these pairs in 
turn were more difficult to learn than 
those pairs composed of items retaining 
the same function from the first to the 
second list (S-R, pairs). This CIF 
variable was found to be significant, 
using the pooled within-S error term, 
F (2, 96) = 5.14, p < .01. No dif- 
ference was found between the two 
paradigms (S-S and R-R) which had 
a single item from each pair retaining 
the same function in both lists, F 
« 1.00. 

Meaningfulness was also found to 
yield a significant effect, F (1, 32) 
= 9.42, p < 01. The mean numbers 
of correct responses per trial for the 
eight pairs in the 93% and 20% groups 
were 4.59 and 3.94, respectively. The 
interactions of meaningfulness with the 
Stimulus and Response variables, on 
the other hand, all had Fs < 1 and no 
other F approached significance. 
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Errors during second-list learning — 
The analysis of overt errors made dur- 
ing second-list learning was in terms 
of the functions the incorrectly emitted 
items had served in the two lists. The 
variances in this analysis were found to 
be heterogeneous but the data were 
not transformed. Items which had 
served as first-list responses were 
given incorrectly more often than those 
which had served as first-list stimuli, 
F (1, 32) =9.70, p < .01. Similarly, 
items which had served as second-list 
responses were given incorrectly more 
often than those which had served as 
second-list stimuli, F (1, 32) = 121.55, 
p<.001. No other Fs were signifi- 
cant in the analysis. In summary it 
should be noted that the function 
an item served in the two lists was 
extremely important in determining 
whether it would be given incorrectly 
during second-list learning. No items 
which had served in both the first and 
second lists as stimuli were ever elicited 
incorrectly. Similarly, items which 
had served as responses in both lists 
accounted for more than half the overt 
errors made during second-list learn- 
ing. Finally, the numbers of errors 
coming from outside the second list and 
of partial errors which could not be 
identified as to source were negligible, 
numbering .01 error per trial. 


Discussion 


The results indicate that CIF is closely 
related to the difficulty of learning 4 
subsequent list and that the magnitude © 
the effect is approximately the same 
whether stimulus or response function 
is manipulated. The observed differences 
were almost entirely attributable to the 
number of second-list items (2, 1, OT 
changing function from the first list, and 
no difference was found between those 
two conditions (S-S and R-R) in which 
a single item retained the same function 
in both lists. 3 1 
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The results of the error analysis tend 
to agree with the learning data. That 
is, the error analysis indicates that the 
function an item served in both the first 
and second lists was of importance, Items 
which served as responses in both lists 
tended to be elicited most often while 
items serving as stimuli in both lists 
never occurred as errors, thus indicating 
identification of item function to be an 
important. aspect of PA learning and 
transfer. CIF tends to disrupt second- 
list performance until $ is able to learn 
the function that the item serves in the 
second list. 

A bias favoring the S-R, and R-R 
paradigms may exist because Ss tended 
to give erroneous responses from these 
two paradigms and some items should 
have been given correctly through guess- 
ing. Fewer items should have been given 
correctly through guessing in the S-S 
and R-S, paradigms, Because of this 
a question is raised about the assumed 
equivalence of the S-S and R-R para- 
digms. The guessing bias of Ss would 
suggest that the S-S paradigm was easier 
than the R-R paradigm but that the two 
had equal numbers of correct responses 
because the latter paradigm was favored 
by guessing. If this were the case, it 
would then be necessary to modify the 
conclusion that CIF is the major variable 
influencing learning in the present ex- 
periment. 

The guessing bias suggests that CIF 
from stimulus to response results in an 
easier list than vice versa. However, 
Goulet and Barclay (1965) found sig- 
nificantly greater negative transfer in the 
A-B, C-A paradigm than in the A-B, 


'B-C paradigm and these results are just 


opposite from what would be expected 
from the guessing hypothesis. Thus the 
CIF are 


data on stimulus vs. response 
not unequivocal. 

The Murdock (1958) experiment com- 
pared two negative transfer paradigms 
in which a single item in each pair 
changed function with a negative transfer 
paradigm in which no items changed 
function, Murdock's first list was un- 
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mixed while the second list consisted of 
an experimental vs. control mixed list. 
Although Murdock's differences were in 
the same direction as those obtained in 
fhe present experiment, the nonsignifi- 
cance of the difference between those 
paradigms in which CIF occurred and 
the paradigm in which CIF did not occur 
may have been due to the fact that in 
the former S realized such a change had 
been made. It would appear that if S 
had recognized the relationship which ex- 
isted between the lists and had used this 
as a cue in the learning of the second list 
this would tend to reduce the effect of 
CIF. 

The data of the present experiment 
were obtained in a mixed-list design such 
that each S simultaneously learned pairs 
representing four different transfer para- 
digms, In such a design it is doubtful 
that $ would be able to develop a set to 
use old responses as stimuli or to use 
old stimuli as responses, as would be 
possible in an unmixed-list design in 
which all stimuli become responses or 
all responses become stimuli. 

Attempts to determine the effects of 
CIF in unmixed-list designs have not 
been entirely successful. Jennings ( 1963) 
used an unmixed-list transfer design and 
obtained results which indicated that CIF 
is a source of interference. On the other 
hand, Champ (1965) also using an un- 
mixed-list design investigated the influ- 
ence of CIF upon four negative transfer 
paradigms and obtained results which 
were largely negative. However, diffi- 
culties involving the two samples of items 
in Champ’s experiment prevented a 
straight-forward interpretation of the re- 
sults. Clearly, further research is needed 
to determine the conditions under which 
CIF is an effective variable, 

Tf CIF is found to be an effective vari- 
able both in mixed and unmixed designs, 
this would be of importance in experi- 
ments dealing with stimulus recall as well 
as in transfer experiments. That is, it 
is apparent that during R-S or backward 
recall both the S and the R functions 
have been reversed. If CIF retards 
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stimulus recall as suggested by Jennings’ 
and the present results, the difference 
usually obtained between R-S and S-R 
recall, eg., Feldman and Underwood 
(1957), has tended to be overestimated. 
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EFFECTS OF MULTIPLE STIMULUS VALIDITY AND 
CRITERION DISPERSION ON LEARNING 
OF INTERVAL CONCEPTS 


CHARLES N. UHL 
University of Utah 


The basic interval-concept-learning task employed 3 stimuli with rela- 
tive validities of .775, —.225, and .00. A 4X3 factorial design was 
used incorporating squared multiple correlations (R?) of 1.0, .67, .33, 
and 0.0 between the stimuli and the criterion, and criterion standard 


deviations (criterion dispersion, CD) of 3, 5, and 7. 


Ss were given 


360 training trials. Learning performance was a positive linear func- 
tion of R? as measured by: (a) response validity, (b) error scores, (c) 
stimulus dependency, and (e) cue utilizations. The only effect of CD 
was to facilitate utilization of the stimulus whose relative validity was 


—.225 when R? = 1. 


Response sta: 


ndard deviation was reduced with 


smaller values of both R* and CD, although there was a general tend- 
ency for Ss to disperse their responses too widely for maximum accuracy. 
It was concluded that interval concept learning is seriously impeded 


when R? <1. 


The learning of interval concepts 
only recently has begun to be sub- 
jected to experimental investigation. 
The studies of Azuma (1960), Smeds- 
lund (1955), and Summers (1962) 
have shown that Ss can learn to 
make use of several stimuli when 
the relationship between stimuli and 
the criterion value is based on a 
multiple-regression model operating 
on stimulus and criterion values 
located on interval scales of meas- 
urement. The general form of this 
multiple regression of the criterion 
on three stimuli is represented by: 


C = B4,81 + B38 + BaSs 
+A +o [1] 


where C is the criterion, A is a 
constant, S; is the value displayed 
on a stimulus dimension, B; is a 
regression weight, gz.1.2.3 is the stand- 
ard error of estimate, and N is a 


1 Supported in part by grants (4894 and 
08520) from the National Institute of Mental 
Health, United States Public Health Service. 
The data were collected while the author 
was at the University of Alberta. 


random normal number with a mean 
of 0 and variance of 1. Decisions 
about the values of constants in the 
equations and about the parameters 
of the distributions of the variables 
for construction of a learning task 
can be made to suit the purposes of 
the experiment, provided they con- 
form to statistical restrictions and 
are within the limitations of the 
apparatus. The S's task is to deter- 
mine the approximate covariance 
between each stimulus dimension and 
the criterion, then to use such de- 
terminations of covariance to weight 
the stimulus dimensions, and to 
combine weighted stimulus values in 
selecting a response predictive of the 
criterion. It has been pointed out 
(Summers, 1962; Uhl, 1963) that the 
interval-concept-learning task differs 
from the traditional (nominal) con- 
cept-learning task in that: (a) stimu- 
lus values and criterion values com- 
prise interval scales, (b) the weight 
attached to a stimulus dimension 
depends on its linear covariance 
with the criterion, and (c) weighted 
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stimulus values are combined addi- 
tively in the determination of the 
criterion. 

Hoffman (1960) has described in 
some detail the paramorphic repre- 
sentation of human judgment. Many 
of the cases which Hoffman describes 
are judgmental analogues of the 
interval-concept-learning task. For 
example, the clinical psychologist 
who uses psychometric information 
in the assessment of personality 
utilizes a concept about the rela- 
tionship between psychometric scales 
and the criterion personality dimen- 
sion, a concept which often exhibits 
many of the characteristics of the 
interval concept as Hursch, Ham- 
mond, and Hursch (1964), Smedslund 
(1955), Summers (1962), and Uhl 
(1963) have described it. Hoffman 
(1960) has shown that human judg- 
mental performance can be repre- 
sented with considerable accuracy by 
a multiple-regression prediction equa- 
tion even though his Ss had no idea 
of what such an equation is. An 
impressive demonstration of interval- 
concept performance is provided by 
the carnival weight guesser who uses 
visual stimulus information, e.g., 
height, waist girth, etc., to make a 
surprisingly accurate estimate of his 
customer’s weight. 

All of the studies of interval-con- 
cept learning conducted thus far have 
employed tasks in which the multiple 
correlation (R) between stimuli and 
criterion (multiple-stimulus validity) 
was unity. Learning of tasks with 
multiple-stimulus validity less than 
unity requires investigation for theo- 
retical and practical reasons. In- 
terval-concept-learning tasks encoun- 
tered outside the laboratory typically 
do not exhibit multiple-stimulus valid- 
ities of unity; e.g., there is an error 
component in the relationship be- 
tween stimuli and criterion. 


Uhl (1963) found that Ss adjust the 
dispersion of the distribution of their 
responses to the distribution of cri- 
terion values. Response dispersion is 
an important parameter of response 
accuracy in the interval-concept task. 
Cronbach and Gleser (1957) show 
that predictive response accuracy in 
tasks of this type is optimized when 
the standard deviation of responses 
equals the product of the standard 
deviation of criterion values and 
the response-criterion correlation, or 
=ccrc.r. Consequently, if S is to 
optimize his response accuracy, he 
never should permit the standard 
deviation of his responses to exceed 
that of the criterion. Response 
standard deviation should be reduced 
below that of the criterion—the lower 
the response-criterion correlation, the 
more reduction required. 

The purpose of the present in- 
vestigation was to assess the learning 
of the interval concepts as a function 
of multiple-stimulus validity and of 
the dispersion (standard deviation) 
of criterion values. This provided 
an opportunity to estimate the extent 
of learning when multiple-stimulus 
validity is less than unity, and to 
determine the extent to which re- 
sponse distribution is adjusted as a 
joint function of criterion distribution 
and the limits of response-criterion 
correlation imposed by values of 
multiple-stimulus validity. 


METHOD 


E Subjecis.—The Ss were 72 volunteers from 
introductory psychology classes. The S re- 
ceived course credit for participation in the 
first experimental session, and he was paid 
nominally on the basis of his performance in 
a second session, 

Apparatus.—The S was seated at a table 
on which was located the assembly con- 
taining a stimulus and a response panel. 
The vertically mounted stimulus panel was 
directly in front of S at eye level at a distance 
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of about 3 ft. The stimulus- dimensions 
consisted of three rows of 15 1-in. pilot lights 
which were 2 in. apart on center within and 
between rows. From top to bottom these 
rows of lights were colored red, green, and 
amber. The stimulus display on a trial 
consisted of one illuminated light in each 
of the three rows. The position of the 
illuminated light from extreme left to extreme 
right represented stimulus values from 1 to 
15. Beneath the stimuli were two rows of 
294-in. pilot lights spaced 1 in. apart on 
center within and between rows. The upper 
of these two rows contained white lights 
which represented response categories, and 
the lower row of amber lights represented 
criterion categories. A narrow, vertical 
black line was drawn through the stimulus 
panel bisecting each of the rows of lights to 
facilitate S’s location of the midpoint of each 
row. An electric counter was mounted at 
the base of the stimulus panel. A response 
panel, which contained a button switch and a 
rotary switch, sloped down toward S from 
the base of the stimulus panel. The rotary 
switch controlled the illumination of response 
lights. When S turned the knob of the 
rotary switch from the extreme counter- 
clockwise position to the extreme clockwise 
position, each of the response lights from 
left to right (1-29) was successively illumi- 
nated. The S used the button switch to 
register his final response decision. Depres- 
sion of the button switch immediately 
resulted in the recording of S’s response and 
the restarting of the program beginning with 
criterion feedback represented by illumina- 
tion of 1 of the 29 amber criterion lights. 
The disparity between the locations of the 
response light and the criterion light could 
be seen by S at a glance. 

An automatic programming device located 
in an adjacent room controlled the presenta- 
tion of stimulus events, The E was located 
in the room with the programming equipment. 
A display of the response and criterion lights 
was available to E, who manually recorded 
responses and operated a switch which 
incremented the counter on S’s stimulus 
display panel. 

Design.—The experimental design was a 
2X3 X 4 factorial incorporating. sex of S, 
criterion standard deviations of 3, 5, and 7, 
and multiple-stimulus validities squared (R?) 
of 1, .67, .33, and 0, respectively. Three Ss 
were assigned at random to each cell of the 
factorial. 

Tasks.—Twelve learning tasks were con- 
structed, one for each factorial combination 
of the three values of criterion. standard 


TABLE 1 


VALUES ENTERED INTO EQUATION 1 FOR EACH 
OF THE LEARNING TASKS 


Tasks Equation Values 
R Criterion B: B: 

Dispersion 2 3 A 0.123 

3 —.42| .77 | 21.14] 0 

1 5 —.69| 1.29 |10.25| 0 

7 —.97| 1.80 | 8.34] 0 
3 —.34] 63 ‘| 12.67 | 1.73 
«67 5 —.57| 1.05 | 11.11} 2.88 
7 —.79| 1.47 | 9.56 | 4.04 
3 —.24| .45 |13.36| 2.45 
194 5 —.40| .74 |12.26 | 4.08 
7 —.56| 1.04 | 11.17] 5.72 

3 0 0 |15 3 

0 5 0 0 15 5 

T 0 0 |15 7 


Note,—Bi: = 0 for all tasks since v1 = 0. 


deviation (criterion dispersion, CD) and the 
four values of R*% The relative validites 
(v's) of stimulus dimensions (rows of lights) 
1, 2, and 3 were 0, —.225, and .775, respec- 
tively, for all tasks. When it is assumed that 
intercorrelations between stimulus dimen- 
sions are 0, the value of v; = 7;/R*, where r 
is the stimulus-criterion correlation. Thus, 
v is the proportion of stimuli-criterion 
covariance accounted for by a given stimulus 
dimension. The algebraic sign of v is given 
only to represent the direction of the stimulus 
dimension's covariance with the criterion 
(C). Construction of each task required 
determination of the values of the constants 
in Equation 1. The regression weight of a 
stimulus, B;, is obtained by Bie NwRte:/oi, 
where c, and e; are the standard deviations 
of the criterion and stimulus, respectively. 
Stimulus values and random normal numbers 
(N) substituted into Equation 1 were the 
same for all tasks. In any block of 60 trials, 
stimulus values on each dimension were 
approximately normally distributed with a 
mean of 8 and a standard deviation of 3.4, 
and N had a mean of 0 and variance of 1. 
Stimulus intercorrelations did not exceed 
4.10 in any block of 60 trials, and were 
treated as being 0 for computational pur- 
poses. Table 1 gives the values of the 
constants entered into Equation 1 for com- 
putation of values of C.' Two restrictions 
were imposed on C after its computation 
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which remained in effect during the execution 
of the experiment and analysis of data. 
These restrictions were: (a) C was rounded 
to the nearest whole integer, and (5) if C<1, 
it was assigned a value of 1, or if C > 29, it 
was assigned a value of 29. The second 
restriction was necessitated by the limitation 
of the display apparatus, but it did not 
produce a truncated criterion distribution. 

It should be pointed out that S did not 
have to make use of A in Equation 1 in 
selecting his responses if he performed a 
simple transformation on each of the stimulus 
values S; — S; — 8, where 8 is the mean of a 
stimulus dimension. The instructions called 
the possibility of this transformation to S’s 
attention by pointing out the bisecting 
vertical line on the stimulus display panel 
and suggesting that he consider how far a 
given stimulus value was located to the left 
or to the right of the line. 

Procedure.—A set of standard instructions 
was read to S after he had been seated 
in front of the display apparatus. These 
instructions acquainted S with the operation 
of the apparatus and with the general require- 
ments of the task. The S was told to attend 
to the three rows of stimulus lights because 
they represented values which determined 
the criterion value. He was not told that the 
solution to the problem was of the multiple- 
regression form, that the stimulus dimensions 
might differ in the weight of their determina- 
tion of C, or that R? <1. The instructions 
did urge S to treat each of the stimulus 
dimensions separately. The S was advised 
to use the vertical line on the stimulus display 
panel as a reference against which to com- 
pare the locations of stimuli. Although most 
‘Ss have appeared to find the task intrinsically 
motivating, S was informed that he would 
receive points on his counter on the basis 
of the absolute discrepancy between his 
response and the criterion. The number of 
points received followed by the absolute 
discrepancy between response and criterion 
were: 7-0, 4-1, 2-2, 1-3, and 0-4 or more. 

Four practice trials, during which S was 
permitted to ask for clarification of the 
instructions, were followed by 180 training 
trials in S's first session. A trial was initiated 
by the presentation of the stimuli, ie., 
illumination of one light in each of the three 
rows of stimulus lights. Self-pacing of S's 
responding was used, although it was neces- 
sary to request of an occasional S that he 
not delay his response longer than about 
30sec. After S had selected his response by 
rotating the knob on the rotary switch to 
illuminate the desired response light, he 


pushed the button switch to register his 
response. Immediately after the button 
switch was depressed by S, feedback was 
given by illumination of 1 of the 29 criterion 
lights. All display lights were turned off 
2 sec. after response registration. The E 
incremented S’s point counter, if S had 
earned points, immediately after S's response. 
The intertrial interval from lights off to 
onset of the lights on the next trial was 4 sec. 

The S's second session of 180 trials was 
scheduled within 1 wk. of his first session. 
The S was assured at the start of the second 
session that his task remained the same. 
He was told that he could earn up to $1.50 
in this session depending upon the number 
of points he accumulated. The minimum 
actually paid to an S was $1. Each session 
lasted between 45 min. and 75 min. 


RESULTS 


Sex differences.—No significant dif- 
ferences between sexes were found in 
any of the analyses at the .05 level. 
Therefore, sexes were combined in the 
report of results which follows. 

Response validity.—A response va- 
lidity (RV) coefficient (Pearson r 
between responses and criterion val- 
ues) within blocks of 60 trials was 
obtained for each S. These RVs were 
transformed by Fisher's method for 
all statistical treatments. Figure 1 
presents the mean transformed RVs 
over blocks of trials as a function of 
multiple-stimulus validity (R?) and 
criterion dispersion (CD). Curves 
of RV for R?=0 are not included in 
Fig. 1 since they fluctuated about a 
value of 0. A two-way analysis of 
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variance of trends was performed 
on these data. A significant linear 
effect of blocks of training (B), F 
(1, 225) = 229.52, p « .001, reflected 
an increase in RV as training pro- 
gressed. This simply attests to the 
learning of the interval concept. A 
small quadratic component of the B 
effect, F (1, 225) = 6.79, p < .05, 
was due to the diminished rate of 
increase in RV in the last two or three 
blocks of trials, especially when R? <1. 
The total amount of increase in RV 
with training was a positive linear 
function of R? which was reflected 
in a significant linear component of 
the R? X B interaction, F (2, 225) 
= 49,52, p < .001. This effect can 
be seen by a comparison of the family 
of curves in each of the three panels 
of Fig. 1. Mean untransformed 
values of RV observed in the final 
block of trials, followed in parentheses 
by the limit imposed on RV by A, 
were .89 (1) when R? = 1, .54 (.82) 
when R? = .67, and .20 (.58) when 
R = 33. 

Although CD was not represented 
in any significant source in the 
analysis of variance, inspection of 
Fig. 1 suggests that learning was 
facilitated by larger values of CD 
when R? = 1 but not when R? < 1. 
If a relationship of this nature had 


been present, it should have pro- 
duced a significant KR? X CD X B 
interaction. At the risk of capital- 
izing on chance, a separate analysis 
of variance of trends of the effects 
of CD when R? = 1 was conducted. 
A significant linear component of the 
CD XB interaction was obtained, 
F (1, 75) = 4.20, p «.05. When 
R?=1, the increase in RV with 
practice was a positive linear function 
of CD. However, this was a rela- 
tively weak effect, and CD generally 
was not a powerful variable. 

Error scores.—As has been noted 
previously (Uhl, 1963), the absolute 
error of S's responses is an ambiguous 
measure because of its sensitivity 
to several sources of variation. 
Nevertheless, it is a useful measure 
of response accuracy resulting from 
the combined effects of these sources 
of variation. It is the variable by 
means of which S presumably evalu- 
ates his own performance. Six error 
scores (ESs), one for each successive 
block of 60 trials, were obtained for 
each S. Figure 2 displays mean ES 
over blocks of training as a function 
of R? and CD. A two-way analysis 
of variance of trends was performed 
on these data. There was a signifi- 
cant linear effect of CD, F (1, 48) 
— 285.90, p <.001, disregarding blocks 
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of trials. The absence of a significant 
(p < .10) B X CD interaction indi- 
cates that the effect of CD remained 
about the same throughout training. 
Errors were larger with larger values 
of CD. Again disregarding blocks, a 
significant linear R? X CD interac- 
tion, F (2, 48) = 32.65, p < .001, was 
due to an increase in the effect of CD. 
when AA? decreased, i.e., the tendency 
for larger values of CD to produce 
error was enhanced with smaller 
values of R?. Finally, there was a 
significant linear B X R? interaction, 
F (3, 240) = 29.98, p < .001, which 
reflected a more rapid decrease in 
ES with training for larger values of 
R. It is noteworthy that even when 
R? = 0 there was a significant reduc- 
tion in ES from block to block, F 
(5, 85) = 10.19, p < .001. 


TABLE 2 


MEAN OBSERVED Response SD COMPARED 
WITH OPTIMAL RESPONSE SD 


R 
Criterion 
Dispersion 
1.0 67 33 0.0 
3 3.39 2.24 247 2:52 
(2.44) | (1.54) | (0.69) (0.00) 
5 5.20 4.47 4.19 3.92 
(4.21) | (2.80) | (0.62) (0.00) 
7 6.41 6.26 5.56 4.16 
(6.02) | (3.85) | (1.71) (0.00) 


Response dispersion.—The standard 
deviation of each .S's responses over 
blocks of 60 trials was taken as a 
measure of response dispersion (RD). 
Figure 3 shows mean RD over blocks 
of trials as a function of R? and CD. 
A two-way analysis of variance of 
trends was executed on these data. 
There was a significant linear BXA* 
interaction, F (3, 240) = 29.65, p 
X .001, representing a tendency for 
RD to decline more during training 
with lower values of R?. A significant 
linear B X CD interaction, F (2, 
240) = 45.51, p < .001, was due to 
the greater reduction in RD during 
training with lower values of CD. 
When R?=1, mean RD closely 
approximated the CD of the respec- 
tive CD groups, and RD fell further 
and further below CD as R? fell 
below unity. Since optimal RD is 
equal to the product of the CD and 
response validity (Cronbach & Gleser, 
1957), it is of interest to examine the 
extent to which Ss optimized their 
RD. Table 2 presents the optimal 
RD (in parentheses) and the mean 
Observed RD (without parentheses) 
for each combination of values of R? 
and CD for the last block of 60 trials. 
The optimal RD was computed by 
taking the product of a group’s mean 
response validity in the final block 
of trials and its CD. It can be seen 
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that all 12 groups exceeded optimal 
RD, and that the tendency to dis- 
perse responses beyond the optimum 
increased inversely with A?. 

Stimulus dependency.—Yhe multi- 
ple correlation between the three 
stimulus dimensions and S’s responses 
is referred to as stimulus dependency 
(D). It provides a measure of the 
extent to which S’s responses de- 
pended upon the stimulus dimensions 
in a reliable manner, but it does not 
necessarily reflect the accuracy of 
responding. Measures of D were 
obtained over successive blocks of 60 
trials for each S. Figure 4 shows 
mean values of D over blocks of 
trials as a function of R?. A two-way 
analysis of variance of trends was 
performed on a Fisher transformation 
of the D measure. There were 
significant effects of CD, which did 
not interact with blocks, exhibited 
in linear and quadratic components, 
F (1, 48) = 6.71 and 4.55, p < .025 
and p < .05, respectively. The un- 
transformed mean values of D for 
CD Groups 3, 5, and 7 were .74, .75, 
and .68, respectively. The quadratic 
component of the CD effect reflected 
the fact that D was lower when CD =7 
as compared to CD=3 or CD—5, 
but the latter two CD groups did not 
differ from each other. A significant 
linear R? X B interaction, F (3, 240) 
=4.59, p < .01, was attributable to a 
greater increase in D with training 
with larger values of R?. On the last 
block of trials, mean untransformed 
values of D were .95, .81, .63, and 
.54 for R? of 1.0, .67, .33, and 0.0, 
respectively. 

Cue utilization—In a previous 
paper (Uhl, 1963) the Pearson corre- 
lation between a stimulus dimension 
and S's responses was referred to as 
a criteriality. Following the termi- 
nology of Hursch, Hammond, and 
Hursch (1964), such a correlation 


MEAN TRANSFORMED STIMULUS DEPENDENCY 


BLOCKS OF 60 TRIALS 


Fic. 4. Stimulus dependency during training 
as a function of Rè. 


now is termed cue utilization. Cue 
utilizations were computed for each 
stimulus dimension for successive 
blocks of 60 trials for each S. Sepa- 
rate two-way analyses of variance 
of trends were performed on Fisher 
transformations of the cue-utilization 
measure for each stimulus dimension. 
The results of these analyses closely 
parallel those for the other measures 
already reported and are summarized 
here in abbreviated form. The utili- 
zation of each cue tended to undergo 
modification during training as evi- 
denced by significant main effects 
of training. Such adjustments in cue 
utilization during training brought 
the utilization of each cue in closer 
correspondence with the actual cor- 
relation of the cue with the criterion 
(cue validity). Correspondence be- 
tween cue utilization and cue validity 
improved as a positive linear function 
of R?, The only effect of CD was 
upon the utilization of the stimulus 
dimension (S2) with a relative valid- 
ity of —.225. Utilization of Sz de- 
creased more during training with 
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TABLE 3 
Mean RELATIVE CUE UTILIZATIONS 


Stimulus 
P 
Sı 2.00 Sı = —25 Sı =.75 
1.0 .029 —.062 .884 
i (0/18) | (0/18) | (18/0) 
67 .090 —.023 781 
` (0/18) (1/17) (18/0) 
33 251 .030 .503 
j (6/12) (3/15) (12/6) 
0.0 151 152 .367 
P (5/13) | (8/10) (8/10) 


Note.—The rows do not sum to unity because some 
Ss exhibited negative relative cue utilizations. 


larger values of CD as evidenced by a 
significant CD X B interaction. 

In order to compare cue utilizations 
between values of R? it was necessary 
to obtain a relative cue-utilization 
measure (w) according to: 


riBi 


wi = 
Rea? 


where Rz.123 is the square of S's stimu- 
lus dependency, r; is the stimulus- 
response correlation, and 5; is the 
beta weight of the regression of the 
response on the stimulus. Thus, w 
is the proportion of the combined 
covariance of the three stimuli and 
the criterion accounted for by a single 
stimulus as utilized by S. The sign of 
ři was given to w; to preserve repre- 
sentation of the direction in which .S 
utilized the stimulus. Mean values 
of : for each value of R? in the final 
block of 60 trials are given in Table 3. 
If S did not utilize the cues differ- 
entially, w would have an expected 
value of .33 (assuming all 7;’s to be 
positive). It can be seen in Table 3 
that mean w departed from chance 
in the direction of the relative cue 
validity of the respective stimuli 
when R? > .33. The fractional val- 


ues in parentheses in Table 3 refer to 
the number of Ss whose relative cue 
utilization was greater than .33 in 
the numerator and less than .33 in the 
denominator. 


Discussion 


The most prominent result of this 
investigation was the appreciable im- 
pairment of interval-concept learning 
when Rè < 1. When R? <1, there is 
random error in the determination of a 
criterion value by the stimuli, intro- 
ducing a probabilistic feedback situation. 
Also, relevant stimuli have less absolute 
regression weight with lower values 
of R. The impairment of response 
validity by these two factors apparently 
was due to their effect upon stimulus 
dependency rather than upon cue utili- 
zation. Relative cue utilizations ap- 
proximated relative cue validities fairly 
closely when R?> .67, and there was 
little difference between relative cue 
utilizations when R? = 1 as compared 
to R'—.67. Even when R= .33, 
relative cue utilizations did not differ 
appreciably from the relative cue validi- 
ties. However, stimulus dependency 
dropped markedly as R? decreased. 
That is, to the extent that S in a condi- 
tion of 0 < R? <1 based his responses 
upon the stimuli within a multiple- 
correlation model, he utilized the stimuli 
effectively. However, much of his 
responding was unaccountable for by 
that model. Such reduced stimulus 
dependency may have been the result 
of some random error in S's responding 
or to nonlinearity or to nonadditivity. 
All that can be said at the present time 
is that the conditions coincident with 
R «1 tend to discourage utilization 
of stimuli consistently according to the 
multiple-regression model. There is 
reason to expect that this result would 
be diminished if S were given more 
information about the structure of the 
task in his instructions. Research in 
Progress in this laboratory is investigat- 
ing the instructional variable. 

Criterion dispersion (CD), operation- 
ally defined as criterion standard devia- 
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tion, proved to have relatively little 
effect upon learning. An increase in 
CD results in greater information trans- 
mission between each of the stimuli and 
the criterion (Attneave, 1959). Such 
increased information transmission is re- 
flected in the multiple-regression predic- 
tion equation in larger regression weight 
for any stimulus whose relative validity 
is not zero. For S, this increased regres- 
sion weight is exhibited in greater dis- 
placement of the criterion value when 
the stimulus deviates a given amount 
from the mean stimulus value. How- 
ever, the only significant effect of CD 
was upon the utilization of the stimulus 
dimension (S+) having a relative validity 
of —.225. Utilization of Sı (relative 
validity = 0) and of S; (relative validity 
=.775) was not effected by CD. Uhl 
(1963) found that much of the error 
in responding was due to underutiliza- 
tion of a stimulus dimension having 
moderate to low relative validity and to 
overutilization of stimuli having large 
relative validity. Apparently, high val- 
ues of CD tend to counteract this 
tendency, at least in the case of a 
stimulus negatively correlated with the 
criterion. However, CD had no effect 
upon response validity (RV) except 
when R? = 1, and if the effect of restric- 
tion in range upon RV is taken into 
account, differences between CD groups 
when R? = 1 disappear. 

One factor which may have atten- 
uated any effect of CD on learning in- 
volved the feedback reinforcement given 
S. A given amount of error in cue 
utilization produced a larger discrepancy 
between response and criterion value 
with larger values of CD. The effect 
of such response-criterion discrepancy 
presumably would be to cause S to 
modify his utilization of a stimulus even 
when his cue utilization closely approxi- 
mated the cue validity. This effect 
would be particularly destructive when 
R? «1. Some evidence consistent with 
this interpretation that increased CD 
produced instability of stimulus utiliza- 
tion was provided by the depression 


d d dependency with maximum 

It was found that Ss tended to adjust 
the dispersion of their responses to 
criterion dispersion (CD). When R?=1, 
the mean response dispersion (RD) of 
each CD group almost equaled the 
respective CDs. When R? € .33, there 
was a reduction in mean RD during 
training. Such reduction in RD en- 
abled Ss in these groups to improve 
their response accuracy. This was the 
factor which accounted for the signifi- 
cant improvement in response accuracy 
when R? = 0. However, in all cases 
RD exceeded optimal RD. Apparently, 
overdispersion of response is an im- 
portant contributor to error in interval 
concept performance, especially with 
lower values of R?, When R? < .33, RD 
was still declining at the end of training. 
There is a good possibility that it would 
have continued to decline with addi- 
tional training. 
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EFFECT OF PULSE RATE AND INTENSITY 
UPON VISUAL FLASH RATE? 


WILLIAM R. MACKAVEY 


Boston University 


The rate at which an intermittent visual target appears to be flashing 
is a function not only of its actual pulse rate, but of its intensity as 
well. This was shown by having 6 Os match the flash rates of 2 
contralaterally presented targets having various intensity relations. 
The standard target was set to pulse at 4, 8, 12, 16, 24, or 32 cps at 
intensity levels of 630, 63, 6.3, or .63 apparent ftc. The O adjusted the 
pulse rate of the variable, 630 apparent ftc., so that it appeared to be 
flashing at the same rate as the standard. Flash rate and intensity 
were found to be independent at the lowest pulse rates used. At the 
higher values of pulse rate, a 30-db. reduction in intensity produced as 


much as a 100% increment in flash rate. 


It has been reported that the rate at 
which a visual target appears to be 
flickering (flash rate) is related not 
only to its actual pulse rate, but to its 
intensity as well. Whereas a magni- 
tude estimation procedure has revealed 
the relation between flash rate (FR) 
and pulse rate (PR) to be in accord 
with a power function of approxi- 
mately unity (Stevens & Schickman, 
1959), the relation between FR and 
intensity seems to be more complex, 
For example, it has been shown that 
the FR of a “low” intensity target 
can be increased simply by reducing its 
intensity. Within limits, therefore, an 
O confronted with two sources pulsing 
at different rates can equate their FRs 
simply by reducing the intensity of the 
slower. For theoretical reasons, Bart- 
ley (1938), who provided the first 
demonstration of this effect, was more 
concerned with establishing its exist- 
ence than with providing a detailed ac- 
count of its course and extent. As a 
result, little descriptive data were made 


1This research was supported by funds 
made available through the Boston Univer- 
sity graduate school. Thanks are due to 
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available. Subsequent indirect confir- 
mation has been reported by Segal 
(1940) and by workers in Bartley’s 
own laboratory (Nelson, Bartley, & 
Bochniak, 1965). These confirmations 
have been indirect by reason of the 
cross-modality procedure used, i.e, 
visual FR was matched to an acoustic 
intermittency. The use of an acoustic 
intermittency as one member of the to- 
be-matched pair should be avoided, 
however, unless there is a separate 
reason for using it. Not only is the 
intersensory match more difficult than 
its intrasensory counterpart, but the 
acoustic intermittency confounds the 
experiment by producing a change in 
FR (Gebhard & Mowbray, 1959). 
The present experiment was de- 
signed to provide a fuller account of 
the way in which pulse rate, flash rate, 
and intensity interact within the visual 
modality. To accomplish this, a situa- 
tion was devised which permitted the 
presentation of two intermittent targets 
to an O. The observational task was 
to.adjust the PR of one of these, the 
variable, so that it appeared to be flash- 
ing at the same rate as the other, the 
standard. Requiring O to achieve an 
FR match through an adjustment of 


DETERMINANTS OF VISUAL FLASH RATE 529 


PR rather than through an adjustment 
of intensity seemed a more direct ap- 
proach to the problem and was cer- 
tainly an easier observational condition 
to specify for O. Moreover, it did 
not provide O with the biasing sugges- 
tion that changes in the intensity level 
of the standard would have any effect 
upon its FR. 

In brief, if FR is affected by inten- 
sity, this should be revealed by an alter- 
ation in the PR of the variable seen as 
equal to the standard, as the. intensity 
of the latter is altered. 


METHOD 


Subjects —Six male Os were used. Five 
of these were graduate students in psychol- 
ogy, four of whom were experimentally 
naive. The author served as the sixth O. 

Apparatus—The apparatus was designed 
to permit the presentation of two pulsing 
targets, one to each eye. The O's view of 
the targets is depicted in Fig. 1. 

The source for the variable was a Sylvania 
glow modulator tube, R1131C, controlled by 
a Grayson-Stadler flicker fusion apparatus, 
Model E622. The flux from the tube was 
collimated and passed through a piece of 
diffusing glass. The diffusing glass was 
entirely shielded except for those regions 
which were to serve as the variable target 
and fixation point. The variable target and 
fixation point were viewed by the left eye 
through a 2-mm. artificial pupil. 

The source for the standard was a 6-v. 
ribbon filament lamp supplied from an Elec- 
tro, Model H filtered dc power supply. The 
flux from the ribbon filament was focused 
in the plane of a shutter, collimated, and 
brought to focus in the plane of the 
episcotister. The flux was again collimated 
and then interrupted by a piece of diffusing 
glass. This diffusing glass was entirely 
masked except for those regions which were 
to serve as the basis for the standard target 
and the fixation point. Neutral density 
filters could be inserted in front of the 
standard for purposes of intensity control. 
The reflection of the standard target and 
fixation point in a front surface mirror, 
were viewed by the right eye through a 2- 
mm. artificial pupil. A septum prevented 
either eye from seeing the view intended for 
the contralateral eye. Proper registry of 
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Fic. 1. O's view of the target situation. 


the fixation points was aided by means of a 
rotary prism located in front of the right 
eye. The optical distances of the standard 
and variable were identical and hence seen 
in a common plane. 

The PR of the standard was regulated by 
the episcotister which was monitored by a 
Smiths tachometer. The PR of the variable 
was given by the dial setting of the Grayson- 
Stadler, The pulse to cycle fraction in each 
case was .5. 

Procedure.—Each O was shown a drawing 
depicting the arrangement of the targets as 
they should appear when in proper registry. 
Alignment was then achieved with the 
targets in a steady state, ie, nonpulsing. 
The O was then told that the light at the 
top (standard) would be made to flicker 
and that while maintaining gaze upon the 
fixation point he was to adjust the bottom 
light (variable) so that it appeared to be 
flickering at the same rate. 

Pulse rates of 8, 12, 16, 24, and 32 cps were 
used for the standard at intensity levels of 
630, 63, 6.3, and .63 apparent ftc. In addi- 
tion, a 4-cps PR was used for two of the Os. 
The variable was always set at 630 apparent 
ftc. 

In practice, a single PR was selected and 
for that PR, two FR matches were made at a 
randomly selected intensity level. ^ After 
a block of eight matches at that PR, a 
second PR was selected and the procedure 
repeated. All matches were made by con- 
tinuously adjusting the PR of the variable in 
ascending order. If overshoot occurred, as 
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Fic. 2. Flash-rate matches for each 
of the Os. 


was usual, bracketing was permitted. Be- 
tween trials, the glow modulator tube was 
extinguished and the optical path of the 
standard occluded by closing the shutter. 


RESULTS 


The outcome of the FR matches for 
the six individual Os is given in Fig. 2. 
Each point on the curve is the arith- 
metic mean of the matches generated 
by that O. The early termination of 
certain of the curves indicates that the 
standard at that particular combination 
of intensity and PR had fused, an in- 
stance of the Ferry-Porter law. 

When the standard and variable 
have the same intensity, indicated by 
the first point on each curve, FR 
matches should be achieved by setting 
the PR of the variable equal to that of 
the standard. Although the discrep- 
ancy from expectation is larger than 
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that reported by Mowbray and Geb- 
hard (1955), the approximations are 
quite close. Procedural differences 
probably account for the difference, 
such as it is, eg., mildly eccentric 
fixation as contrasted to superposi- 
tion, episcotister and glow modulator 
matches as contrasted to straight glow 
modulator matches, etc. 

The important feature of the results, 
however, is that as the intensity of the 
standard is reduced, the PR of the 
variable seen as equal increases. This 
trend is prominent at the higher PRs 
and gradually diminishes until at the 
lower PRs the match is largely inde- 
pendent of intensity. Because of this 
independence, which emerged in the 
early stages of data gathering, FR 
matches at 4 cps were discontinued 
after Os WRM and HD had been run. 


Discussion 


The magnitude of the effect displayed 
in Fig. 2 is considerable, especially at the 
higher PRs. At these higher PRs the 
effect is greater than one would suppose 
from casually observing the FR of an 
intermittent target as its intensity is 
varied over the same 30-db. range em- 
ployed here. The discrepancy between 
the casual observation and the more 
rigorous matching procedure led to the 
temporary supposition that what Os were 
actually doing was matching flash promi- 
nence rather than flash rate. In other 
words, as the intensity of the standard 
was lowered, the temporal discontinuity 
in brightness which defined the flash con- 
dition gradually became subdued, the 
Ferry-Porter law once again. The vari- 
able target, which remained constant in 
intensity, then required an acceleration in 
PR so that its fusion point was ap- 
proached to the same extent, At this 
point, O reported a match, not because 
FR had been equated but because flash 
prominence had been equated. Such a 
view, although appealing from the stand- 
point of parsimony, has several things 
wrong with it. One of the stronger ob- 
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jections is that it predicts a constant 
ratio between the PR of the variable at 
the lowest intensity to the PR of the vari- 
able at the highest intensity. This ratio 
is identical to the ratio shift in critical 
fusion frequency (cff) caused by the 
intensity change. Thus, if cff is doubled 
by the 30-db. shift in intensity, then all 
PR curves should show a doubling when 
the intensity extremes are compared. 
The data of Fig. 2 show that nothing of 
this kind occurs. Therefore, if changes 
in flash prominence have affected the 
results, they have not done so in a 
predictable way. 

It is clear, however, that the effect of 
the intensity reduction upon FR does not 
become very intrusive at PRs which are 
far below the fusion point. This is con- 
sistent with Bartley’s (1938) conclusion 
from his earlier investigation that, ". . . 
flicker rate (subjective) and flash rate 
(objective) . . . differ as critical flicker 
frequency is reached.” 
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LEARNING THE POSITION 


S OF WORDS RELATIVE 


TO A MARKER ELEMENT * 


MARTIN D. S. BRAINE 
Walter Reed Army Institute of Research, Washington, D. C. 


The learning of conditional definitions of position—"first after g,” 
"second after g"—was studied by exposing children to a subset of the 
sentences of a miniature, semantically empty, “language.” This 
contained 2 phrase types, fA and gPQ, occurring alone or in either 
order. Ss learned the structure, including the conditional positional 
definitions. Errors in sentence-completion test problems indicated a 
tendency to confuse homologous positions in the 2 phrases, particularly 
early in learning. The relationships learned could not reasonably be 
represented by a finite state diagram, and 1 kind of state diagram 
incorporating a push-down store seemed more appropriate than 
another. It was argued that finite state diagrams do not capture the 
properties of associations, particularly remote associations, and that 
an association might better be regarded as a rule relating contingent 
items to an entry in a short-term store, or, perhaps, as a context- 


sensitive rewrite rule. 


In previous work (Braine, 1963), a 
theory of the learning of some aspects 
of grammatical structure was proposed. 
The heart of the theory was the notion 
that "what is learned" are the positions 
of units in a verbal array, and it was 
further suggested that units can be 
hierarchically organized. Thus, people 
might learn the proper locations of 
words and morphemes in various sorts 
of phrases, and what kinds of phrases 
go in what positions in sentences. In 
the miniature “languages” used in the 
experiments, position was always de- 
fined absolutely (eg., “first,” “sec- 
ond”). The purpose of the present 
experiment was to explore the learning 
of conditional definitions of position, 
i.e., “before x,” “first after 4," “second 
after 4," etc, where x is some fre- 
quently recurring element which serves, 
in effect, as a reference point. It may 


1The author is grateful to the Burgundy 
Farm Country Day School, Alexandria, Vir- 
ginia, and to Donald Mills, Principal, for 
provision of Ss and space for this study. 
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be observed that although conditional 
relationships of the form “after x” and 
"before x” pose no obvious theoretical 
problems—similar kinds of relation- 
ships are presumably learned in paired- 
associate experiments— relationships of 
the form “second after x” involve a 
contingency which straddles interven- 
ing material. Remote associative con- 
nections of this sort have been little 
studied, although a recent experiment 
(Braine, 1965b) involved a somewhat 
similar straddling contingency. 

The method used is a modified forin 
of the “verbal reconstructive memory” 
technique discussed elsewhere (Braine, 
1965b; cf. also Smith, 1963, 1965). It 
differs from the method used in Braine 
(1963) in that Ss are given no training 
in the construction of correct rather 
than incorrect “sentences” under condi- 
tions of differential information feed- 
back. 


METHOD 
Description of the System 


The “language” contained two phrase 
types, A phrases and PQ phrases. There 


ASSOCIATION AS A CONTEXT-SENSITIVE RULE 


were four A phrases (a:b: = 0B ORDEM, asbs 
= REMIN GICE, as = KIVIL, a= Noor), which 
were always preceded by a function word 
marker (f=yYARMO), giving rise to the 
sequences YARMO OB ORDEM, YARMO REMIN 
GICE, YARMO KIVI YARMO NOOT. PQ 
phrases also began with a function word 
marker (g = ceD), followed by any of three 
P words (p1= MERVO, ps — YAG, ps— LECK), 
followed in turn by any of three Q words 
(q1 = sOM, qs— EENA, qs = WIMP), €g, GED 
MERVO SOM, GED MERVO EENA, GED YAG EENA, 
etc, Complete sentences consisted of either 
of these phrase types alone ("short" sen- 
tences), or of sequences of two phrases, one 
of each type, in either order ("long" sen- 
tences). Thus, the sentence forms were 
fA, gPQ, fAgPQ, and gPQfA. Since there 
are four A phrases and nine PQ phrases, the 
system contains 85 possible sentences, 72 long 
and 13 short. (Only a subset of these were 
exposed to S.) It may be observed that no 
A, P, or Q item is uniquely correlated with 
an absolute sentence position. However, 
all A, P, and Q items are uniquely cor- 
related with a particular sentence position 
defined relative to one or the other marker. 


Subjects 


The Ss were 20 fourth-grade children, 11 
boys and 9 girls, aged 9-1 to 10-6. They 
were from middle-class professional back- 
grounds, and of above average intelligence. 


Procedure 


The S was told that he was going to 
learn part of a new language which might 
seem strange because he would not know 
what any of the words meant. The words, 
written on cards, were shown to him, and 
read by E and then by S. The experiment 
consisted of an exposure period, followed by 
a test (Test I), another exposure period, and 
a second test (Test II). The entire pro- 
cedure occupied a single session of 45-55 min. 
duration with each S. 

Exposure.—During the exposure periods S 
was exposed to 29 of the 85 possible sen- 
tences. The exposed sentences used all four 
A phrases, and five of the nine possible PQ 
phrases (gpiqds gpids, 901, gpsds, and gpsds)- 
There were 9 different short sentences (the 
4 fA and the above 5 gPQ sequences), and 
20 different long sentences (10 fAgPQ and 10 
gPQfA). Thus, of the 40 different long 
sentences obtainable by recombining the 4A 
and 5 PQ phrases in either order, 20 were 
exposed and 20 were withheld. Both ex- 
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posure periods used the same 29 sentences 
and were similarly organized. The sen- 
tences were set up in 10 multiples of 7, 5 
short and 2 long (1 fAgPQ, 1 gPOfA) ; in 
every set of 7 sentences each of the 4 A 
and 5 PQ phrases occurred once, either as 
a short sentence, or as part of a long sen- 
tence. Altogether, each exposure period 
presented the same 70 sentences, 20 long (all 
different) and 50 short (many repetitions). 
Each of the 9 A or PQ phrases used occurred 
10 times, sometimes as a short sentence, 
sometimes as part of a long sentence. 

The S first read 14 sentences typed on a 
sheet; if he had difficulty reading, he was 
helped by E, who reread each sentence 
after S. The E then read 14 sentences, and 
S listened and attempted to repeat each 
sentence after E. If S's repetition were cor- 
rect he received a counter; if it were in- 
correct E repeated the sentence again before 
going on to the next one. (Sixteen counters 
were worth one chocolate.) The remaining 
part of the exposure used a match-from- 
sample procedure: a sentence was presented 
and read by S, and then E read two, three, 
or four sentences, which used the same 
phrase types as the sample sentence, the last 
sentence read being the same as the sample 
sentence. The S had to say for each sen- 
tence whether it was the same as, or different 
from, the sample sentence. The S received 
a counter if his judgments were correct, 
which they always were.  Forty-two sen- 
tences, 14 sample sentences and 28 choice 
sentences different from the sample, were 
presented. (The match-from-sample pro- 
cedure was employed only because it pro- 
vided a convenient way of exposing a child 
to a large number of sentences, under con- 
ditions assuring attentive listening and un- 
likely to provoke boredom.) 

Tests—The purpose of the tests was to 
investigate the extent to which Ss had 
learned the pattern properties of the sen- 
tences. Thus, they were designed to find 
out (a) whether Ss had learned the forma- 
tion rule for long sentences—that they were 
composed of two short sentences, one of 
each type, (b) whether S could distinguish 
which word sequences followed f and which 
followed g, and (c) whether S had registered 
the internal structure of the PQ phrase. 
Each test comprised 20 multiple-choice sen- 
tence-completion problems, as follows: (In 
specifying problems below, the dash indicates 
the location of the space to be filled in.) 

1. Four problems, —fA, fA—, LgPQ—, 
—gPQ. The two alternatives offered were 
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always an fA and a gPQ sequence. The A 
and PQ phrases used were always phrases 
to which S had been exposed; however, the 
long sentence resulting from a correct choice 
was one which S had not previously heard 
(and differed in Tests I and II). Thus, 
correct responses presumably indicate that S 
knows how to construct new long sentences 
out of familiar short ones, ie, that he knows 
that a phrase marked by f must be paired 
with a phrase marked by g. 

2. Eight problems, four using short sen- 
tences (f—, f—, g—, g—), and four using 
long sentences (f—gPQ, gPQf—, g—fA, 
fAg—). The short sentences were ones 
used in the exposure, but the long sentences 
were new combinations of familiar phrases. 
The two alternatives offered were always an 
A and a familiar PQ phrase, a different 
phrase being correct in each problem. (The 
phrases correct in the short-sentence prob- 
lems in Test I were correct in the long- 
sentence problems in Test II, and vice 
versa.) These problems test S’s knowledge 
of which sequences follow f and which follow 
g, and whether he is able to make this 
discrimination in both long and short 
sentences. 

3. Four problems, in which the sentences 
lacked a word from the PQ phrase. In Test 
I, the problems were IP, 9—4» fAgp—, 
9—«fA; in Test II they were gps—, I-n, 
9p—fA, fAg—q Three alternatives were 
always offered to complete the sentences: a 
P word (either p; or Ps), a Q word (q or 
q2), and a word from one of the A phrases. 
(Note that a correct Choice always led to 
the construction of a familiar PQ phrase.) 
In one short-sentence problem and one long- 
sentence problem, the A alternative was a 
word that occupied the position in the A 
phrase homologous to the Vacant position in 
the PQ phrase in the problem presented Cie, 
a and a» when a P word was to be filled in, 
bı and bə for a Q word) ; in the other prob- 
lems the A alternative was a nonhomolog- 
ously positioned word, (Note that a tendency 
to respond on the basis of absolute position 
should lead to errors involving a choice of a 
homologously positioned A-phrase item.) 
These and the next set of problems test S"s 
knowledge of the contents of the PO-phrases. 

4. Four problems, similar to Type 3, but 
employing Ps and qs: gps—, g—qs, 9Ps—fA, 
fAg—qa in Test I; gp—, g—as, fAgp.—, 
9—4«fA in Test II. The alternatives were 
again a P word (p: or Pz), a Q word (q or 
q), and a word from one of the A phrases, 
(The PQ phrases correct in the short-sen- 


tence problems in Test I were correct in the 
long-sentence problems of Test II, and vice 
versa) The problems were designed to be 
exactly parallel to Type 3 above, except that 
correct choices led to the construction of the 
four PQ phrases to which S had not been 
exposed (Psd:, psds pigs, psqs). While the 
problems of Type 3 could be solved on the 
basis of rote learning of the PQ phrases, 
Type 4 could presumably be solved only if S 
had learned the internal structure of the PQ 
phrase, i.e., that p; and ps go first after g, and 
q: and qs second. 

The problems were presented in a random 
order in each test. The words (each on a 
separate card) were placed on the ledge of a 
board with the beginning and end of the 
sentence indicated, and with the space to be 
filled in specified by a gap. The alternatives 
were placed one above the other beneath the 
gap, the top-to-bottom placement of the cor- 
rect alternative being chosen randomly. The 
S was told to read over to himself the 
sentences that could be formed by inserting 
each alternative in the space, and to choose 
the alternative that made a sentence that he 
remembered hearing before. The S was not 
informed of the correctness of his responses. 
At the beginning of the test he was told “I’m 
not going to tell you each time if you're 
right or not, but at the end I'll tell you if 
you got most of them right, and if you get 
most of them right, you win the rest of this 
pile of chips" The S was always told he 
got most of them right. 

After Test II, there was an unsys- 
tematically conducted "free-recall" — test, 
whose purpose was to determine if learning 
had progressed to the point that Ss could 
freely generate sentences. The words were 
laid out, and S was invited to try to make 
sentences. He was stopped after he had 
made 10 error-free sentences, or earlier if he 


seemed unable to cope with this part of the 
task. 


REsuLTS 


Table 1 summarizes the results for 
the different types of problems. To 
compare the means with the values ex- 
pected by chance (i.e., 2.0 for the two- 
choice problems, 1.33 for the three- 
choice), a £ test was used, basing the 
error term on whichever was larger, 
the sample standard deviation or the 


theoretical ¢ ( Vnpq) computed from . 
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TABLE 1 
MEANS AND SDs or THE NUMBERS OF PROBLEMS CORRECTLY SOLVED or EACH TYPE 


Types of Problems 
Test 
(1) (2) short (2) long (3) (4) 
I 3.80 (0.52) 3.20 (0.62) | 3.05 (0.89) | 2.60(127) | 2.95 (1.05 
II 3:70 (0.57) | 3:65(0.67) | 305(105) | 340(0.5 | 3.40 (0.80) 


Note.—Four problems per cell. Three multiple-choice items used in Problem Types 3 and 4, two otherwise. 


the binomial expansion. All the means 
were significant (p < .001). (While 
this test assumes independence of trials 
within Ss, to the extent that there is 
nonindependence correlation would pre- 
sumably be positive, and would render 
the test conservative.) The correct re- 
sponses on the problems of Types 1 and 
2 indicate that Ss knew how to form 
long sentences from short ones, and 
were able to recall which sequences 
followed which marker in both short 
and long sentence frames. The prob- 
Jems of Types 3 and 4 tested Ss’ knowl- 
edge of the composition of PQ phrases. 
Since the correct responses on Type 4 
led to the construction of PQ phrases 
not previously experienced, they indi- 
cate that Ss had learned the internal 
structure of the PQ phrase; moreover, 
although Type 3 problems could have 
been solved on a rote-learning basis 
(since the PQ phrases constructed 
were the ones used in the exposure 
phase), it can be seen that these prob- 
lems were not solved more frequently 
than Type 4, ie, new PQ phrases 
appear to have been constructed about 
as readily as familiar ones. 

Detailed comparisons among the 
various means in Table 1 seem inappro- 
priate, because of the skewed distribu- 
tions and heterogeneous variances due 
to differential ceiling effects, and be- 
cause some of the problems are two- 
choice and some three-choice. Com- 
paring Tests I and H globally, the 


total number of correct responses was 
greater on Test II (p < 01, by t test) ; 
inspection of the table indicates that the 
improvement in performance occurred 
primarily on Problem Types 3 and 4. 
In both tests sex differences were small 
and not consistent as to direction on 
different types of problems. 

Half the problems of Types 3 and 4 
were of the form g—Q, in which the 
space was contiguous with the marker 
element, and the other half were of the 
form gP—, where the space was not 
contiguous with the marker. It might 
perhaps be thought that a contingency 
of a word on another word would be 
more readily learned if the items were 
contiguous. Combining Tests I and 
II, the mean numbers of correct re- 
sponses were: 3.35 for Type 3 prob- 
lems of the form g—Q, 2.65 for Type 3 
gP—, 3.30 for Type 4 g—Q, and 3.05 
for Type 4 gP— (four problems in each 
category). Each category was solved 
with significant frequency (p < .001), 
indicating that correct responses oc- 
curred regardless of whether or not the 
item filled in was contiguous with the 
marker. In the Type 3 problems, sig- 
nificantly more problems of the form 
g were solved than of the form 
gP— (by t test, p<.01). However, 
in these problems, correct responses 
could reflect learned contingencies with 
the other P or Q words as well as with 
the marker. In the Type 4 problems, 
where correct responses can reflect a 
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contingency only on the marker, the 
difference was not significant (t 
= 1.16). These results do not suggest 
any great difference in the facility with 
which the positional definitions “second 
after g? and “first after g” are 
acquired. 

A. further analysis of the Problems 
3 and 4 was made to see if errors on 
these problems showed any tendency 
for an A item to be chosen which oc- 
cupied a position in the A phrase 
homologous to the vacant position in 
the PO phrase. For the eight 3 and 4 
problems in which such a choice was 
possible 13 Ss selected the A alterna- 
tive on more problems than the wrong 
PQ phrase item and 4 Ss selected the 
wrong PQ phrase item more often (p 
« .05, by sign test, omitting the 3 S's 
who made both types of errors equally 
often). On these problems the A alter- 
native was actually selected on an 
average of 1.6 problems (as against .65 
problems for the wrong PQ phrase 
item, and 5.75 problems answered cor- 
rectly). In the other eight 3 and 4 
problems, in which the A alternative 
was not homologously positioned, the 
A item was chosen only as frequently 
as the wrong P-phrase item (.7 prob- 
lems each for the A- and the wrong 
PQ-phrase items, and 6.6 problems 
answered correctly). Comparing the 
frequency of choice of a wrong A- 
phrase item on the eight problems on 
which it was homologously positioned 
with the frequency on the eight prob- 
lems in which it was not homologously 
positioned (ie., 1.6 vs. 0.7), a homol- 
ogously positioned A item was chosen 
on more problems (5 < 01, by f test). 
Thus, although errors were few, the 
favorite error was an item that oc- 
cupied the same position in the other 
phrase. 

In the “free recall,” only two Ss 
made 10 correct sentences, and two 
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others made 6 or more without making 
errors; it thus appears that the learn- 
ing had not progressed to the point that 
Ss could freely generate sentences, 


Discussion 


The sentence-completion test results in- 
dicate substantial learning of the pattern 
properties of the exposed strings, includ- 
ing, in particular, the internal structure 
of the PQ phrase. The only basis for 
solving the Type 4 problems would ap- 
pear to be a learning of the positions of 
p, and p, (“first after g") and q, and q, 
("second after g"). An explanation of 
the responses on these problems in terms 
of mediation theory (Jenkins & Palermo, 
1964) would appear ruled out because 
neither p, nor q, were ever exposed in. 
combination with any of p,, Pa q,, or qs 
so that some of the stages required in the 
stimulus-equivalence paradigm were not 
present (cf. Smith, 1965). Also a media- 
tional explanation of the fact that the 
favorite error was a homologously posi- 
tioned A-phrase item would obviously be 
very difficult to formulate. In general, 
therefore, this experiment provides evi- 
dence that Ss can learn positional rela- 
tionships where position is defined rela- 
tively to a marker morpheme. The 
conditionality of the position learning on 
the marker may well be due to S simul- 
taneously learning (a) the absolute posi- 
tion of the word in the phrase, and (b) 
contingencies between the words and the 
respective marker morphemes. The fact 
that the favorite error was a homol- 
ogously positioned A item would then be 
due to somewhat better learning of a 
than b. 

The total time occupied by the two ex- 
posure periods together was of the order. 
of 20-25 min. Although learning had not 
progressed so far as to allow free genera- 
tion of sentences, nevertheless the Test IT 
results indicate that a very substantial 
degree of learning had taken place dur- 
ing this relatively brief period. It ap- 
pears that 5s are not merely able to learn 
fhe kinds of grammatical relations in- 
volved but learn them rather rapidly; 


ASSOCIATION AS A CONTEXT-SENSITIVE RULE 


moreover, information feedback contrast- 
ing grammatical with ungrammatical 
strings is clearly not necessary for 
learning. 

Grammatical representation of what ts 
learned —Chomsky (1963) has described 
a hierarchy of types of grammars, which 
provide possible models of the kinds of 
regularities learned in experiments such 
as this. The simplest of these, a finite 
state grammar, is a grammar all of whose 
rules are of the form A — +B or A+, 
where capital letters designate word or 
phrase classes (ie belong to the non- 
terminal vocabulary), and lower-case let- 
ters designate words (ie. belong to the 
terminal vocabulary). Alternatively, and 
equivalently, the term refers to a state 
diagram (SD) of the usual type with 
no push-down storage annotations. (See 
Chomsky [1963] for details.) 

In comparing possible grammars with 
experimental data, it is important to 
distinguish sharply between a grammar 
(a set of rules), and a language (a set of 
sentences): a given set of sentences may 
have several grammars, although, pre- 
sumably, only one grammar will be “cor- 
rect,” in the sense that its rules represent 
the regularities actually learned by Ss. 
In the present experiment the sentences 
are a finite set. One possible grammar is 
therefore a simple list of the 85 strings. 
However, it is clear that Ss did not learn 
the sentences as a list of 85 different 
items, so such a grammar would not 
represent what was learned. 

More generally, the regularities learned 
in this experiment, like those learned in 
a previous experiment (Braine, 1965b), 
do not appear to be describable in terms 
of the rules of a finite state grammar. 
The test results indicate that Ss learned 
that the components of the long sentences 
were the short sentences; thus a mini- 
mally adequate grammatical representa- 
tion of the regularities learned must treat 
an fA or a gPQ sequence as being the 
same phrase (ie, as having the same 
“P marker" or sequence of P markers), 
regardless of whether it occurred alone 
or as a part of a long sentence. Any 
finite state grammar with this property, 
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and which generated the correct sequences 
fA, gPQ, fAgPQ, gPQfA, would neces- 
sarily incorporate a recursive rule 
which would generate also fAgPQfA, 
ÍAgPOfAgPQ, fAgPOJAgPOfA, . . » 
and gPQfAgPQ, gPOfAgPOfA, 

(note that this recursion is obtained if 
the push-down store is eliminated in 
SD-A in Fig. 1). No S in the free 
recall generated a string longer than six 
words, or any string which looked as if 
it might be an attempt to construct a 
sequence longer than two phrases; it was 
apparent that Ss had learned that the 
sentences were not more than two phrases 
long—one would surmise that this was 
one of the earliest properties they regis- 
tered. Thus, a grammar which rendered 
the regularities learned would have to 
arrange that not more than one traverse 
through each phrase occurred. SD-A in 
Fig. 1 shows a device which appears to 
be a reasonable model of the relationships 
learned in all respects except one. It 
differs from a finite state diagram only 
in incorporating a small push-down 
store (PDS) which is used to control the 
entry into each phrase path, so that there 
is not more than one traverse through 
each. SD-A appears to be the minimally 
complex grammatical device (ie, mini- 
mally more complex than a finite state 
diagram) which renders the principal 
regularities Ss learned. 

The one aspect of the data which is 
not accounted for by SD-A is the finding 
that the favorite error in Problem Types 
3 and 4 was a homologously positioned 
A item, This sort of error indicates a 
relation between the corresponding posi- 
tions in the A and PQ phrases which has 
no counterpart in SD-A. SD-B in Fig. 
1 shows a model which predicts this kind 
of error. In this model the A and PQ 
phrases are both generated through a 
single sequence of states, with homolo- 
gously positioned items generated at the 
same state transitions. The contingencies 
among the words and the markers, which 
establish which word sequences belong to 
each phrase type, are indicated by means 
of the PDS entries. In effect, this 
model, relative to “A,” achieves an 
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SD-A 


Fic. 1, Two possible PDS state diagrams rendering the relations learned. 


Sp-8 


(Strings are 


"generated" by movement from IN to OUT via the arrows joining states, cf. Chomsky, 1963, 


pp. 339—345. 
Si and S;. 


A rule [x, Ss, y] — [S;, 2] is diagrammed as the triple [z, y, z] between 
The first entry in a triple is the word that is generated at the transition, the 


second entry is the item which must be on top of the PDS for the transition to take place, 


and the third entry is the operation carried out on the PDS. 


Thus [x, y, s] means 


"generate x if y is the most recently stored item in the PDS, and store z on top of y." 
e is the identity element: as first entry it means that nothing is generated, as second, that 
the operation occurs regardless of what is in the PDS, and as third, that the PDS is left 


unchanged. 


g means that the top PDS item is erased—bringing the item beneath it to the 


top. S-PQ/PQ means that PQ in the PDS is erased and replaced by S-PQ; similarly 
fa/f means that f in the PDS is replaced by fai.) 


economy of states by exploiting its PDS 
to record features of structure. The 
positions are represented by the transi- 
tions between states, and the contingencies 
between words by PDS entries In 
Model “B,” any transitory forgetting of 
the PDS entries, or any incompleteness 
‘of learning of the connection between the 
PDS entries and the words they control, 
would lead to confusion as to which of 
the words occurring at a state transition 
should be chosen; confusion of this sort 
should lead, in Problems 3 and 4, to a fre- 
quent choice of the word.occurring in 
the homologous position of the other 
phrase, 

It is of considerable interest that if one 
tries to represent the relations learned in 
a grammatical notation (i.e., using a se- 
quence of rewrite rules [Chomsky, 
1963]), rather than by state diagrams, 
the only apparent way of capturing the 
difference between SD-A and SD-B is 
by rendering SD-A by context-free (CF) 
and SD-B by context-sensitive (CS) 


rules. Thus, a CF grammar might con- 
tain the rules: A phrase — fA; A — a,b, 
a,b, a, a,; P phrase — gPQ; P— p, 
D» D; Q > d, dos qs- Like SD-A, such 
a grammar provides no basis for predict- 
ing the kinds of errors made. A CS 
grammar might contain rules such as A 
phrase > fX; P phrase > 9X; X ay, 
a, in context f—; X > YZ; Y—a,, a, 
in context f—Z; Y — P in context g—Z; 
Z — b, in context fa, —; Z— b, in con- 
text fa, —; Z— Q in context gP—; P 
> Py P» P Q—q, day dy In this 
grammar, the contingencies among the 
items are represented in the contexts to 
which the rules are sensitive; as in SD-B, 
any temporary forgetting or incomplete 
learning of the contextual sensitivities 
leads to a merger of a, and a, with P, 
and of b, and b, with Q, and thus to the 
errors found. 

Parenthetically, it seems to the writer 
that the apparent correspondence of the 
CF grammar to SD-A, and of the CS 
grammar to SD-B, raises the question of 
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whether some strong equivalence 
theorem may not be provable between 
CF and CS grammars, and classes of 
automata differentiated according to the 
purpose to which their short-term stores 
are put. There appear to be two kinds 
of uses for a short-term store: to record 
each entry into a phrase path (with 
erasure of the record on exit), thus keep- 
ing a count on successive embeddings in 
nested constructions; and to indicate 
contingencies between elements of a 
string, as in SD-B. If the difference 
between these uses were made precise, 
a strong equivalence might be shown be- 
tween CF grammars and PDS SDs which 
use their PDS only in the first manner. 
(Note that in Chomsky's [1963, pp. 371- 
373] construction converting a normal 
CF grammar into a PDS SD that is 
weakly equivalent, the PDS is used only 
to mark the beginnings and ends of 
phrases.) Similarly, CS rules seem to 
correspond to the second use of a short- 
term store; such a store would be *linear- 
bounded" (Chomsky, 1963, p. 338) rather 
than “push-down” when the contingencies 
in the CS rules straddle phrase bound- 
aries. 

These speculative correspondences be- 
tween rewrite grammars and SDs aside, 
the difference between the A and B 
models above would appear to have inter- 
esting implications for the concept of an 
association in traditional S-R theory. 
The finite state model has been widely 
accepted as a model of “chains” of asso- 
ciations (e.g, Miller, 1951; Osgood & 
Sebeok, 1954, especially Sec. 5): in this 
model an association is represented by 
assigning the associated items to succes- 
sive transitions in a sequence of states of 
an SD. Although SD-A above does go 
beyond a finite state system, it still repre- 
sents the associations (ie. the contin- 
gencies) among the words of each phrase 
in the manner of a finite state system. 
SD-B, however, represents associations 
in an entirely different manner: an asso- 
ciation is a relation between an item and 
an entry in a short-term memory store. 
More precisely, an association between 
two items, a and b, is represented by a 
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pair of rules (a, Sj x) > (Sus f) and 
(b, S, f) > (Sy; y) where S, is a state 
subsequent to S, in a left-to-right pass 
through the diagram, and f may be 
reasonably identified with either a or b. 
(x and y are irrelevant, and if a and b 
are associated only with each other, then 
x-e and y=o.) Alternatively, and 
more speculatively (cf. preceding para- 
graph), am association might be viewed 
as a context-sensitive rule. These models 
would provide a much more powerful and 
flexible concept of an association than the 
finite state model. 

Although the conclusion that at least a 
PDS model of associations is required 
is based on a relatively subtle feature of 
the data in this experiment, the conclu- 
sion agrees closely with the results of a 
previous experiment (Braine, 1965b). 
In that experiment, Ss were exposed to 
a set of sentences, half of which were of 
the form aXb and half of the form pXq, 
where a, b, p, q were individual words 
and X was a word from a list of 18 items. 
Most Ss readily learned the a—b and p—q 
contingencies, and the X items were 
learned, not as two lists, one associated 
with a... b, and the other with p... q 
but as a single list which went second, or 
“in the middle." The regularities learned 
cannot be adequately represented by a 
finite state grammar, for several reasons 
(Braine, 1965a, 1965b; Gough & Segal, 
1965), At least a phrase structure model 
is required, or a PDS SD in which the 
a—b and p—q associations are repre- 
sented in the manner outlined above. 
Both this experiment and the present one 
have in common the fact that they in- 
volved the formation of “remote” associa- 
tions between items whose positions were 
also learned. Remote associations are, 
by definition, associations which straddle 
intervening material. It seems intui- 
tively clear that such associations are 
poorly translated in finite state diagrams 
where nothing corresponds to the “strad- 
dling” property. It may be that the 
tacit acceptance of the finite state model 
as adequate for representing learned as- 
sociations is due only to the fact that 
the actual properties of remote associa- 
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tions have been relatively little explored 
in the verbal learning literature. 
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SOME EFFECTS OF PARTIAL ADVANCE INFORMATION 
ON CHOICE REACTION WITH FIXED OR 
VARIABLE S-R MAPPING 
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2 experiments are reported on 2-choice reaction with variable S-R 
mapping. In the task a signal, I, designates a mapping relation 
between signal, M and response, R. Both I and M can be random 
variables in a trial sequence. In some conditions a value of I or of M 
or a neutral light was given in advance, with either ł- or 4-sec. fore- 


period. 


alerting and of the transition from the previous trial. 


Response time was examined as a function of the advance 


Both were sig- 


nificant variables and there was an interaction between them indicating 


distinct phases in the choice process. 


It was also shown that with a 


neutral advance signal there is an optimal foreperiod in the interval 
0-4 sec. with fixed mapping but not with variable mapping. 


An earlier study (Shaffer, 1965) 
examined performance in a choice- 
reaction task in which the S-R map- 
ping relation was a random variable 
over the sequence of discrete trials. 
The mapping relation in each trial was 
designated by an extra symbol in the 
display. Thus the signals (M) were 
two lights horizontally arranged and 
a vertical or horizontal line midway 
between them designated contra- or 
homolateral mapping relations (I), re- 
spectively. The S responded by press- 
ing one of two horizontally arranged 
buttons (R). Particular attention was 
given to response time (RT) asa func- 
tion of what happened on the previous 
trial. Ina transition I, M, or R could 
be the same (s) or different (d) and 
if we refer to an ordered triple IMR 
then four transition categories could be 
distinguished, sss, sdd, dsd, and dds. 
It was shown that RT in these cate- 
gories could be ordered sss < sdd < dsd 
< dds and that if a display component, 


1 Acknowledgment is made to the Medical 
Research Council’s Naval Personnel Re- 
search Committee for research facilities. 

Now in the Department of Psychology, 
University of Exeter. 
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I or M, was given 4 sec, in advance 
of the other then there was an overall 
reduction in RT but no change in the 
pattern of transition effects, 

It is convenient to restate the task as 
a sequence of computations, I (M) =R, 
in which a transformation, I is applied 
to a datum, M to produce an outcome, 
R. If I and M are two-valued vari- 
ables there is a set of four possible 
computations. As soon as either I or 
M is given a value this logically re- 
duces the possibilities to a subset of 
two. If the previous computation has 
the role of transferring information 
about I and M then this role should be 
preempted for one of them if it is given 
in advance. Thus, either there should 
be an interaction between advance com- 
ponent and transition, or the previous 
computation transfers information rele- 
vant to a later stage of the computation 
process. Accordingly, the first experi- 
ment replicates the earlier one, but uses 
a different design and extends the 
range of foreperiod to provide a more 
sensitive test of interaction. 

Tn addition, the earlier result showed 
that a neutral advance signal, i.e., a 
light giving no information about I or 
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M, also led to a reduction in RT, as 
have results in simple RT (Klemmer, 
1956) and choice RT (Bertelson & 
Boons, 1960). It is interesting to find 
out whether the amount of reduction 
varies with size of the set of computa- 
tions. Thus the second experiment to 
be presented examines whether the 
foreperiod is a significant variable if 
the advance signal is neutral and 
whether it affects differentially trial 
sequences with fixed or variable 
mappings? 


METHOD 


Apparatus.—The display and response units 
have already been described (see also the 
earlier paper). A punch-tape reader deter- 
mined the sequence of stimuli. A key oper- 
ated by E brought on the display and simul- 
taneously started a Dekatron timer. An 
electronic delay line could introduce a vari- 
able delay between onset of the two display 
components and, when used, the Dekatron 
timer was triggered with onset of the second 
component. 

Design and procedure—A group of 14 Ss 
was used in Exp. I and a different group of 
18 Ss in Exp. II. 

In Exp. I Ss received each of seven test 
conditions in randomly permuted orders using 
a Latin square. The conditions are shown 
in Fig. 1, The notation is: M was a light 
on the left or on the right; I was a hori- 
zontal, H, or vertical, V, line designating a 
homo- or contralateral mapping of lights into 
response keys; -— IM was a condition in 
which an alerting light (the intersection of 


o 1 * sec. 
+ IM 
M-I 
IM 


Fig. 1. Conditions of Exp. I. 
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-v V 


Fic. 2. Conditions of Exp. II. 


H and V) preceded the onset of I and Mj 
M-I was a condition in which the onset. 
of M preceded the onset of I, and I — M the 
converse of this. Clearly, with zero advance 
interval -— IM, M — I, and I — M are not 
distinguished. j 
In Exp. II Ss received each of nine 
conditions in randomly permuted ordei 
again using a Latin square. The conditi 
are shown in Fig. 2, H was a condition 
which a homolateral mapping was alwa 
used and the horizontal line appeared in 
display; similarly in a V condition a 
tralateral mapping was always used. 
For each condition a different sequem 
of 81 trials was used. If both I and M wet 
variable each of the 4 possible displays O 
curred 20 times in random sequence 
each of the 16 possible first-order transiti 
occurred 5 times. If only M was variab 
each of 2 possible displays occurred 40 tim 
in random sequence and each of the 4 possi 
first-order transitions occurred 20 times. 
There was a delay of about 7 sec. betw 
trials. 
On the first day in which each 5 came 
the experiment he was given instruction 
three practice sequences involving variabl 
mapping and shown the two types of fi 
mapping sequence as special cases. 
Instruction.—It was explained that 
plane containing the two lights could be 
tated about a vertical axis midway bet 
them or about a horizontal axis passi 
through them. The task was to press th 
button corresponding to the position of the 
light if the appropriate rotation were carriet 
out, The Ss were told to use their inde? 
fingers to press the buttons and to resp 
as quickly as possible without making erro 
Subjects—The 32 Ss were males in the 
age range 18-30. 
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RESULTS 


Experiment I—Errors were made 
on about 1% of all trials and no analy- 
sis of these scores was carried out (but 
see the earlier paper for analysis and 
discussion of these). 

Mean RT in each of the conditions 
and for each transition category is 
shown in Fig. 3. Three points are of 
interest: (a) there is a clearcut effect 
of advance component, (b) there is a 
consistent but minor effect of fore- 
period, and (c) there is some evidence 
of an interaction between advance com- 
ponent and transition such that RT is 
reduced in a transition in which the 
advance component repeats itself. 

An analysis of variance in the three 
:— IM conditions distinguished fore- 
period, A (0, 4, 3 sec.), transition, T 
(sss, sdd, dsd, dds), and Ss. The 
foreperiod effect was not significant, 
F (2, 26) = 1.3, p > 05. The transi- 
tion effect was significant, F (3, 39) 


“75 
— IM cif O sec. 


mane Mel D Le 


IM WO lose. 


RT IN SEC. 


TRANSITION CATEGORIES 


Fic. 3. RT as a function of advance compo- 
nent, foreperiod, and transition category. 
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FOREPERIOD IN SEC. 


Fic. 4. RT as a function of foreperiod 
and mapping relation. 


= 34, p<.001. The interaction 
AXT was significant, F (6, 78) 
= 2.25, p < 05. The graphs in Fig. 3 . 
do not suggest an empirical significance 
for this interaction. 

An analysis of variance on RT in all 
conditions except the one with no ad- 
vance signal distinguished foreperiod, 
A, advance component, C (*, M, I), 
transition, T, and Ss. The foreperiod 
effect was not significant, F (1, 13) 
22.5, p.05. The advance compo- 
nent effect was significant, F (2, 26) 
= 132, p < 001. The transition effect 
was significant, F (3, 39) = 31, p 
<.001. Only one interaction, GE, 
was significant, F (6, 78) = 3.5, p 
« 01. This interaction may be asso- 
ciated with that inferred from Fig. 3. 

In each condition a t test was carried 
out on the difference in RT between 
trials involving H or V transforma- 
tions. This was significant only for the 
IM conditions: when A = 1 sec., 
t (13) = 6.0, p < .001 and when A 
= 4 sec., t (13) = 40, p < 01. 

Experiment II —Mean RT in each 
of the conditions is shown in Fig. 4. 
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There is a slight, monotonic decrease of 
RT as foreperiod increases with vari- 
able mapping, but with fixed mapping 
the graphs suggest that RT has a 
minimum value for some foreperiod in 
the interval 0-3 sec. Tt is also clear 
that RT is much less with fixed than 
with variable mapping and, in the 
former case, RT is less with homo- 
lateral than with contralateral mapping. 
An analysis of variance on RT in 
the six fixed-mapping conditions dis- 
tinguished mapping class, K (H, V), 
foreperiod, A, transition, T (sss, sdd), 
and Ss. The effect of mapping class 
was significant, F (1, 17) = 369, p 
< 001. The foreperiod effect was sig- 
nificant, F (2, 34) = 180, p < .001. 
The transition effect was not signifi- 
cant, F (1, 17) <1. There were no 
significant interactions. 
An analysis of variance on the three 
,Variable-mapping conditions distin- 
guished foreperiod, A, transition, T, 
and Ss. The foreperiod effect was 
not significant, F (2, 34) = 30, p 
>.05. The transition effect was sig- 
nificant, F (3, 51) = 407, p < .001. 
There were no significant interactions. 


Discussion 


The results of Exp. I confirmed those 
of the earlier experiment which showed 
that transition and advance component 
are significant variables effecting RT, but 
failed to confirm that foreperiod is a sig- 
nificant variable with neutral advance 
alerting, at least in the interval 0—3 sec. 
This discrepancy may be attributable to 
the different designs in the two experi- 
ments. In the earlier study different 
groups were employed in each condition, 
in the present study the same Ss received 
each condition once, with order random- 
ized within the Sroup. In this case a 
differential response to practice among 
Ss can enlarge the error variance to the 
point of concealing real difference be- 
tween conditions. Similarly, with I or 
M given in advance, the tendency for RT 
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to be less at 4 sec. than at } sec. fore- 
period failed to be statistically significant. 

The main point of Exp. I was to pro- 
vide a test between two hypotheses, that 
information contained in a transition as- 
sists either the preparation of the con- 
stants, I and M, or the completion of the 
computation, I(M) — R. It is difficult 
to obtain an outright rejection of the 
latter, but the former is tenable only if 
the difference in RT, between transitions 
in which I (or M) changes or is invari- 
ant, is reduced when I (or M) is given 
in advance. Although an interaction be- 
tween advance component and transition 
was in fact observed it is not compatible 
with the first hypothesis. Instead, what 
Fig. 3 indicates is that RT was reduced 
if the advance component repeated itself 
in a transition. This is consistent with 
the second hypothesis and suggests that 
if alerted to the commonality between 
computations S can complete the present 
one by analogy with the previous one. 

The major result of Exp. IT is that 
there is an optimal foreperiod for an 
alerting signal with fixed mapping and 
that if there is a corresponding optimum 
with variable mapping it must have a 
longer value. The result cannot indicate 
whether the discrepancy occurs because 
the latter task involves four possible com- 
putations instead of two, or because I is 
variable instead of fixed, 

The effect upon RT of giving an ad- 
vance warning signal at a more or less 
Specific time has been well documented 
(Bertelson & Boons, 1960; Karlin, 1959; 
Klemmer, 1956, 1957). It is usually 
ascribed to a state of preparation which 
must be achieved in each trial and can be 
transiently maintained. The next con- 
structive step is to try to specify what is 
prepared. 

Let us assume a process of computation 
that requires, in the present task, the con- 
struction of a canonic representation of 
the form I(M) — M'. Given adequate 
warning S can prepare a generic form, 
*(*) =*, appropriate to the set of com- 
putations; given I or M in advance he 
can prepare I (*)—* or *(M)—* 
Each of these incomplete forms can be 
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regarded as a neural analogue of a 
more or less general instruction, “M is 
to be reflected about I.” Given both I 
and M S can complete the computation 
and what the results indicate is that in- 
formation from the previous computation 
can assist the completion. 

It is still uncertain whether a generic 
form can be usefully prepared in the 
variable-mapping task, but there is little 
doubt in the case of the more specific 
forms. Also, the form which specifies 
I is more useful than that which specifies 
M, although both reduce computation un- 
certainty equally. This implies that S 
can more readily complete a computation 
when he knows which operation he has 
to perform than when he knows which 
datum he has to operate upon. 
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CS REDUNDANCY AND SECONDARY PUNISHMENT 
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University of 

* $ . * 
Rats received conditioning. involving 2 CSs and electric shock. 2 
groups received the sequence: Si, Ss, shock on all trials. 
groups, S, was an informative predictor of stiock, ‘while S; was 
2 other groups received, in addition} S; alone on some 
For these groups, Si was unreliable and Ss was reliable. A 
control group received the CSs and shock unpaired. When Ss sub- 
sequently received these CSs contingent {upon bar pressing for food, all 
experimental CSs were stronger “suppressors than ‘the ‘control CSs. 
The informative CS: was stronger than the redundant CS and the 
Implications of these findings for the generalizability 

of the Egger and Miller information hypotheses are discussed. 


redundant. 
trials. 


unreliable CS. 


* 


Egger and Miller (1962, 1963) pro- 
posed two hypotheses concerning the 
relationship between the potency of re- 
inforcing stimuli and their information 
value. Their "strong" hypothesis states 
that an informationally redundant pre- 


dictor of primary reinforcement should ` 


“not acquire secondary reinforcement 
strength. Their “weak” hypothesis 
states that when there is more than one 
stimulus predicting primary reinforce- 
ment, the more informative (the less re- 
dundant) stimulus should be the more 
effective secondary reinforcer, They 
tested these hypotheses for secondary 
positive reinforcement imposed on an 
appetitive base line as follows: Ss were 
conditioned with two CSs and food pel- 
lets. In Group A, the first stimulus 
(Si) was an informative predictor of 
food. Since it always preceded and 
overlapped the second stimulus (S; 
S, was a redundant predictor of food. 
In Group B, S, occurred on half the 
trials followed by S, and food, but on 


1 This study was supported by United 
States Public Health Service Grant MH- 
04202 to Richard L, Solomon. The author 
wishes to thank Richard Solomon, Robert 
Rescorla, Bruce Overmier, and Vincent 
LoLordo for their aid and advice. 
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Pennsylvania 


For these 


the other half of the trials, S, occurri 
alone not followed by food. Thus bee 
cause S, was unfeliable, S,, as the only 
reliable predictor of food, was info 
tive. In a "pseudoconditioning" c 
trol group, the two. CSs and food wer 
presented, but were explicitly unpaire 
Egger and: Miller found that S, (ri 
dundant) in Group A was not a mo 
effective reinforcer of the relearning 
a bar press which had previously pi 
duced food than were the stimuli in 
control group. This confirmed 
strong hypothesis. Furthermore, th 
found that in Group A, S, (info 
tive) was more reinforcing than S, 
that S, (reliable and informative) 
Group B was more reinforcing 
S, in Group A, and that S, in Group 
was more reinforcing than S, (unre 
able) in Group B. These outcomes 


This study tests whether or not a re- 
dundant predictor of shock will acqui e 
Secondary punishing value, and 
whether, when there is more than one 
stimulus predicting the onset of sho 
the more informative stimulus will be 
the more effective secondary punishe 
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METHOD 
Subjects 


The Ss were 40 male hooded rats obtained 
from K-G Farms, Parsippany, N: J., 120-150 
days old at the beginning of deprivation. 


For approximately 1 wk. preceding training ` 


and testing, and then throughout, thé experi- 


ment, Ss were given frée actéss to Purina: 


lab chow pellets for 45 min. daily in their 
individual home cages. This feeding” period 
occurred immediately after each experimental 
session. The Ss were fun daily 223 hr./after 


their last access to food.. There were 8 Ss. 


in each of the five groups." . w- * 


Apparatus AP 

4 4 
The apparatus consisted of two identical 
Skinner boxes, one of which was used for VI 
training and the other for fear conditioning 
and testing. The inside dimensions of the 
boxes were 113 in. long, 93 in. wide, and 
73 in. high. The sides, were aluminum, 
while the door and the back were Plexiglas. 
The floors consisted of 18 aluminum rods, 


3 in. in diameter running,parallel to the sides.” 


Shock was delivered to the-rod floor from a 
constant current source of 550 v. ac. In 
series with S was 2.01 megohms resistance. 
Grid polarity was scrambled four times per 
sec. Duration of shock was 0.5 sec. Meas- 
ured across a standard resistor placed across 
two grids which were always of opposite 
polarity, the current was 0.21 ma. 

The boxes were contained in light-proof, 
sound-attenuating ice chests and were ven- 
tilated by fans which provided 88 db. (SPL) 
masking noise. The food magazine was in 
the center of a side wall, 3 in. above the floor. 
A bar was located 14 in. above the top of the 
food magazine. The bar was 14 in. wide and 
protruded 2 in. from the side wall into the 
chamber. A press of the bar delivered a 45- 
mg. Noyes pellet to the magazine on the 
appropriate schedule. 

Two inches directly above the center of the 
bar was an opening covered with white 
frosted glass, behind which a light was 
located. The light was always off, except 
when used as a stimulus flashing 10 times 
per sec. The other stimulus was a white 
noise which when added to the masking noise 
produced a total of 89 db. (SPL). This 
noise was delivered through a loudspeaker 
1 in. below and 2 in. to the left of the food 


magazine. 
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Fic. 1. Conditioning paradigm of the ex- 
perimental groups (1, 2, 3, 4) and the control 
group (5). (In Groups 1 and 2, Sı is an 
informative and Sa a redundant predictor of 
shock. In Groups 3 and 4, since Event A is 
randomly interspersed with Event B, S; is an 
unreliable and Ss a reliable predictor of 
shock. In Group 5, Events C and D are in- 
terspersed in such a way that S; and Ss are 
never explicitly paired with shock.) 


Procedure 


Magazine and CRF training.—Each S was 
given magazine training and CRF training 
simultaneously, until S had pressed the bar 
80 times for pellets in a 2-hr. session. 

VI training—Following magazine and 
CRF training, Ss were given three daily 40- 
min, sessions of pressing for pellets on a 
VI-1 schedule. The VI tape was generated 
in accordance with the procedure outlined 
by Fleshler and Hoffman (1962). 

Fear conditioning.—On the day following 
the third VI session, each S received one 
fear-conditioning session, For the first 20 
min. of the session, the bar was retracted and 
stimuli and shock were presented in ac- 
cordance with the experimental design in 
Fig. 1. This design parallels the informa- 
tional paradigm of Egger and Miller (1962) ; 
where Egger and Miller used food pellets 
as primary reinforcement in their condition- 
ing sessions, electric shock was used. 

For Ss in Groups 1 and 2, 10 pairings of 
stimuli and shock were presented on the 
average of once every 2 min. The range 
of separation of events was from 56 sec. to 
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TABLE 1 


Loc PERCENT SUPPRESSION: MEAN LOGARITHM OF 100 X THE Ratio OF No. or BAR 
PRESSES FOR THE First 5 MIN. FOLLOWING PUNISHMENT TO THE No. 
OF PRESSES FOR THE 5 MIN. BEFORE PUNISHMENT INITIATION 


MARTIN E. P. SELIGMAN 


Groups 
1 24 3 4 5 
Test Stimulus S Ss Sı ED 
5 xu: informative redundant unreliable reliable & 
informative 
M 0.63 1.30 141 1.51 


Note.—A score of 2.00 indicates that the number of presses in the first 5 min. punished was equal to the number: x 
of presses in the preceding 5 min; a score of 0 indicates complete suppression. 


214 sec. The duration of S: was 2.0 sec. 
One-half second following the onset of Sı, S; 
came on and stayed on for 1.5 sec. One 
second following the onset of Ss, shock was 
delivered for 0.5 sec. The two stimuli and 
the shock terminated together. Thus in 
Groups 1 and 2, although S: was a reliable 
predictor of shock, it was redundant, since it 
was always preceded by Sı. 

For Ss in Groups 3 and 4, 20 events oc- 
curred on the average of once every minute, 
the range of separation being from 4 sec. 
to 188 sec. Ten occurrences of the two 
stimuli paired with shock were randomly 
interspersed with 10 presentations of S; alone, 
"Temporal relations of stimuli and shock were 
as described for Groups 1 and 2. Thus for 
Groups 3 and 4, since S; was an unreliable 
predictor of shock, S; as the only reliable 
predictor, was informative. 

In Group 5, 20 events occurred on the same 
schedule as for Groups 3 and 4. Ten oc- 
currences of the two stimuli together were 
interspersed with 10 occurrences of shock 
in such a way that stimuli and shock were 
never explicitly paired. Duration of stimuli 
and shock and the temporal relation between 
the stimuli were as described for Groups 1 
through 4, 

For each S in every group, one stimulus 
was the flashing light and the other the white 
noise. Half the Ss received light as S; and 
noise as Ss; for the other half this was 
reversed, 

: Following the 20 min. of Pavlovian condi- 
tioning, the bar was replaced and Ss were 
allowed to press on the VI-1 for the remain- 
ing 20 min. 

: Testing.—On the day following fear condi- 
tioning, Ss were placed in the box and al- 
lowed to press on the VI-1 for pellets. After 
15 min. had elapsed following Ss’ first 


presses, stimuli were presented for 1.5 sec, 

contingent upon each bar press. In Group 1, t 
Ss received S; (informative). In Group 2, - 
Ss received Sa (redundant), In Group 3, Ss — 
received S, (unreliable). In Group 4, Ss 
received S; (reliable and informative). In 
Group 5, half the Ss received S; and half the 
Ss received Sa. If S pressed the bar while 
a stimulus was still being presented, it was 
not prolonged. The VI-1 schedule for food 
reinforcement remained in effect throughout 
the session. The secondary punishment con- 
tingency remained in effect for 25 min. 


Response Measures 


The amount of suppression of bar pressing 
produced by a stimulus was used as an index 
of its punishing power. During the 5 min. 
immediately before initiation of punishment 
and during the 5 min. immediately following 
initiation of punishment, minute-by-minute 
Pressing was recorded. At other times press- 
ing was recorded at 5-min. intervals. 


RESULTS 


Data were transformed to log per- 
cent suppression, a transformation. 
which resulted in no evidence of hetero- 
geneity of variance, F,,, (10, 3) 
—203, p.05. The percentage of 
Suppression scores were based on the 
number of bar presses for the first 5 
min. following punishment initiation. 
and the number of bar presses for the 
5 min. immediately preceding punish- 
ment initiation. These results ar 
presented in Table 1. The nontrans- 
formed data as well as all common sta- 
tistics of the data and the data for the 
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entire punishment period yield sub- 
stantially the same results. 

The confirmation of the hypotheses 
depended not upon the significance of 
the main effects of the analysis of vari- 
ance summarized in Table 2, but rather 
upon comparisons made between 
the cells of Table 1. The analysis 
of variance reveals that the main 
effect of groups was significant, F (4, 
30) = 6.90, p < .01. Also there was 
some tendency for Ss to be suppressed 
more by white noise than by flashing 
lights, F (1, 30) = 3.92, p < .10. 

Strong hypothesis—Group 2 vs. 
Group 5: The strong hypothesis pre- 
dicts that the redundant stimulus pre- 
sented to Group 2 should not differ in 
its secondary punishing power from 
the stimuli presented to Group 5, the 
nonassociative control group. Com- 
parisons based on the means in Table 1, 
however, reveal that Group 2 was sig- 
nificantly more suppressed than Group 
5, t (30) = 248, p < .01, one-tailed. 

The four experimental groups were 
suppressed significantly more than the 
nonassociative control group, indicating 
that fear conditioning had occurred in 
the experimental groups ( Mann-Whit- 
ney U test: 2 = 4.65, p < .001. Fur- 
thermore, suppression was reliably 
greater for each of Groups 1 through 4 
than for Group 5. 

Weak hypothesis—The weak hy- 
pothesis predicts that S, (informative) 
presented to Group 1 should be more 
punishing than S, (redundant) pre- 
sented to Group 2. This prediction is 
confirmed, t (30) = 2.55, p < -01, one- 
tailed. The weak hypothesis also pre- 
dicts that S, (informative and reliable) 
presented to Group 4 should be more 
punishing than S, (unreliable) pre- 
sented to Group 3. This prediction is 
not confirmed. The results are in the 
opposite direction, but not significantly 
so, t£ (30) 2149. In addition, the 
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TABLE 2 
ANALYSIS OF VARIANCE 0F RESULTS 
IN TABLE 1 
Source df F 
Experimental Groups (G) 4 6.899** 
Modality (M) 1 3.917 
GXM 4 or 
Error (MS) 30 (.278) 
** 5 <.01. 


weak hypothesis predicts that S, pre- 
sented to Group 1 should be more 
punishing than S, presented to Group 
3. This prediction is confirmed, t (30) 
= 1,83, p <.05, one-tailed. — Finally 
the weak hypothesis predicts that S, 
presented to Group 4 should be more 
punishing than S, presented to Group 
2. The results are in the opposite di- 
rection from the prediction, but not 
significantly so, t (30) <1. 


Discussion 


The strong informational hypothesis, 
which Egger and Miller confirmed for 
a secondary positive reinforcer imposed 
on an appetitive base line, did not hold 
for a secondary punisher imposed on an 
appetitive base line. A redundant pre- 
dictor of shock acquired significant pun- 
ishing power when compared with stim- 
uli not associated explicitly with shock. 
The weak informational hypothesis, how- 
ever, was partially confirmed for a sec- 
ondary punisher, in the sense that a 
redundant predictor of shock was sig- 
nificantly less punishing than was an in- 
formative predictor of shock; further, 
this informative predictor of shock was 
more punishing than an unreliable predic- 
tor of shock. 

Our disconfirmation of the strong in- 
formational hypothesis may indicate that 
associations formed in aversive condi- 
tioning are somewhat more arbitrary or 
diffuse than those formed in appetitive 
conditioning. Hearst (1960, 1962) has 
demonstrated flatter and broader general- 
ization gradients for stimuli associated 
with shock than for stimuli associated 
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with food. Hoffman and  Fleshler 
(1961), Hoffman, Fleshler, and Jensen 
(1963), and Ray and Stein (1959) have 
Obtained broad generalization gradients 
for CER stimuli, particularly in the 
earlier stages of conditioning. Such evi- 
dence, along with clinical observations 
of the persistence of "neurotic" symptoms 
formed as a result of aversive situations, 
suggests that associations formed in aver- 
sive situations may be more controlled by 
sheer contiguity than by informational 
aspects than those formed in appetitive 
situations. 

'If one supposes, as Egger and Miller 
did, that a secondary positive reinforcer 
is an actual drive reducer, and not an 
inducer of drive, and if one also Supposes 
that a secondary punisher is a drive in- 
ducer, then it is possible that the empir- 
ical confirmations by Egger and Miller of 
the strong hypothesis and our discon- 
firmation of it might reflect an asymmetry 
between a stimulus associated with the 
termination of a drive (hunger) and a 
stimulus associated with the onset of a 
drive (fear). From this point of view, 
an interesting analogue of the Egger and 
Miller studies for the aversive case might 
be one which examined the effect of in- 
formation on the reinforcing power of a 
poss associated with the reduction of 
ear, 

It might be argued that the lower 
efficacy of the redundant stimulus relative 
to the informative stimulus was a result 
of generalization decrement in our study 
as well as the Egger and Miller studies, 
Thus during conditioning S, (redundant) 
had always occurred with S, (informa- 
tive), while in testing S, occurred alone. 
This seems unlikely since generalization 
decrement should also have affected S,, 
since in conditioning S, also always oc- 
curred with S,, while in testing S, oc- 
curred alone. y 

There is always the possibility that our 
results were biased by the choice of 
parametric settings used for fear condi- 
tioning. Such a conclusion seems un- 
likely. Our extensive pilot experiments 
indicated that intensity of shock was an 
important parameter of secondary punish. 
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ment. Fear conditioning with shock 

0.9 ma. and 0.3 ma, resulted in virtua 

complete suppression by both redu; 
and nonredundant stimuli. u 
pression was not produced by stimuli 

sociated with a 0.15-ma. shock. Degree 
of suppression in the nonassociative con- 
trol group was not markedly affected by 
intensity of shock. For these reaso 
0.21 ma. approximated the minimum | 
of electric shock at which secondary pi 
ishment will manifest itself in the numb 
of conditioning trials used in this stud; 
Our shock level was less intense than con- 
ventionally used secondary punishment 
levels, but this does not wholly vitiate tlie 
possibility that differences in results be- 
tween the Egger and Miller studies am 
our study might be because electric shock 
—even 0.21 ma.—is perhaps a more “in 
tense" primary reinforcer than foo 
pellets. 
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RECALL AS A FUNCTION OF QUANTITY AND ENCODED 
CLUSTERING OF ITEMS ELICITED UNDER TWO 
METHODS OF PRESENTATION * 


JEAN DREVENSTEDT ? 
Vanderbilt University 


Visual symbols in a 3X 3 diagonal-shaped display were presented 
sequentially to 45 Ss, who were then cued to recall the entire set of 
stimuli or selected parts. Partial reports were varied both in size (3 
and 6 symbols) and degree of organization among the symbols 
(Sampling Methods I, IT, III). Main effects of both report size and 
sampling method were found significant (p < .01). As predicted, 
however, partial recall by temporally and spatially organized item 
clusters yielded higher retention estimates than either total recall or 
partial recalls of a comparable quantity of items, randomly selected 
or merely spatially grouped. Accuracy of recall increased as a 
positive monotonic function of decrease in the number of temporal- 
"spatial clusters elicited. The data appeared to indicate that degree of 
clustering among symbols in a partial report has a more facilitative 
effect upon retrieval than mere size of the report. A second experi- 
ment (N —41), using a simultaneous presentation of items, yielded 
relative retention estimates which were lower than those of Experiment 


I, but generally consistent with the prior findings. 


By eliciting recall of multiple, ran- 
domized samplings of a set of stimuli, 
an estimate may be made of the total 
material available in memory from 
which the samplings were drawn. 
Such partial recalls have yielded short- 
term retention estimates superior to 
those obtained from total recall (An- 
derson, 1960; Klemmer & Loftus, 
1958; Sperling, 1960). 

Little evidence is available, however, 
to assess variables promoting the effi- 
ciency of the partial-retrieval tech- 
nique. Limited data in this respect 


1 Based on a dissertation submitted to the 
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were provided by Sperling (1960), 
who found partial reports by location 
(row) to be effective in facilitating re- 
trieval of visually presented material, 
whereas reports by content category 
(letters or digits) were not. It would 
appear, therefore, that factors besides 
mere size of report are significant con- 
tributors to the efficiency of the par- 
tial-recall procedure. The purpose of 
the present study was to explore fur- 
ther the facilitative effects upon recall 
of both organizational clustering and 
reduction in quantity of items elicited. 
The design of the experiment permitted 
partial reports to be varied in quantity 
retrieved and degree of organization 
among items, thus allowing a test of in- 
teraction as well as main effects of the 
two variables. 

Visual symbols were presented se- 
quentially on a display board arranged 
in 3x 3 diagonal rows. Groups of 
items were retrieved by three methods 
of sampling the display: (a) by elicita- 
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Fic, 1. Diagram of display board. (A= 
tab of display sheet; B=slot for display 
sheet; C = symbols sequentially illuminated 
by stepper-relay system.) 


tion of random individual symbols over 
the entire display ; (b) by spatial order, 
i.e, diagonal rows at right angles to 
the direction of temporal order of 
presentation (Method IT) ; (c) by spa- 
tial order coinciding with temporal 
order (Method IIT), Early studies 
(Gill & Dallenbach, 1926; Oberly, 
1928) have shown that Ss group items 
within the memory span. Moreover, 
in nonsense sequences such as the task 
of this experiment, it was assumed that 
associative strength would be greatest 
between items in temporal units (Mül- 
ler & Pilzecker, 1900; Müller & Schu- 
mann, 1894), Therefore, Method IIT : 
sampling by combined temporal and 
spatial order, should have elicited item 
clusters in which organizational bonds 
were stronger than the other two 
methods of sampling. It was predicted 
that partial recall by Method IIT would 
produce retention Scores superior to 
total recall or to Partial-recall Methods 
I and TI. 


EXPERIMENT I 
Method 


Subjects—The Ss were 45 volunteer un- 
dergraduate Women students from intro- 
ductory Psychology courses at Vanderbilt 
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University. They were alternately assigned 
to Group A or B in the order in which they 
reported for the experiment. i 

Apparatus—The display board consisted 
of a punched-out grille covering nine tubular 
cells, each 12 in. diameter (Fig. 1). Cells 
were aligned $ in. apart along the diagonal 
rows. A No. 47 miniature lamp was cen- 
tered within each cell and wired to a Stepper- 
relay system which allowed the cells to be 
successively illuminated at controlled time 
intervals. By use of a Switching mechanism 
on the relay, either of two orders of se- 
quential direction could be used: along diag- 
onal rows slanting upward, starting at left 
angle of the board (Serial Order No. 1); 
along diagonal rows slanting downward, 
starting at top angle (Serial Order No. 2). 
The directions attempted to control, to some 
extent, for top-bottom, left-right reading 
habits. The display board stood upright on 
a table in a sound-attenuated room. 

A slot between the grille and the cell 
apertures allowed a display sheet to be in- 
serted for each trial. Display sheets were 
8X 81 in. heavy-weight translucent paper, 
with a tab attached for manual insertion into 
the slot. Nine mirror-image, block-lettered 
symbols, 1 in. high, were drawn with black 
ink on the reverse side of each sheet. The 
symbols were positioned so as to center in 
the cell apertures when the sheet was in- 
serted, and thus become visible, as opaque 
figures, when cells were sequentially lighted 
behind them. 

The S's cue board was identical to the dis- 
play board, except for a Plexiglas cover over 
the grille and blank Paper over all nine cells. 
The board was fitted into a polyfoam pad 
taped to the arm of a classroom desk chair. 
Individual-cell and Master switches on the 
control box allowed E to illuminate simul- 
taneously any selected group of cells or all 
nine. The Ss were requested to recall se- 
lected parts of the display when E lighted the 
Corresponding cells on the cue board. A 
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table on S’s right held prenumbered score 
sheets on a spindle, and a tray for completed 
sheets. | Apparatus controls and E were 
screened from S’s view to the right of the 
display board. 

Stimulus material—On every trial, each 
diagonal row in the display, regardless of 
direction, contained one digit and two conso- 
nants. The symbols were randomly selected 
and positioned within the constraints of 
category equivalence. Digits were drawn 
from the Set 2 through 9 (omitting 1); 
consonants, from the Set B through L (omit- 
ting G). An attempt was made to prescreen 
and delete all combinations of symbols ap- 
pearing compellingly familiar. 

Procedure.—Each S was tested under 
seven recall conditions in a Sampling Method 
X Item Quantity X Session X Ss design, 
with total recall of all nine symbols as the 
seventh, standard comparison condition. 
During each of two sessions, 42 trials were 
run—six for each condition, rerandomized 
for presentation order after every fourth S. 
The six conditions of partial recall comprised 
two quantities (three and six symbols) by 
the three sampling methods (I, IT, III). 
For partial recalls, an equal number of trials 
was given for each row or combination of 
rows, across Methods II and III. Recall of 
the various rows, by each method, was also 
counterbalanced over sessions. For recalls 
of random individual symbols (Method I), 
cells were preassigned a number 1 through 9, 
and a sample drawn from the table of random 
numbers. Redrawings were made, as neces- 
sary, to maintain the required ratio of digits 
to consonants and to break up orderly row 
patterns. 

Sessions were spaced approximately 48 hr. 
apart. Each session was divided into three 
blocks of 14 trials, with a 3-min. rest period 
between blocks. (During each rest period, 
as well as before and after each session, 
lamps on the cue board were checked for 
possible burned-out bulbs. ^ At the first 
session, Group A received Serial Order No. 
1, followed by Order No. 2 at the second 
session. For Group B, the serial orders 
were reversed. 

When an S reported for the first session, 
she was seated at the cue board, directly 
facing the display board at approximately 
6-ft. distance. Introductory typewritten in- 
structions were given concerning the display 
and cueing lights. The S was then asked 
to read aloud a sample display as a test for 
visual acuity, and to complete three repre- 


sentative practice trials before experimental 
trials were begun. 

Each of the nine symbols was shown, in 
appropriate serial order, for 1.0 sec., with the 
next symbol appearing in sequence as the 
preceding one terminated. There was a 1- 
sec. temporal pause at the end of each diag- 
onal row, before the initial symbol in the 
next row appeared. A verbal “ready” signal 
was given 2 sec. before the start of a presen- 
tation. One second after terminating the 
final symbol, cells on the cue board were 
lighted for 25 sec., while S recorded, in any 
temporal order she wished, the correct 
symbols in the correct locations on her score 
sheet. To minimize proactive effects (Con- 
rad, 1960), a minimum of 15-20 sec. elapsed 
between a response period and the start of a 
new presentation. Each trial lasted about 
50 sec., and an entire session from 45 to 50 
min. Session II was essentially identical in 
procedure to Session I. Upon completing the 
experiment, each S was requested not to dis- 
cuss the procedures with classmates. 


Results 


For each session, the number of sym- 
bols recalled in correct location was 
averaged over trials by S for each of 
the seven recall conditions. The means 
were then expressed as proportions 
correct of the total number of symbols 
elicited on a trial for that particular 
condition. , 

Partial vs. total recall—The mean 
proportional accuracies of the six par- 
tial-recall conditions ranged from a 
negligiblé 1.2% to a maximum increase 
of 12.9% over total recall (Fig. 2). 
In the analysis of variance of partial- 
recall scores, F (1, 44) = 24.09 indi- 
cated the main effects of item quantity 
to be significant at p < .01, one-tailed. 
As size of report was reduced, accuracy 
of retrieval increased. Likewise, an 
obtained F (2, 88) = 16.00 showed the 
main effects of sampling method to be 
significant at the .01 point. Since a 
significant overall F value had been ob- 
tained in the analysis of variance, indi- 
vidual comparisons between sampling 
methods were made, using the New- 
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D----O SAMPLING METHOD 1 
A——AÀ. SAMPUNG METHOD 1i 


D 
QUANTITY OF SYMBOLS RECALLED 


Fic, 2, Proportion correct recall as a 
function of report size for three partial-recall 
sampling methods, 


man-Keuls procedure for all pairs of 
ordered means 
85), 


) to mean accu- 
racies of both Method I (72.2%) and 


with that of 
total recall, as the Standard condition. 
Both three- and six-symbol recalls by 
Method III (t = 8.06, 3.31, Tespec- 
tively) yielded significantly higher 
Scores than total Tecall (one-tailed P 
Likewise, three-symbo] re- 
s and II were su- 
perior to total recall (t= 3.06 both 
methods, one-tailed $ < 01). 


sampli 
method and report size, with F (2, 88) 


= 3.08, had closely approached signifi- 
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cance at the .05 point. In summary, 
all three-symbol Tecalls, regardless of 
sampling method by which elicited, 
yielded retention estimates significantly 
higher than total recall, One sampling 
method (IIT) yielded superior esti- 
mates, regardless of item quantity re- 
trieved. The main effects of experi- 
mental session were also highly signifi- 
cant, indicating an improvement in ac- 
curacy for Session II. This result 
probably reflected in large part practice 
effects. Since none of the interaction 
effects were significant, however, no 
further analyses of trends in scores 
Over time were made. Tt was con- 
cluded the effects of practice were es- 
sentially comparable for all partial- 
recall conditions, 

Serial-position effects —Correct re- 
Sponses for each serial position were 
summed over trials, methods, and Ss, 


tained for both Serial Order No, 1 and 
No. 2, despite different Spatial loca- 
tions on the display board for corre- 
sponding seria] Positions. There was 
a moderately Consistent decline in ac- 
curacy over the initial five symbols, 
with a pronounced Tecency effect for 
recall of the final three symbols pre- 
sented. 

Individual serial-position curves for 
all seven reca] Conditions were ana- 


III. (Curves were not clearly differ- 


Serial Positions 2. 5, and 8 were em- 
bedded within the diagonal rows in both 
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Orders No. 1 and No. 2, the data sug- 
gested that symbols spatially located at 
the extremes of the rows were recalled 
more readily. —Serialorder effects, 
however, were apparently stronger 
than spatial cues in determining the 
shape of the curves. 


Experiment II 


A. second study modified the experi- 
mental procedure by presenting all 
nine symbols simultaneously. It was 
designed to yield information on the 
effect upon retrieval accuracy of con- 
trolling S’s rate and direction of scan, 
as provided by Exp. I. Since no as- 
sumptions could be made regarding Ss’ 
scanning orders or input sequences, 
only the accuracies of three- and six- 
symbol recalls under Method I (ran- 
dom individual symbols) were com- 
pared with that of total recall. 


Method 


Subjects—A new sample of 41 volunteer 
undergraduate women students, enrolled in 
introductory psychology classes on the Van- 
derbilt campus, was selected, 


100-4 O-==40 SAMPLING METHOD | 
A— A. SAMPLING METHOD ii 

€—9 SAWPUNG METHOD Ii 

G——O TOTAL REAL 


SERIAL POSITION OF ITEM 


Fic. 3, _ Percentage of correct recall by 
Serial position of item for total and three- 


Symbol reports (summed over trials, sessions; 
aud Ss). 
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Apparatus.—The apparatus was the same 
equipment as that used in Exp. I, with a 
push-button control unit added to permit 
simultaneous presentation of the symbols. 
Duration of exposure was regulated by stop- 
watch. 

Stimulus material—The first 36 display 
sheets of Exp. I served as stimuli. 

Procedure.—Each S was tested for all 
three recall conditions in a three-way analy- 
sis of variance design, with sources of varia- 
tion being Report Size (3), Ss (41), and 
Residual (error). All Ss were tested indi- 
vidually during a single session for a total of 
36 trials, rerandomized for presentation order 
after every fourth S. Twelve trials each 
were given for total recall and for three- 
and six-symbol recalls (Method I). The 
session was divided into three blocks of 12 
trials, with a 3-min. rest period between 
blocks. 

Experimental instructions were similar to 
those of Exp. I. All nine symbols were 
illuminated simultaneously for 6.0 sec. (A 
pilot study had indicated this duration to be 
roughly the average time required to read all 
symbols silently at a moderate pace.) The 
cueing and response procedures were identi- 
cal to those of Exp. L For partial-recall 
patterns, the cells were again preassigned 
numbers 1 through 9, and a sample drawn 
from the table of random numbers. One 
restriction only was placed upon the ran- 
domization : numbers were redrawn, as neces- 
sary, to maintain the ratio of two consonants 
to one digit. 


Results 


As in Exp. I, the number of symbols 
recalled in correct location for each 
recall condition was averaged over 
trials by S. The means were then ex- 
pressed as proportions correct of the 
total number of symbols requested on 
a trial for that particular condition. 

In the analysis of variance, an over- 
all F (2, 80) — 8.00 indicated a sig- 
nificant difference among recall condi- 
tions (p < .01, one-tailed). The mean 
percentage of accuracy for three-symbol 
recall was 68.0%, as compared with 
percentages of 62.6% for six-symbol 
and 62.7% for total recall. When 
tested by the Newman-Keuls pro- 
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cedure, three-symbol recall was sig- 
nificantly superior to both six-symbol 
and total recall (p < .01 level). Since 
the data of Exp. I had not indicated 
differential practice effects, no attempt 
was made to analyze trends in accuracy 
over time. 


Discussion 


As predicted, partial recall by sampling 
temporal-spatial groupings yielded higher 
estimates of short-term memory than 
either total recall or partial recalls of 
comparable quantities of items which were 
randomly selected or merely spatially 
grouped. Moreover, accuracy of recall 
increased as a positive monotonic func- 
tion of decrease in the number of tem- 
porally and spatially organized clusters 
elicited. Within the scope of the study, 
the findings seemed to indicate that degree 
of clustering among symbols in a partial 
report has a more facilitative effect upon 
recall than does mere size of report. 
With the exception of recalls by Sampling 
Method III, all six-symbol reports on 
both the sequential and simultaneous tasks 
failed to improve retrieval significantly 
above that of total recall. This finding 
led to the conclusion that reduction in 
quantity of items elicited does not neces- 
sarily increase retrieval accuracy. 

The data indicated that sampling by 
spatial grouping (Method II) was no 
more effective in promoting recall than 
a random selection of individual symbols 
over the entire display (Method I). It 


3 The linearity observed for the Method I 
retention curve possibly reflected an artifact 
of the sampling procedure by which symbols 
were selected. To differentiate Method I 
from II and III, one of the diagonal rows 
was broken by reselection whenever, by 
chance, all symbols fell into two rows on six- 
symbol recalls. Thus, there remained a 
number of trials on which at least three 
symbols fell into a single row coincident with 
input sequence, producing a degree of simi- 
larity to Method IIT. If the random-selec- 
tion procedure had forced a true scatter of 
the six symbols as of the three, Method T and 
II curves might have coincided. Data from 
Exp. II seemed to support this explanation. 


would appear that associative strength 
was no greater between symbols grouped 
into diagonal rows, where the spatial se- 
quence was not congruent with temporal 
sequence, than it was between items scat- 
tered over the set of stimuli. On the 
other hand, the pronounced recency effect 
for the final group of three serial positions 
(the last diagonal row in temporal order) 
suggested that the experimental pro- 
cedure was effective in promoting the 
prescribed temporal-spatial clusterings. 
(Since the manner of presentation in- 
volved a confounding of temporal and 
spatial factors, little information is pro- 
vided for analyzing the actual cues used 
by Ss in forming item clusters.) 

The question arises why six-symbol 
reports by Methods I and II are no bet- 
ter than total recall. If, though, Ss in 
the present study clustered the symbols 
upon input and rehearsed according to 
temporal and spatial sequences, they likely 
retrieved by these sequences when asked 
to recall the entire series. Total recall 
may have been essentially retrieval of 
three temporally and spatially organized 
clusters, Recall of nine symbols ap- 
parently was no more difficult than recall 
of six random or spatially grouped ones. 

Serial-order retention curves (Fig. 3) 
were in general agreement with previous 
findings; (Deese & Kaufman, 1957). 
Since the task was that of recalling non- 
sense sequences, a strong recency effect 
would have been anticipated. Of par- 
ticular interest here is the indication that 
partial recall by single, temporally and 
spatially organized clusters promoted 
more efficient retrieval of all symbols, re- 
gardless of serial position. 

The second experiment seemed to pre- 
sent a more difficult task than Exp. I. 
possibly due to reduced rehearsal time 
(briefer presentation over all) and less 
efficient scanning of the entire display. 
Mean accuracy scores for all three recall 
conditions were lower than the corre- 
sponding scores on the sequential task. 
Nevertheless, the relative findings were 
consistent with those of Exp. I. Average 
improvements in accuracy of three-symbol 
recall (Method I), over total recall, were 
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quite comparable in both experiments— 
4.9% and 5.3%, respectively. The shape 
of the retention curve in Exp. II closely 
corresponded to those of Methods I and 
II in the first experiment, The data ap- 
peared to give a limited degree of gen- 
eralization to the facilitative effects upon 
recall of report reduction, where individ- 
ual items are randomly elicited. 

The presentation method used in Exp. 
I had the advantage of controlling rate 
and order of input of items, while allow- 
ing for the imposition of clustering factors 
based on space (i.e., row) and time vari- 
ables. Judging from the data of Exp. II, 
this method of presentation appears to 
promote more efficient estimates of reten- 
tion than a simultaneous display of all 
items, 
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Ss saw a series of 6 consecutively presented vertical lines differing in 
length in increasing, decreasing, or random order and then were asked 
to recognize a 7thiline as identical with a series line. In some condi- 
tions, to test for a frame-of-reference effect, the 6th line was unusually 
long or short. 4 increasing and 4 decreasing length conditions 
produced significantly less error than 3 random conditions. Frame-of- 
reference and discrimination-interference decrements were found for in- 
creasing length conditions, but only discrimination-interference decre- 
ments were found for decreasing length conditions. A related finding 
was that when the longest line was Ist in a series it had a retention 
advantage. A consistent memory bias suggests a reason for pre- 
dominance of aftergradients in so-called “spread-of-effect” experiments. 


A problem in processing information 
is to pick out which of several past 
stimuli is the same as a now perceived 
stimulus. The investigation of such 
situations can require the identification 
of immediately prior stimuli by order 
of presentation and thus give data 
concerning the short-term retention of 
serially occurring stimuli. In the pres- 
ent experiment, in which S viewed a 
set of six lines varying in length and 
then on seeing a seventh line said which 
of the previous six was identical with 
the seventh, two hypotheses were 
investigated. 

First, it was hypothesized that pre- 
senting the six lines in a consecutive 
increasing or decreasing order in length 
would facilitate subsequent recognition 
as compared to presenting the six lines 
in a random order. This hypothesis 
entails an assumption that stimulus or- 
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and Hans Furth for critical comments. 


ganization and not just the separate 
traces of individual stimuli are retained. 
If separate traces are retained by S, 
lines presented in increasing or de- 
creasing order should have maximum 
lack of discriminability from their 
predecessor and successor lines at the 
time of presentation. Thus, in ac- 
counting for a facilitation effect due to 
order, neither discriminatory processes 
nor memory traces for individual lines 
can be of determining importance. 
The prediction of better retention per- 
formance with organized stimuli ac- 
cords with Gestalt principles. “Mem- 
ory processes and ‘experience’ do not 
consist in a bare sequence of events 
each essentially alien to all the rest. 
Contextual indifference in association 
or habit (mechanical memory in gen- 
eral) is simply a limiting case 
[Wertheimer, 1938, p. 16.]” Second, 
it was hypothesized that if correct reg- 
istration is a function of the length 
range of the complete series ("frame- 
of-reference"), the insertion of either a 
very short or very long line (relative 
to the preceding series) just before the 
line to be recognized should increase 
recognition errors. 

An attempt was made to infer the 
Characteristic strategy S used in re- 
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taining the line series. The point of 
departure for this description was the 
unanticipated result that a frame-of- 
reference decrement was produced by 
a consecutively increasing but not by 
a consecutively decreasing serial order. 
Finally, a consistent memory bias is 
interpreted as having implications for 
the error after gradients found in the 
spread-of-effect phenomenon. 


METHOD 


Subjects —Two hundred and twenty male 
students from introductory psychology classes 
at Rutgers University were assigned to 11 
experimental conditions of 20 Ss each. No 
Ss were discarded. 

Procedure—Each S was run individually. 
The S faced a projection screen 4 ft. away 
and was read the instructions: 


You will view a series of lines that vary 
in length. Study each line as carefully as 
possible. (The S then saw six lines ex- 
posed in a vertical position for 4 sec, each 
with a constant approximate 1-sec. interval 
between line exposures.) You will now 
view another line which is identical in 
length to one of the six lines in the series 
you just viewed. After viewing this line 
you are to tell me which line back in the 
series it is a match for. That is, if it is 
identical to the first line you viewed, you 
would say “6-back”; if it matches the 
second line you viewed you would say “5- 
back”; if it matches the third line you 
would say “4-back”; if it matches the last 
(sixth) line you viewed you would say 
"]-back" and so on. 


Time interval between the 1-back line and 
the recognition test line was 30 sec. The S. 
was allowed as long a time as he required to 
^ make his response. In no case did the 
decision time exceed 1 min. Time between 
trials was 1 min. Each S received 12 trials. 
Experimental conditions—For all 11 con- 
ditions the correct match-backs were four 
2-back, four 4-back, and four 6-back matches 
ordered as to correctness. One-back, 3-back, 
and 5-back matches were never correct. 
However, S was told that he was free to 
use any of the six lines as matches. The 
experimental conditions can be divided into 
increasing-order, 4 decreasing-order, and 3 
random-order conditions. 
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In an increasing-order condition the first 
five lines monotonically increased in length. 
Length of the line presented sixth (called the 
"|-back" line in the Results, since it was 
l-back from the recognition line) determined 
which of the four increasing-order conditions 
was being administered. An increasing order 
is designated by the abbreviation 3-7, which 
means that Lines 3, 4, 5, 6, 7 of consecutively 
increasing length were presented to S. For 
each condition, in order that S could not be 
aided by memorizing previously presented 
series, four sets of slightly different line 
lengths were presented three times apiece in 
a random order of 12 trials. For Set 1 the 
projected line lengths were 14, 18, 2:5, 3x, 
and 48 in. For Set 2, the 14-in. line was 
omitted and a 53-in. line added. Set 3 was 
the same as Set 2 except that the 18-in. line 
was omitted and a 7-in, line added. Set 4 
was the same as Set 3 except that the 2;5-in. 
line was omitted and an 855-in. line added. 

The four increasing-order conditions are 
designated 3-7,8; 3-7,2; 3-7,1; and 3-7,9. 
In Cond. 3-7,8 the 1-back line was an in- 
crease over the previous line by a factor that 
accorded with the increases shown by the 
previous lines. Thus, Line "8" represented 
a 5ł-in. line for Set 1, a 7-in. line for 
Set 2, an 85-in. line for Set 3, and a 101-in. 
line for Set 4. Condition 3-7,2 represents a 
condition where the 1-back line was shorter 
than all previously shown lines in the same 
set, yet still in the same approximate range. 
Thus Line “2” was 3 in. for Set 1, 13 in. for 
Set 2, 18 in. for Set 3, and 2/4 in. for 
Set 4. Condition 3-7, 9 was a condition with 
an out-of-range long line in the 1-back 
position and Cond. 3-7,1 was a condition 
with an out-of-range short line in the 1-back 
position. These “aberrant” lines in the 1- 
back position did not vary in length. Thus 
dt was always 172 in, and “1” was always 

in. 

The four decreasing-order conditions were 
the exact reverse of the increasing-order 
conditions with 7-3 standing for mono- 
tonically decreasing Lines 7, 6, 5, 4, 3. Four 
sets of lines with the same length as the 
lines in the increasing series were adminis- 
tered in the same random orders, again with 
aberrant lines of 3 in. and 172 in. as Lines 
“1” and "9." The decreasing-order condi- 
tions were thus abbreviated as 7-3,2; 7-3,8; 
7-3,1; and 7-3,9. 

Three random conditions were adminis- 
tered and designated as R; R,1; and R,9. 
The same four sets of lines were adminis- 
tered each S as were used for the ordered 
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TABLE 1 


MEAN RESPONSE PERCENTAGES FOR THE Four INCREASING LENGTH AND Four 
DECREASING LENGTH CONDITIONS 


Steps back from Recognition Line Steps back from Recognition Line 
Increasing Decreasing 
Cond, Cond, 
6 5 4 3 2 1 6 5 4 3 2 1 
3-7, 8 0 0 6 | 26 | 52 | 15 1-3,2 0 1 3 | 38 | 51 6 
0 | 15 | 54 | 31 0 0 2 | 14 | 65 | 19 0 0 
64 | 29 8 0 0 0 80 | 15 1 4 0 0 
3-7, 9 6 2 9 | 55 | 30 4 1-3,1 0 1 6 | 31 | 61 0 
0 | 13 | 60 | 24 2 0 0 | 12 | 61 | 26 0 0 
51 | 39 6 2 1 0 81 16 2 0 0 0 
3-7, 2 0 1 1 | 15 | 80 2 7-3, 8 1 4 5 | 26 | 60 4 
0 9 | 49 | 42 0 0 1 11 | 46 | 40 1 0 
44 | 29 5 1 0|21 42 | 28 8 2 2 | 18 
3-7, 1 0 0 2 | 38 | 51 9 7-3,9 1 0 2 | 18 | 79 0 
0 | 16 | 40 | 36 8 0 0 | 14 | 52 | 31 2 0 
38 | 48 | 12 2 0 0 59 | 30 | 10 1 0 0 


Note.— Correct responses are in bold face. N = 20 Ss per condition. Each S made 12 observations in each 
condition with 4 correct 2-back, 4-back, and 6-back observations. 


conditions. For each random condition, the 
same five random orders were administered 
for the first five lines shown S. Therefore, 
exactly the same random order was adminis- 
tered to only four Ss within each random 
condition. In Cond. R the l-back line was 
always the largest line (Line *8") of the 
particular set being used. The dimensions 
of the lines in these sets were given above 


TABLE 2 


MEAN RESPONSE PERCENTAGES FOR THE 
THREE RANDOM LENGTH CONDITIONS 


Steps back from Recognition Line 
Random 
Cond. 


6 5 4 3 2 


- 


R,1 5 9 | 19 | 24 | 40 
14 | 14 | 40 | 28 5 
29 | 39 | 16 8 9 


R2 
N 
S 
» 
a 
R 
N 
S 
Con] COS] ANA 


Note.—Correct responses are in boldface. N=20Ss 
per condition. Each S made 12 observations in each 
condition with 4 correct 2-back, 4-back, and 6-back 
observations. 


in describing Cond. 3-7,8. For Cond. R,1 
and R,9 Line “8” of each set was omitted 
and replaced by the aberrant lines. In Cond. 
R, 1 the aberrant 3-in. line was in the 1-back 
position. In Cond. R,9 the aberrant 173-in. 
line was in the 1-back position. 


RESULTS AND Discussion 


Ordered vs. random conditions.— 
Table 1 presents mean response per- 
centages for the four increasing and 
four decreasing conditions. Individual 
rows show the mean correct percent- 
age in bold face with percentage of 
errors produced by each of the other 
series lines shown in the other five row 
entries. The table reads in sequential 
order of presentation from left to right 
so that the 6-back entry is in the left- 
hand column and the 1-back entry is in 
the right-hand column. In similar 
fashion Table 2 presents entries for 
the three-random conditions. 

Statistical analyses were carried out 
in order to test the hypothesis that 
performance for the ordered conditions 
would be significantly better than that 
for the random conditions. Mean total 
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correct responses are lower for each of 
the three random conditions than for 
any of the eight ordered conditions, and 
analysis of variance showed that cor- 
rect responses for the three random 
conditions were not significantly differ- 
ent from each other. Overall tests of 
significance were performed by testing 
each of the random conditions sep- 
arately against the eight pooled ordered 
conditions. A significant t was ob- 
tained in all three tests (p < .005, df 
— 178). Since special effects are pro- 
duced by the differing 1-back lines, as 
shown by a separate analysis of vari- 
ance, more specific tests were also per- 
formed. Each random condition was 
tested against the regular increasing 
and regular decreasing conditions 
(Cond. 3-7, 8 and 7-3,2). All six t 
tests attained significance (p < .01, 
dj — 38). 

Random and ordered conditions dif- 
fer in another way besides the differ- 
ence in mean correct responses. Ex- 
cluding 1-back entries, 22 of the 24 
rows in Table 1 have two or more 
entries of 296 or less; the other two 
rows in Table 1 have one such entry. 
There are no error entries under 596 
for the comparable entries in the nine 
rows of Table 2. This comparison 
indicates that random conditions for all 
three correct match-back lengths con- 
sistently show greater dispersion of 
errors. The hypothesis that consecu- 
tive increasing or decreasing order 
facilitates subsequent correct recogni- 
tion was fully verified. © 

Frame-of-reference and discrimina- 
tion-interference effects.—The effect of 
inserting differing lines in the 1-back 
position was examined for the ordered 
series in Table 1 to determine discrimi- 
nation-interference and frame-of-refer- 
ence decrements. Frame-of-reference 
decrements are those produced by the 
aberrant l-back lines. For all com- 
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parisons it would be expected that the 
effect of altering 1-back line length 
would be strongest for 2-back recogni- 
tion (recency effect). This proved to 
be the case, with some effect usually 
shown also for 6-back recognition 
(primacy effect). The embedded 4- 
back recognitions were generally insen- 
sitive to 1-back line length differences. 
In the increasing-order conditions, the 
best comparison for discrimination-in- 
terference effects (considered as dis- 
tinct from frame-of-reference effects) 
is between Cond. 3-7,8 and Cond. 
3-7,2. If there is discrimination diffi- 
culty, the 2-back match of 3-7, 8 should 
have a lower correct response percent- 
age (<) than that for 3-7,2 and the 
6-back match of 3-7,2 should be < 
that for 3-7,8. Both comparisons in 
Table 1 are in the appropriate direction 
to indicate discrimination-interference 
decrement and both are significant (p 
< 05; p < .01, respectively, df = 38). 

To detect a frame-of-reference decre- 
ment the 2-back match of 3-7, 9 should 
be < than that for 3-7,8, and the 2- 
back match of 3-7, 1 should be < than 
that for 3-7, 2. The same relationships 
should hold for 6-back matches. Notice 
that the aberrant lines are always the 
most discrepant in length from the 
lines for which they are predicted to 
produce an error increase. It is, there- 
fore, reasonable to attribute any decre- 
ment produced to a disturbance-in S’s 
frame-of-reference. Both 2-back com- 
parisons in Table 1 are in the predicted 
direction and both attained significance 
(p < .01, df = 38). Although the two 
6-back comparisons are both in the 
predicted direction, they did not attain 
significance. Nevertheless, considering 
that the criterion for a frame-of-refer- 
ence effect required decrement in ex- 
cess of that produced by discrimination 
difficulty, it appears that there is clear 
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evidence for a  frame-of-reference 
decrement for increasing conditions. 
The counterpart of the comparisons 
made for the increasing conditions can 
be performed with the decreasing con- 
ditions in Table 1. For discrimination- 
interference decrement, Cond. 7-3,2 
is compared with Cond. 7-3,8. Al- 
though both differences are in the pre- 
dicted direction (7-3, 2 < 7-3, 8 for 2- 
back matches and 7-3, 8 < 7-3, 2 for 
6-back matches) only the 6-back com- 
parison is significant (p< .01, df 
= 38). When predicted frame-of- 
reference decrements are looked at for 
2-back and 6-back matches (7-3, 1 vs. 
7-3, 2 and 7-3,9 vs. 7-3,8) no effect 
is found, All comparisons are in the 
reverse direction, indicating that only 
discrimination difficulty is an error de- 
terminant for the decreasing conditions. 
Why should both increasing and de- 
creasing conditions show discrimina- 
tion interference decrement but frame- 
of-reference decrement be shown only 
by increasing conditions? The answer 
remains speculative as it must be 
sought for in the strategy S uses to 
try and retain the line sets. A partial 
answer lies in the fact that S attempts 
to remember length; S retains the 
longest line he has seen and relates sub- 
sequent lines to this maximum length. 
A consequence of this is that the con- 
cepts of “longest” and “shortest” are 
not equal and opposite in efficiency of 
retention. It follows that, if the long- 
est line is the first in a set, it is in an 
especially advantageous position for 
being retained. Support for this spec- 
ulation is found in the result that peak 
correct percentages in Table 1 were 
obtained for the longest line at the 6- 
back position, but only when no longer 
lines were interposed in the 1-back 
position (Cond. 7-3,2 and 7-3,1). 
When no fixed longest line is immedi- 
ately settled upon from the set of 


presented lines (as in the increasing- 
order conditions), an out-of-range line 
in the 1-back position upsets the re- 
tained length relationships. This is 
because the lines are not retained in 
relation to an obvious modulus. At 
best, though, this explanation remains 
somewhat incomplete because of the 
insensitiveness of the 4-back recogni- 
tions to condition differences. 

Relation to “spread of effect” —Over 
and above keeping Ss honest, 4-back 
matches were employed in order that 
a closer look at the distribution of 
errors could be made. An interesting 
and unexpected result was found, In- 
stead of 4-back errors being approxi- 
mately equally distributed between 3- 
back and 5-back responses, there were 
more 3-back than 5-back errors in all 
eight ordered conditions and two of 
the three random conditions. The only 
slight exception occurred in Cond. R, 9. 
Despite this biased error distribution, 
mean 4-back errors produced most 
error for only 2 of the 11 conditions in 
Tables 1 and 2. Thus there does not 
appear to be some special difficulty in 
making 4-back recognitions that would 
produce increased “3-back” responding. 
Nor can it be argued that Ss simply 
tend to call any line in the middle of 
a series “3-back,” since for 5 conditions 
Ss made more 3-back errors when 2- 
back matches were correct than when 
4-back matches were correct. It must 
be conceded that the present experi- 
mental results were obtained for a quite 
specific type of recognition task. But, 
if a general conclusion were reached 
that in lists of items only a few of 
which were correct—and these em- 
bedded in the list—there was a 
systematic recognition displacement 
towards the recency side of the correct 
item, this result would be most import- 
ant. It would help account for the 
predominance of aftergradients found 
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in so-called "spread-of-effect" experi- 
ments. 

Postman (1962) has found after- 
gradient occurrence one of the few 
results general to spread-of-effect ex- 
periments in the numerous studies of 
this phenomenon performed under a 
wide range of experimental conditions. 
Previous attempts at explanations of 
"spread-of-effect" in terms of a memory 
bias have been poorly supported em- 
pirically, but designs of the older 
experiments used in support of this 
position were faulty (Marx, 1956). 
Methodologically, it can be noted that 
if memory bias occurs, it cannot readily 
be eliminated by instructions. For 
example, telling Ss what positions will 
be correct on the second time through 
a list would not eliminate S’s already 
formed position retentions, even if 
these retentions were systematically in 
error. Nor do such instructions fur- 
nish a motive for Ss to alter the repeti- 
tion tendencies centered on wrongly 
remembered “correct” responses. In 
a spread-of-effect experiment using a 
perceptual-motor task, Marx (1957) 
gave a control group such correctness 
information and obtained a significant 
aftergradient. Results showing the 
pervasiveness of aftergradients inde- 
pendently of reinforcement permuta- 
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tions were found in a recent experiment 
by Alfert (1963). Not only did she 
obtain aftergradients without repetition 
of the rewarded stimulus, rewarded 
response, or any reward at all, but she 
found that the frequency of repetitions 
in the position following reward can be 
significant even when the position of 
the previously rewarded response is 
not A memory-bias explanation 
would account for the aftergradient 
findings obtained with such varied con- 
ditions more simply than any other 
single explanation. 
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For 216 days, 4 groups of albino rats ran 1 trial a day in a straight 
alley. On the first 96 days, 2 groups were exposed to 100% (CRF) 
and 2 groups to 50% reinforcement (PRF). On the next 80 days, 
1 CRF group and 1 PRF group were extinguished, while the other 2 
groups remained on their original schedules. In a subsequent re- 
acquisition phase, all groups were run 40 days to 100% reward. In 
acquisition, the PRF acquisition effect was observed, i.e, PRF speeds 
relative to CRF were lower in early acquisition and higher in late 
acquisition in the initial response measures, and lower in the terminal 
response measure throughout. In extinction, the usual PRE was 
observed: PRF Ss were more resistant to extinction than CRF Ss. 
In reacquisition under CRF, all four groups reached the same terminal 
level of responding. The results were compared with recent findings 
which indicated an increase in terminal response level under CRF after 
a period of PRF or extinction, and were discussed in relation to 


current frustration theory. 


The effect of nonreward, either in 
extinction or in partial reinforcement 
(PRF), has typically been to produce 
higher asymptotic performance levels 
when continuous reinforcement (CRF) 
is later introduced, Evidence for this 
statement has come mainly from studies 
employing repeated acquisition-extinc- 
tion sessions (e.g., Jensen & Cotton, 
1960; Lauer & Carterette, 1957 ; Lauer 
& Estes, 1955), although the result has 
also been obtained in another kind of 
experiment (Theios, 1962). Usually, 
relatively short intertrial intervals 
(ITIs) have been employed. 

An experiment by Harris, Smith, 
and Weinstock (1962) has extended 
the evidence for this phenomenon. 
Employing two trials per day with a 
45-min. ITI, they indicate that in 
a continuously-rewarded reacquisition 


1 Supported by a grant (GB 143) from 
the National Science Foundation. The 
authors are indebted to John C. Ogilvie for 
his advice and assistance in the analysis of 
the data. 


phase “Ss who had had 50% rein- 
forcement were running more rapidly 
than Ss who had had 100% reinforce- 
ment, and Ss who had had extinction 
were running more rapidly than those 
who had not. The 50% reinforcement 
Ss ran more rapidly after being shifted 
to 100% [p. 392].” The results are 
interpreted in terms of a modification 
of the theory of Bush and Mosteller 
(1955) for response acquisition under 
PRF. Harris et al. (1962) assume 
that the limit point of the reinforced 
trial operator is an increasing function 
of the number of nonreinforced trials ; 
and this leads them to predict (a) that 
PRF should lead to higher asymptotic 
performance than CRF, and (b) -that 
the greater the number of prior nonre- 
inforced trials, the greater the running 
speed in later reacquisition under CRF. 
"These predictions are apparently con- 
firmed by their results ; and, since there 
was no sign of a decrease in the termi- 
nal level of responding induced by non- 
reinforcement during the 30 trials of 
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CRF reacquisition, they are led to sug- 
gest that the higher ‘performance in 
reacquisition represents a nonreversible 
change in the reinforced trial operator. 

Prediction a is not entirely consistent 
with the PRF acquisition effect shown 
in several studies (eg, Goodrich, 
1959; Haggard, 1959; Wagner, 1961) 
in which the total running response is 
fractionated into starting, running, and 
goal segments. In such investigations 
the superiority of PRF relative to CRF 
performance is evident only in the first 
two segments of the response chain; 
the goal measure usually shows the re- 
verse effect. The fact that Harris et al. 
report only a single running measure 
taken over almost the entire length of 
the runway, and report no goal meas- 
ure, means that a part of the Goodrich- 
type phenomenon is not represented 
in their data—that part which would 
not support their version of the rein- 
forcement operator model. Frustra- 
tion theory (Amsel, 1958, 1962; 
Spence, 1960) attempts to account for 
the complex PRF acquisition effect by 
assuming that weak rp in the early 
segments of the response chain is facili- 
tative, producing the greater asymp- 
totic speeds of the PRF relative to the 
CRF groups; the presence of strong 
learned or unlearned avoidance re- 
sponses to sp for the PRF group close 
to the goal presumably leads to the re- 
versed effect. Frustration theory fur- 
ther predicts that the greater speeds of 
the PRF groups in the initial alley seg- 
ments develop only after a number of 
trials, and that PRF performance will 
be poorer than CRF performance dur- 
ing the early trials. For these reasons 
it seemed important to rerun the ex- 
periment taking several measures in the 
runway rather than just one. 

It seems apparent from the curves 
presented by Harris et al. (Fig. 1, 
p. 390) that extinction, at least for 
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PRF Ss, is not complete. This makes 
it difficult to interpret the differences 
in their CRF reacquisition phase. Be- 
cause extinction of the partial and con- 
tinuous groups was not carried to ap- 
proximately the same level, the results 
in their reacquisition phase are not as 
persuasive as they might be. Apart 
from the inequality of the extinguished 
groups at the start of reacquisition, it 
might be argued that a relatively short 
period of continuous nonreward (ex- 
tinction) would serve to add condi- 
tioned excitement (rp) to the existing 
drive level of PRF and CRF Ss, and 
that the resulting increases in per- 
formance level in reacquisition in these 
groups might be mainly motivational. 
With extinction complete, any such 
motivational effects might be expected 
to disappear, permitting a better test 
of the (presumably associative) rein- 
forced trial operator model. This 
reasoning depends on the assumption 
that rg and rp are conditioned re- 
sponses and that, since primary frus- 
tration (Ry), the UCR for the condi- 
tioning of ry, depends on rg (Amsel, 
1958), nonreinforcement will lead first 
to the extinction of rg and consequently 
to the extinction of rr. 

The study reported here is in essence 
a repetition of the Harris et al. experi- 
ment with the following modifications : 
one trial a day was run instead of two; 
four measures of runway performance 
were taken instead of one; and the 
extinction phase of the experiment was 
extended very greatly The only 
other major difference was that a large 
food reward was used instead of their 
large water reward. 


METHOD 
Subjects 


The Ss were 40 male Wistar albino rats, 
approximately 60-70 days old, supplied by 
Woodlyn Farms, Guelph, Ontario. They 
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were housed in individual cages and main- 
tained on ad libitum water. The daily ration 
of food was 10 gm. of Purina lab chow 
checkers, which included the total weight 
of food pellets given as reward in the experi- 
ment. One S died during the course of 
the experiment, leaving 39 Ss from which 
data were collected. 


Apparatus 


The apparatus was a straight enclosed 
plywood runway consisting of a 15-in. entry 
box, an 11-in. start box, a 50-in. alley, and a 
15-in. goal box. The width was 23 in. 
throughout. All sections had hinged, clear 
Plexiglas lids. Plexiglas inner walls made 
it possible to line the runway with black 
inserts from start box to goal box, while 
flooring in these sections was stiffened black 
rubber matting. The entry box was natural 
plywood with a wire-mesh floor. Guillotine 
doors separated the adjacent sections. The 
entry-box-to-start-box door was of unpainted 
Masonite ; the start-box-to-alley door of clear 
Plexiglas; and the alley-to-goal-box door of 
Masonite; the start-box-to-alley door of clear 
in. food cup was mounted approximately 2 in. 
from the floor at the end of the goal box. 

Four time measures of locomotor perform- 
ance were provided. Starting time was 
measured from the opening of the start-box 
door to the interruption of a photobeam lo- 
cated 6 in. into the alley. The first running- 
time measure was taken from the interrup- 
tion of the first beam to the interruption of 
a second beam located 24 in. beyond the 
first, A second running-time measure was 
taken from the interruption of the second 
beam to the interruption of a third beam 
located 24 in. beyond the second, in the goal 
box. Goal time was measured from the 
interruption of a fourth beam, located 15 in. 
from the end of the goal box, to the interrup- 
tion of a fifth beam located 3 in. from the 
end of the goal box. A microswitch on the 
start door and the five photobeams controlled 
four Standard electric timers which recorded 
to .01 sec. 


Procedure 


Experimental design.—The Ss were ran- 
domly assigned to one of four groups, a 
CRF-CRF group (CC), a CRF-Extinction 
group (CE), a PRF-PRF group (PP), and 
a PRF-Extinction group (PE). (A 50% 
schedule of PRF was employed.) These 
group designations cover the first two phases 


C. T. SURRIDGE, J. BOEHNERT, AND A. AMSEL 


of the experiment. The third phase was 
CRF for all groups. 

Habituation—The Ss were placed: on a 
10-gm/day deprivation schedule 26 days prior. 
to the start of the experiment, during which 
time they were handled and tamed. There 
was no pretraining or preliminary explora- 
tion of the alley. 

Experimental training—A training trial 
was initiated with the introduction of S into 
the entry box. The entry-box door was 
then raised, and lowered as soon as S en- 
tered the start box. The start-box door was 
raised approximately 2 sec. later. When S 
had entered the goal box, the retrace door 
was lowered. All Ss were confined in the 
goal box for 90 sec. on every trial. The Ss 
were tested in four squads of 10 and the four 
experimental groups were represented in 
each squad as equally as possible. The 
position of each S within a squad (i.e., in the 
carrying cage) was randomly assigned at 
the beginning of the experiment and re- 
mained the same throughout, but the order 
of running Ss from day to day was 
randomized. 

Following the testing of all 10 Ss in a 
squad, they were returned to their home 
cages and were given the remainder of 
their daily food ration 30-40 min. later. 

Acquisition—The Ss received a total of 96 
trials, 1 trial per day. Reward was two 
500-mg. Noyes pellets. The sequence of 
reward (R) and nonreward (NR) for 
Groups PP and PE was as follows: R NR R 
NRNRRNRRNRRRNRR NR NR 
R. The sequence was repeated every 16 
days, there being six such sequences. Groups 
CC and CE were continuous rewarded 
throughout. 

Extinction—Groups CE and PE were run 
80 extinction trials, 1 trial per day. The 
schedule of reinforcement for Groups CC 
and PP remained unchanged. 

Reacquisition—All Ss were run 40 addi- 
tional trials to CRF, 1 trial a day. For Ss: 
in Groups CE, PP, and PE, but not for 
Group CC, this represented a change in the 
reward schedule, 


RESULTS 


Figure 1 presents the data of the 
entire experiment plotted as mean 
reciprocals of median times adjusted to 
ft/sec for blocks of four trials. In- 
cluded are data for four measures: 
Start, Run 1, Run 2, and Goal. All 
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Fic. 1. Mean reciprocals of median running times adjusted to ft/sec in blocks of 
four trials over the three phases of the experiment. 


of the statistical analyses were carried 
out on speed scores (reciprocals of raw 
scores adjusted to ft/sec). 
Acquisition 

There appeared to be differences in 
performance between Groups CC and 
CE and Groups PP and PE, groups 
run under identical conditions in the 
acquisition phase. However, analyses 
of variance showed that in no measure 
was either of these group differences 
significant, nor were any of the Group 
x Day interactions (F < 1.00 in every 
case). The Day effect was, as ex- 
pected, significant in both analyses and 
for all measures (p < .01). 

The acquisition data were, therefore, 
combined for the two CRF groups and 
the two PRF groups, and are presented 
in Fig. 2, which plots mean speeds 
(reciprocals of median times adjusted 


to ft/sec) in blocks of four trials for 
all response measures. Initial acquisi- 
tion performance is slower for PRF 
than for CRF in all measures. How- 
ever, the Goodrich-type crossover oc- 
curs in the Start, Run 1, and Run 2 
measures following Blocks 9, 10, and 
12, respectively. In the Goal measure, 
there is some suggestion that the curves 
come together at Blocks 14, 15, and 16, 
but PRF performance remains gen- 
erally slower than CRF. 

Analysis of variance over the 96 
acquisition days revealed no significant 
group differences in any measure, F 
(1, 36) < 1.00 for the first three meas- 
ures and F (1, 36) = 321, p> .05 
for the Goal measure. However, the 
Group X Day interactions were signifi- 
cant, F (95, 3420) = 1.82, 3.10, 3.62, 
and 1.87, p < .01, for the Start, Run 1, 
Run 2, and Goal measures, respectively. 
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Fie. 2. Mean reciprocals of median running times adjusted to ft/sec for. Groups GG and 
CE combined and Groups PP and PE combined, over the 96-trial acquisition phase, in 


blocks of four trials. 


These results reflect the PRF acquisi- 
tion effects evident in Fig. 2. 

As a further test for the presence of 
the crossover effect, we also computed 
a single degree of freedom contrast for 
the Run 2 measure which happened to 
show the change in relative positions of 
PRF and CRF curves from the first 
48 days to the last 48 days of acquisi- 
tion. "This contrast revealed a signifi- 
cant interaction, F (1, 3420) — 7.37, 
p<.01, confirming that PRF ran 
slower than CRF in the first half of 
acquisition and faster in the second half 
of acquisition. 


Extinction 


Returning to Fig. 1, Groups CC and 
PP remained, as expected, at approxi- 
mately the same level as in the acquisi- 
tion phase. Large differences were 
apparent in the initial rates of extinc- 
tion of Groups CE and PE, confirming 
the usual PRE finding and the Harris 
et al. (1962) data. It is interesting to 
note that extinction for both- groups 


starts earliest in the goal measure and 
last in the start measure. This is par- 
ticularly true of Group PE, which 
shows almost no extinction in the start 
measure until about Block 36. Since 
rp and rg are presumably strongest in 
the goal area, the suggestion that ex- 
tinction of rg and rp generalizes from 
the goal to the earlier segments of the 
runway is not unexpected. Over the 
last 40 days of extinction, however, 
the extinction curves for these groups 
appear to converge, and with the possi- 
ble exception of the Run 2 measure, 
there is little difference in their per- 
formance, Therefore, our purpose in 
extending the extinction phase seems to 
have been met. 

Analysis of variance of the first 40 
days of extinction for Groups CE and 
PE revealed that the Group differences 
were reliable in all measures, F (1, 18) 
= 1222 (p< 01), 474 (p «.05), 
9.20 (p< .01), and 13.81 (p « .01). 
for the four successive measures. 
Moreover, the Group X Day interac- 
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tions attained significance in all meas- 
ures, F (39, 702) = 3.00, 2.22, 2.52, 
and 2.37, respectively, p <.01 for all 
measures. 

Analysis of variance of the last 40 
days of extinction indicated no sig- 
nificant group differences, F (1, 18) 
< 1.00 for the first three measures, and 
F (1, 18) = 2.26, p > .05, for the goal 
measure. In no case, either, was the 
Group X Day interaction significant. 

The findings of these two analyses 
permit the conclusions (a) that Groups 
CE and PE extinguished at different 
rates, but (b) that at the end of the 
extinction phase they were not per- 
forming differentially. 


Reacquisition " 

Examination of the right-hand panels 
of Fig. 1 will reveal no tendency in the 
reacquisition phase for the extinguished 
groups to show higher asymptotes than 
the nonextinguished groups or than 
they, themselves, exhibited in the ac- 
quisition phase. Nor did PRF in 
the first two phases followed by CRF 
lead to a higher level of performance. 
The reacquisition curves of Groups 
CE and PE appear to be rather typical 
negatively ^ accelerated acquisition 
curves which tend to converge upon 
the curves of the nonextinguished 
groups. In no case did reacquisition 
yield the sharp rise in performance 
demonstrated by Harris et al. even 
though our reacquisition phase was 
carried to 40 trials compared to their 
30. An analysis of variance performed 
on the last 20 reacquisition days con- 
firmed that the four groups were not 
performing differently at asymptote. 
Neither the Group effects (F « 1.00 in 
every measure), nor the Group X Day 
interactions, F (57, 608) — 1.09, .58, 
1.12, and 1.07 for the four measures, 
approached significance. 
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Discussion 


The acquisition-phase findings in this 
experiment, and particularly the complex- 
ity of the relationships between PRF and 
CRF related to alley segments, confirm 
previous results, particularly Wagner 
(1961) at one trial a day, and these 
findings in themselves would seem to 
provide real difficulty for such a modifi- 
cation of Bush and Mosteller's (1955) 
theory as Harris et al. (1962) propose. 
It would seem difficult for such a theory 
—that the limit point of the reinforced 
trial operator increases with the number 
of NR trials—to account for the fact that 
NR trials in PRF lead to higher asymp- 
totic performance relative to CRF only 
in certain measures in the runway, at 
least without further assumptions. 

The evidence from our reacquisition 
phase is even more discomfiting since, 
with the very large number of nonrein- 
forced trials provided in PRF and extinc- 
tion in our experiment, one would have 
expected the reacquisition asymptotes of 
the critical groups to be even greater 
relative to the CC group than were 
Harris et al.’s. Such a prediction, avail- 
able from Weinstock’s interpretation of 
PRF phenomena, was not supported in 
the present experiment: all four groups 
were running at approximately the same 
rate after 40 reacquisition trials. More- 
over, the large and immediate increase 
in level of responding of the extinction 
groups over their previous asymptotic 
acquisition level, which was the case in 
the Harris et al. experiment, was not 
observed, Reacquisition under CRF after 
PRF alone did not lead to clearly elated 
performance, which is of some interest, 
since Theios (1962) did observe reliable 
changes toward improved performance of 
PRF when switched to CRF after 25 or 
70 trials. Theios, however, employed 
only one 78-in. running measure in a 
99-in. runway, and performance of his 
PRF groups was always inferior to that 
of his CRF groups. In the switch from 
partial to continuous, performance of his 
PRF Ss simply rose to the level already 
attained by the CRF Ss. 
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We are not led by the results of the 
present experiment to question the find- 
ings of Harris et al.; as a matter of fact, 
our data suggest that we would have 
duplicated their results had we repeated 
their experiment exactly. We are, how- 
ever, inclined to question their interpre- 
tation since it seems to us to depend on 
the exact conditions of their experiment. 

As we indicated in the. introduction, 
failure to find higher acquisition asymp- 
totic performance for PRF than for CRF 
conditions in all measures, when the alley 
is fractionated and a sequence of meas- 
ures are taken, suggests a more compli- 
cated interpretation than that provided 
by the revised Bush-Mosteller hypothe- 
sis. One explanation for these acquisi- 
tion phenomena has already been pro- 
vided by a theory of frustrative non- 
reward. 

Our extinction data are not different 
from those of Harris et al. as far as they 
go; however pursuing extinction until 
the CE and PE group curves converge 
apparently eliminates the asymptotic 
“elation” effects they report. We pro- 
vided two explanations, one to account 
for the predicted lack of difference in our 
data between the reacquisition asymptotes 
of CE and PE, the other to account more 
generally for any failure to find over- 
Shooting of the CC reacquisition asymp- 
tote by the extinguished groups. The 
first is simply that, in our experiment, 
Groups CE and PE start at approxi- 
mately the same level in reacquisition ; 
the second is that after prolonged extinc- 
tion Ss do not enter the reacquisition 
phase at a stage of great agitation as they 
do after a much smaller number of 
extinction trials. 
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109 preschool children played a game in which they guessed which of 
2 buttons would turn off a stimulus light. Success (turning off the 
light) and failure (not turning off the light) were controlled by E 
so that each S received a series of unconditionally successful trials 
followed by 6 consecutive unsuccessful trials. Ss were run under 1 of 
4 combinations of reward and punishment. To administer reward, Ss 
were given a penny after each successful trial; to administer punish- 
ment, a penny was taken away from Ss after each unsuccessful trial. 
Under nonreward and nonpunishment conditions, respectively, these 
events did not occur. Response patterns manifested over unsuccessful 
trials were classified as consistent (alternating or perseverating) or 
mixed. Ss with consistent patterns over unsuccessful trials responded 
faster over. these trials than did Ss with mixed patterns. Among Ss 
with Consistent patterns, those who were administered both reward and 
punishment Had faster response speeds following unsuccessful trials 
than did Ss run under any other reinforcement condition (reward-no 
punishment, no reward-punishment, or no reward-no punishment). 
Among Ss with mixed patterns, those who were both rewarded and 
punished had slower response speeds than did Ss run under any other 


reinforcement condition. 


Several investigations have been 
made of the differential effects of 
reward and punishment on the rate of 
discrimination learning (Brackbill & 
O'Hara, 1958; Penney & Lupton, 
1961; Stevenson, Weir, & Zigler, 
1959) ; others have dealt with the ef- 
fects on response speed of expectancy 
for reward in multiple-choice tasks 
(Lotsof, 1958; Tyler, 1958) and of 
frustrative nonreward in single-choice 
tasks (Penney, 1960). However, 
there is little if any information avail- 
able on the effects of various combina- 
tions of reward and punishment on 
response speeds in multiple-choice situ- 
ations, or on the critical factors to con- 
sider when predicting these effects. 


1 Thanks are extended to Gordon Cantor 
and David Parton for their helpful comments 
in preparing portions of this paper. 

2 At Institute of Child Behavior and 
Development. 


The research reported here was de- 
signed to provide this information. 
The stimulus cues present in a mul- 
tiple-choice task may often elicit two 
or more competing responses, and 
therefore may produce conflict. The 
degree of conflict may be inferred from 
response time or its reciprocal, response 
speed (Berlyne, 1957a, 1957b); the 
former variable has also been used to 
measure uncertainty in decision making 
(Hyman, 1953; Wyer, 1965). Con- 
flict may also be indicated by the con- 
sistency with which a single response 
or response pattern is used in perform- 
ing the experimental task. Responses 
that consistently alternate or consis- 
tently perseverate may indicate low 
conflict; one mode of responding is 
highly dominant over others. On the 
other hand, a mixed pattern (one that 
is neither consistently alternating nor 
consistently perseverating) may indi- 
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cate the presence of two or more 
equally strong response tendencies, or 
high conflict. If conflict produces a 
decrement both in the speed and in the 
consistency of responses to a multiple- 
choice task, these characteristics may 
be correlated ; i.e., response speeds over 
a set of trials may be higher among Ss 
whose responses fall into a consistent 
pattern than among those whose re- 
sponses either show no pattern or 
switch from one pattern to another. 

"The above relationship should be 
detected more clearly over a set of 
repeatedly unsuccessful trials (trials on 
which the criterion established for a 
correct response has not been met) 
than over a set of trials that are either 
intermittently or universally successful. 
Maintenance of a consistent pattern de- 
spite failure may generally indicate a 
stronger and more dominant response 
tendency than would merely consis- 
tency over successful trials. Alterna- 
tively, while a mixed pattern over 
successful trials may occur only because 
responses are made unsystematically, 
over trials following failure it is more 
apt to indicate uncertainty as to what 
response is appropriate. 

The above considerations led to the 
following prediction: 

Hypothesis 1.—Over a series of re- 
peatedly unsuccessful trials, response 
speeds will be higher among Ss whose 
responses are either consistently alter- 
nating or consistently perseverating 
than among Ss whose responses show 
a mixed pattern, The Ss manifesting 
alternation and perseveration over un- 
successful trials will not differ in their 
response speeds over these trials. 

Consideration of the effects of re- 
ward and punishment led to two addi- 
tional hypotheses. In the study re- 
ported here, success and failure in 
performing the experimental task were 
defined in terms of whether or not S 
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was able to guess which of two buttons 
would turn off a stimulus light. Under 
reward conditions, S was given a penny 
each time he was successful. Under 
punishment conditions a penny was 
taken away each time S was unsuccess- 
ful. Under nonreward and nonpunish- 
ment conditions, respectively, pennies 
were not given for success and were 
not taken away for failure. If the in- 
troduction of either reward or punish- 
ment increases the incentive to perform 
well on the experimental task, it should 
increase the strengths of whatever re- 
sponse tendencies are dominant in the 
situation. If only one response ten- 
dency is dominant, the introduction of 
reward and punishment should increase 
response speed. However, if two or 
more competing responses of nearly 
equal strength are dominant, the intro- 
duction of reward or punishment 
should increase the magnitude of re- 
sponse conflict and therefore should 
decrease response speed. 

Hypothesis 2—Among Ss who 
manifest consistent response patterns 
over unsuccessful trials, (a) those who 
have been previously rewarded will 
have higher response speeds over these 
trials than those who have not been 
rewarded, and (b) those who are pun- 
ished will have higher response speeds 
than those who are not punished. 

Hypothesis 3—Among Ss who 
manifest inconsistent (mixed) patterns 
over unsuccessful trials, (a) those who 
have been previously rewarded will 
have lower response speeds over these 
trials than those who have not been 
rewarded, and (b) those who are pun- — 
ished will have lower response speeds - 
than those who are not punished. 


METHOD 
Subjects and Apparatus 


The Ss were 109 children, ages 3-5 yr» 
from the University of Iowa Preschool and 
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the Parents’ Cooperative Preschool in Iowa 
City. 

"The apparatus incorporated an 18 X 18 in. 
vertical panel, at the base of which was 
a smaller horizontal panel. In the center of 
the vertical panel was a 2-in. diameter milk- 
glass aperture. Behind the aperture was 
a stimulus light that, when lit, could osten- 
sibly be turned off by pushing one of two 
buttons mounted 15 in. apart on the lower 
panel. In fact, whether or not either or 
both of these buttons would turn off the 
stimulus light was controlled by E from the 
back of the apparatus. Directly above each 
button on the lower panel was a small 
yellow light; these lights were used as 
"instruction" lights in a manner to be 
described. 


Manipulation of Success and Failure 


To encourage Ss to manifest each of the 
three response patterns considered (persever- 
ation, alternation, and mixed) over unsuc- 
cessful trials (trials on which they failed 
to turn off the stimulus light), an attempt 
was made to train Ss to use each of these 
patterns over a preceding series of successful 
trials. The experiment therefore consisted 
of the four phases or sets of trials described 
below. 

Phase I: Unconditional success with in- 
structions (Trials 1-9).—Each S was shown 
the apparatus and told that the purpose of 
the game was to guess, when the stimulus 
light came on, which of the two buttons on 
the lower panel would turn it off. He was 
instructed to push a button to see how it 
worked. He was then informed that when 
the stimulus light came on, the yellow in- 
struction light would be turned on over the 
button that would turn off the stimulus 
light. He was told to keep his hands in his 
lap between trials. 

By manipulating which instruction light 
came on, E was able to influence S either 
to alternate, to perseverate, or to respond in 
a mixed pattern. The mixed pattern in 
all cases was of the form "aabaabbba," where 
"a" was the button pushed by S during the 
demonstration trial. Approximately one- 
fourth of the Ss were run under each con- 
dition. For the remaining Ss, the instruc- 
tion lights were not used; these Ss were 
told only to push the button that they 
thought would turn off the stimulus light, 
All Ss were successful in turning off the 
stimulus light on every trial. 

, Phase II: Unconditional success without 
Instruction. (Trials 10-14) —Following Trial 
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9, the instruction lights were made inopera- 
tive. The Ss whose responses to the first 
nine trials had been guided by instruction 
lights were told that they would now have 
to guess which button would turn off the 
stimulus light without instruction. For Ss 
who had not been guided by instruction 
lights during Phase I, these instructions 
were omitted. Each S was made to be 
successful in turning off the light on every 
trial in this phase regardless of which button 
he pressed. 

Phase III: Unconditional failure (Trials 
15-20).—This phase, which consisted of six 
trials, followed Phase II without interruption. 
On every trial during this phase, S was 
made to be unsuccessful in turning off the 
light regardless of which button he pressed. 

Phase IV: Unconditional success (Trials 
21-25).—This phase, which followed Phase 
III without interruption, consisted of five 
trials on which S was made to be successful 
in turning off the light. This phase was 
added primarily to allow S to feel he was 
successful upon completion of the experiment. 


Reward and Punishment 


Reward and punishment conditions were 
instituted at the start of Phase II. Under 
conditions in which reward was administered, 
S was allowed to obtain a penny each time 
he turned off the stimulus light during 
Phases II and IV. Under conditions in 
which punishment was administered, S was 
required to give a penny to E each time he 
failed to turn off the light during Phase III. 
The Ss were assigned at random to one of 
the following four conditions: 

Reward, punishment (RP)—a penny was 
obtained by S after each successful trial and 
we given up by S after each unsuccessful 
trial. 

Reward, no punishment(RP’)—a penny 
was obtained by S after each successful trial 
bat was not given up after each unsuccessful 
trial. 

No reward, punishment (R'P)—a penny 
was not obtained by S after each successful 
trial, but a penny was given up after each 
unsuccessful trial. 

No reward, no punishment (R'P')—pen- 
nies were neither obtained after successful 
trials nor given up after unsuccessful trials. 
i Before the start of Phase II, Ss participat- 
ing in the first three of these reinforcement 
conditions were told that they would now 
play the game for pennies. They were told 
that if they had enough pennies at the end 
of the game they could win a prize. The Ss 
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run under RP and RP’ conditions were 
initially given two pennies whereas Ss run 
under R/P conditions were given seven 
pennies. The E kept a larger pile of pennies 
in front of him but within reach of S. As 
a result of this procedure, Ss run under all 
three conditions had seven pennies in their 
pile immediately prior to the first unsuccess- 
ful trial (T15). After the last unsuccessful 
trial (T20), Ss run under RP and R'P 
conditions had only one penny left in their 
piles while Ss run under RP’ conditions still 
had seven. In an attempt to minimize the 
effects of this difference, the number of 
pennies in E’s pile was made much greater 
than seven. This procedure presumably 
increased the likelihood that Ss run under 
RP' conditions would believe that the seven 
pennies obtained during Phase II were still 
far below the number required to win a prize. 

The Ss who were to receive reward were 
then told that each time they turned off the 
stimulus light they could take a penny from 
E's pile and place it in their own pile; Ss to 
receive punishment were instructed that each 
time they failed to turn off the stimulus light 
they were to take a penny from their own 
pile and return it to E's pile. All instruc- 
tions pertaining to reward and punishment 
were of course omitted for Ss run under 
R'P' conditions. 

Following each successful response E said 
either “You turned off the light so you can 
take a penny" or "You turned the light off," 
depending upon whether or not S was to be 
rewarded. Correspondingly, after each fail- 
ure E said either “You did not turn the light 
off so you lose a penny” or merely “You 
did not turn the light off,” depending upon 
whether or not S was to be punished. 

On each trial, the time between the onset 
of the stimulus light and the depression of 
a response button was measured with a 
Standard Precision timer and recorded along 
with the position of the button selected. To 
minimize skewness and the effects of extreme 
scores, times. were converted to reciprocal 
seconds (sec?) and used as indexes of 
response speed. 

It had originally been intended to run an 
equal number of Ss under each of the 16 
combinations of reinforcement and initial 
training (Phase I). While conducting the 
experiment, however, it was noted that sev- 
eral Ss did not continue during Phase IT 
the response pattern they used during Phase 
I (training). This was particularly true 
among Ss exposed to alternation training. 
To obtain an adequate sample of Ss who 
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alternated during unconditionally successful 
trials, several additional Ss were run under 
alternation training conditions. 


RESULTS 


Response patterns manifested under 
unconditional failure conditions were 
defined over Trials 16-21 ; each of these 
trials followed a failure.to turn off the 
stimulus light. A perseveration pat- 
tern (Per) was defined as a sequence 
of responses no more than one of which 
differed from the response immediately 
preceding it. An alternation pattern 
(Alt) was defined as a set of responses 
not more than one of which was identi- 
cal to the response immediately preced- 
ing it. Patterns not meeting either 
criterion were defined as mixed (M). 
Complex patterns (eg. *aabaabaab 
.. ." “aabbaabb . . .” etc.) were con- 
sidered to be mixed since they occurred 
rarely and could be attributed to 
chance. 


Preliminary Analyses 


Before testing specific hypotheses, 
the effects of experimental manipula- 
tions both on the type of response pat- 
tern manifested during Phase III and 
on the response speed during this 
phase were investigated. Chi-square 
analyses were performed on the num- 
ber of Ss under each training condition 
who manifested Per, Alt, and M. over 
Trials 16-21 (Table 1) and on the 
number of Ss under each reinforcement 
condition who manifested each of these 
patterns (Table 2). Neither of these 
analyses produced significant results 
(p> 10). Tt was concluded that the 
type of response pattern manifested 
over unsuccessful trials was not sub- 
stantially affected either by initial train- 
ing conditions or by the type of rein- 
forcement given for success or failure. 

To investigate the effects of experi- 
mental manipulations on response 
speeds over unsuccessful trials, 80 Ss 
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TABLE 1 


NUMBER OF Ss UNDER EACH TRAINING CON- 
DITION MANIFESTING PERSEVERATION 
(PER), ALTERNATION (ALT), OR 
MixED PATTERNS OVER 
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TABLE 2 


NUMBER OF SS UNDER EACH REINFORCEMENT 
CONDITION WHO MANIFESTED PERSE- 
VERATION (PER), ALTERNATION 
(ALT), OR MIXED PATTERNS 


T16-21 OVER T16-21 
ae Pattern, T16-21 kaa Pattern, T16-21 
paint ment 
NI Per |' ax Mixed |Total Cond Per Alt Mixed | Total 
P / (4.5) | 10 (10.9)| 10 (11.6)| 27 RP 3 (4.8)| 13 (11.8)| 13 (12.5)| 29 
Alt 5 ER 14 (14.1)| 16 (15.1)| 35 RP. 2 (4.0)| 12 (9.7) 10 (11.3)| 24 
Mixed, 3 (4.1) | 13 (10.1)| 9 (10.8)| 25 RP' 5 (4.9)| 9 (12.1) 16 (12.9)| 30 
No train-| 3 (3.6) | 7 (8.9) 12 (9.5)| 22 RADO 8 (4.3)| 10 (10.5)| 8 (11.2) 26 
ing Total | 18 44 47 109 
Total | 18 44 47 109 


Note.—Expected values given in parentheses. 


Note,—Expected values given in parentheses. 


distributed evenly over the 16 combina- 
tions of reinforcement conditions and 
training conditions were selected. 
These Ss were chosen at random with 
one restriction: Ss were included in 
alternation and perseveration training 
groups only if they continued the train- 
ing pattern during Phase II. Analysis 
of variance of these data, which are not 
Shown, yielded a significant main ef- 
fect similar to that found in subsequent 
analyses of the entire sample (Fig. 1), 
but no significant main or interaction 
effects involving type of training or 
type of reinforcement. 

The failure for training conditions to 
have an appreciable influence on re- 
sponse speed and response pattern over 
unsuccessful trials could indicate that 
nine trials in Phase I were insufficient 
to establish strong response tendencies. 
One indication of whether Ss did have 
Strong prefailure response tendencies 
may be the consistency of their re- 
sponses over successful trials in the 
absence of instruction (T10-15). To 
explore the possibility that these ten- 
dencies affected performance over fail- 
ure trials, patterns over T10-15 were 
first classified as perseveration, alterna- 
tion, and mixed according to the same 
criteria used to classify patterns over 


T16-21. ^ Patterns manifested over 
successful and unsuccessful trials were 
then compared (Table 3). A chi- 
square analysis of these data was sig- 
nificant x? (4) —1002, p< .025. 
However, this significance is attribut- 
able primarily to the fact that fewer S's 
than expected by chance switched from 
a consistent pattern (Per or Alt) on 
successful trials to a different consistent 
pattern over unsuccessful trials. There 
was no indication that Ss who mani- 
fested mixed patterns over unsuccessful 
trials used mixed patterns more fre- 
quently than they used consistent pat- 
terns over the set of trials preceding 
onset of failure. 


TABLE 3 


NUMBER OF Ss MANIFESTING PERSEVERATION 
(PER), ALTERNATION (ALT), AND MIXED 
PATTERNS (M) over SUCCESSFUL 
TRrALs (T10-15) AND Ux- 
SUCCESSFUL TRIALS 


(T16-21) 
WU di Unsuccessful Trials (T16-21) 
Trials 
VOTE A Mss Alt M Total 
Per 13 (7.5) 13 (18.2) 19 (19.4)| 45 
Alt 3 (7.3)| 22 (17.8)| 19 (19.0)| 44 
M 2 (3.3)| 9 (8.| 9 (8.6). 20 
Total | 18 109 


Note.— Expected values given in parentheses. 
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Fic. 1. Response speeds over unsuccessful 
trials as a function of the type of response 
pattern manifested over these trials. 


The relationship of response pattern 
over T10-15 to response speed over 
'T16-21 was also investigated. Only a 
main effect of trials was significant, 
F = (5, 530) 23.53; p< 01; MS.= 
054, Therefore, while the type of re- 
sponse used over successful trials may 
in some cases have affected the type of 
pattern over subsequent unsuccessful 
trials, it did not have any effect on the 
speed of responses made over these 
trials. 

Hypothesis 1 stated that Ss with 
consistent perseveration and alternation 
patterns over a set of unsuccessful trials 
would have higher response speeds 
over these trials than would Ss whose 
responses fell into an inconsistent or 
mixed pattern. To test this hypothesis, 
an analysis of variance was performed 
on the response speeds of Ss over 
Trials 16-21 as a function of the type 
of response pattern manifested over 
these trials, These speeds, in units of 
reciprocal seconds (sec), are shown 
in Fig. 1. The analysis yielded signifi- 
cant main effects of both trials, F (5, 
530) = 3.53; p< .005, MS. =.053, 
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and type of response pattern, TOT (2 
106) = 5.34, p < .01, MS, = .247, and 
a significant Pattern X Trials interac- 
tion, F (10, 530) =1.94, p< 05. 
The mean response speed of Ss exhibit- 
ing mixed patterns over unsuccessful 
trials was substantially less (M = .472 
sec!) than that of Ss either perseverat- 
ing (M —.602 sec?) or alternating 
(M —.600 sec?) over these trials. 
Furthermore, these differences became 
more pronounced as the number of 
successive failures increased. No dif- 
ferences occurred, however, between 
perseverators and alternators. Both 
portions of Hypothesis 1 therefore re- 
ceived support. 

Hypotheses 2 and 3, each of which 
pertained to the interactive effects of 
response pattern and reinforcement, 
were not tested on the entire sample 
due to the unequal numbers of SS, 
manifesting the three types of patterns 
considered. Furthermore, the small 
number of perseverators over utisuc- 
cessful trials precluded: a separate 
analysis of this group. Since the rê- - 
sponse speed characteristics of per- 
severators and alternators collapsed 
over reinforcement conditions were not 
significantly different (Fig. 1), 55 
manifesting these patterns were com- 
bined into a single group of Ss with 
consistent patterns and compared with 
Ss with mixed or inconsistent patterns. 

To test Hypotheses 2 and 3, 14 Ss 
exhibiting a consistent pattern and 8 
Ss exhibiting mixed pattern Were 
selected at random from Ss exposed 
to each of the four reinforcement 
conditions considered. Mean response 
speeds of Ss in these groups over Trials 
16-21 are shown in Fig. 2. Analyses 
of variance of these data yielded SIE 
nificant main effects of pattern, F (^ 
80) = 9.00, p < .005, MS. = .237, ant 
trials, F (5, 400) = 342, p < 01, MS 
= .056, and a significant Pattern X 
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A. CONSISTENT PATTERNS 
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B. MIXED PATTERNS 


*—— RP (n*8) 
---- R'P (n*8) 
4—— RP'(n*8) 
a-a R'P'(n« 8) 


18 
TRIAL 


Fic. 2. Response speeds over unsuccessful trials as a function of reinforcement conditions 
among Ss using (a) consistent patterns and (b) mixed patterns. 


Trials interaction, F (5, 400) = 3.30, 
p € 01; these results paralleled those 

* found in analysis of the total sample. 
In addition, there were significant in- 
teractions between pattern and reward, 
F (1, 80) =6.42, p< .025, pattern 
and punishment, F (1, 80) = 4.05, p 
< .05, and pattern, reward, and pun- 
ishment, F (1, 80) =4.38, p < .05. 
Data relevant to these results are 
shown in Table 4. 

Hypotheses 2 and 3 stated that re- 
sponse speeds of Ss with consistent 
patterns would be high when reward 
or punishment was administered while 


the response speeds of Ss with mixed 
patterns would be low when reward or 
punishment was given. While data 
pertaining to the two lower-order inter- 
actions support these hypotheses, the 
triple interaction of reward, punish- 
ment, and response pattern indicates 
that all hypothesized relationships 
should be qualified. Simple effects of 
the data in Table 4 indicated that both 
reward and punishment were neces- 
sary conditions for response speed dif- 
ferences to occur; this was true both 
among Ss with consistent patterns and 
among Ss with mixed patterns. 


TABLE 4 


MEAN RESPONSE SPEED OVER TRIALS 16-21 AS A FUNCTION OF REWARD, PUNISHMENT, 
AND TYPE or RESPONSE PATTERN OVER UNSUCCESSFUL TRIALS 


Type of Pattern, T16-21 
NE Per or Alt Punishment Cond. Mixed Punishment Cond. 
P P M P r M 
R .747 (14) | .558 (14). .652 .322 (8. 495 (8). -408 
R’ 1567 (14). | -559 (14)a 1563 7550 (8)a | -535 (8). 543 
M .657 1959 .608 436 515 A75 


Note,—Cells with unlike subscripts differ at p <.05, F (1,80) < 4.00. N in each cell given in parentheses, 
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Discussion 


The major results of this study can be 
summarized fairly concisely. Among Ss 
both rewarded and punished, those who 
either consistently perseverated or con- 
sistently alternated over repeatedly un- 
successful trials had higher average re- 
sponse speeds than did those who used 
mixed patterns over these trials. Among 
Ss not rewarded and/or not punished, 
those using consistent patterns did not 
differ substantially from those using 
mixed patterns, although in each case 
results were in the predicted direction. 

As anticipated, the introduction of re- 
ward and punishment appeared to pro- 
duce higher speeds among Ss with 
consistent patterns but to produce lower 
speeds among Ss with mixed patterns. 
However, it was somewhat surprising 
that both reward and punishment in con- 
junction were mecessary for this effect 
to occur. The introduction of only one 
type of reinforcement apparently did not 
produce an increment in incentive large 
enough to differentiate the response 
speeds of Ss run under these conditions 
from those of Ss run under control 
(R’P’) conditions. 

Caution should be taken in attaching 
much weight to a comparison of change 
in response speed over trials at each 
combination of reward, punishment, and 
type of pattern, since a significatit quadru- 
ple interaction of these variables on re- 
sponse speed was not obtained (F = .43). 
Nevertheless certain features of the char- 
acteristics shown in Fig. 2 are of interest. 
The response speeds of mixed pattern Ss 
run under R’P, RP’, and R'P' conditions 
were strikingly similar, increasing follow- 
ing the second unsuccessful trial and then 
consistently decreasing until the last trial 
following failure, where they did not dif- 
fer from response speeds of Ss run under 
RP conditions. The response speeds over ' 
trials of Ss described by other combina- 
tions of reward, punishment, and type of 
pattern were either irregular or fairly 
constant over trials. Learning theory 
(Hull, 1952; Spence, 1956) assumes that, 
for a given level of habit, change in 
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response strength is an increasing func- 
tion of both drive level (D) arid incentive 
(K). If this is the case, data in Fig. 1 
and 2 suggest that among mixed-pattern 
Ss, at least those run under lower rein- 
forcement conditions, D or K increases 
with the number of successive failures, 
while among. Ss with consistent patterns 
no change occurs over trials. Whether 
change in D or change in K is more 
likely to account for change in response 
speed over trials is unclear. 

The type of pattern manifested over 
unsuccessful trials could not be predicted 
from the type of pattern manifested on 
trials preceding the onset of failure. At 
least in this study, differences in the type 
of pattern used by Ss over failure trials 
must be attributed primarily to differ- 
ences in response tendencies existing 
prior to Ss’ participation in the experi- 
ment. Research designed to investigate 
antecedents or correlates of such tenden- 
cies may be worth considering. For ex- 
ample, the tendency to maintain a single 
manner of responding despite repeated 
failure may indicate a general rigidity of 
behavior, and therefore may be correlated 
with other measures of this characteris- 
tic (Goodstein, 1953; Luchins & Luchins, 
1959). 

Despite certain ambiguities, the results 
reported here have important implications 
for future research on the effects of suc- 
cess and failure on responses in two- 
choice situations. In the study reported 
here reinforcement conditions considered 
independently of the type of response | 
pattern manifested over unsuccessful 
trials had no effect on response speed 
over these trials. The type of response 
pattern used may therefore be an impor- 
tant factor to consider when interpreting 
situational effects on performance in 
multiple-choice tasks. t 
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Sets of letter pairs were constructed from 4 classes of letters, M, N, P, 


and Q to form MN and PQ sequences. 


In free recall, Ss produced 


more intrusions of the form MQ and PN than would be expected 
if intrusions were produced by randomly combining the letters appear- 


ing in the presented pairs. 


An adequate account of the intrusion data 


is provided by a theory of position learning proposed by Braine. An 
alternative proposal by Jenkins and Palermo (1964) based upon 
mediated transfer fails to predict the high proportion of MQ and PN 


intrusions, 


When Ss attempt to recall sets of 
letter pairs which have been con- 
structed from two classes of letters, A 
and B, with A letters occurring first 
in all pairs and B letters second, Smith 
(1963) has shown that the most promi- 
nent class of intrusions produced dur- 
ing recall consists of AB combinations 
which were omitted during presenta- 
tion. That is, Ss "recall" pairs they 
have not heard but which are formally 
consistent with the pairs to which they 
have listened. Smith argued that these 
rule-governed intrusions were, on a 
limited scale, grammatically appropri- 
ate generalizations of the presented 
pairs and illustrated the phenomenon 
of “rule-governed creativity” which 
Chomsky (1964) characterized as the 


1A brief description of the design of these 
experiments and a report of the results from 
Group B1 of Exp. I appeared in an earlier 
paper (Smith, 1965). The research re- 
ported here as Exp. I and several pilot 
studies for Exp. II were undertaken while 
the author was a research fellow at the Uni- 
versity of Minnesota and supported by 
National Science Foundation Grant G. 18690, 
James J. Jenkins, principal investigator. The 
author has profited from discussion with 
Charles Clifton, Jr., concerning the design of 
the experiments. 


essential feature of a command of the 
grammar of a natural language. The 
recall of grammatically structured non- 
sense material was proposed as a tech- 
nique for investigating the variables 
relevant to grammatical competence. 
Braine (1965a) has recently proposed 
the label “verbal reconstructive mem- 
ory” for the technique. 

Two explanations of rule-governed 
intrusions are presently available. 
Braine (1963) in an account of the 
development of grammatical skill pro- 
posed that what is learned is the posi- 
tion occupied by the morphemes and 
phrase units of the language. Braine’s 
theory is elaborated in such a way that 
the definition of position is readily 
articulated with a  phrase-structure 
model of grammar. When applied to 
the letter-pair recall experiment, 
Braine's proposal asserts that Ss will 
learn that the A letters come first in 
the pair and B letters second. Learn- 
ing here would be a type of perceptual 
learning in which the A letters come 
to sound familiar in the first temporal 
position and B letters, in the second 
temporal position. Hence, any AB 
combination will sound familiar whe- 
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ther the particular letters have occurred 
together or not. 

An alternative explanation, also of- 
fered as an account of the develop- 
ment of. grammatical competence in 
natural language, is suggested by Jen- 
kins and Palermo (1964), who pro- 
posed that the formation of word 
classes could be explained as acquired 
stimulus and response equivalence 
based upon implicit mediating re- 
sponses assumed to intervene between 
explicit verbal responses. Their ex- 
planation focused upon the occurrence 
of rule-governed novelties observed by 
Esper (1925) in the learning of two- 
unit utterances from a miniature lin- 
guistic system. Although the account 
is not completely explicit (cf. Smith, 
1963), it appears that the following 
logic: would apply to the letter-pair 
recall experiment, When the utter- 
ances, ab, cb, and ad, are learned in 
that order, mediation theory assumes 
that a contingency between an implicit 
b response and an explicit d response 
is learned during the acquisition of ad, 
because b continues to occur implicitly 
in the presence of a; therefore, the ut- 
terance cd is expected to have some 
strength due to the strength of the two 
contingencies, c — b and b (implicit) 
— d, which can be run off sequentially 
to form a chain between c and d. 

In the present experiments, Ss were 
presented letter pairs constructed from 
four classes of letters, M, N, P, and Q, 
and exposed only to MN and PQ se- 
quences. Mediation theory predicts 
that if some of the MN and PQ com- 
binations are omitted, Ss will produce 
them as intrusions so long as a triplet 
(mn, mmi, mn) can. be constructed 
for any omitted pair, mjn} Similar 
predictions are made for any omitted 
pair pjqj mutatis mutandis; however, 
so long as no MQ and PN combina- 
tions are presented, S cannot acquire 
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any contingencies such that the pairs 
mg; or pm; will be mediated. Media- 
tion theory therefore predicts that 
mediated generalization cannot occur 
on the basis of position alone. Rule- 
governed intrusions are expected to be 
confined to MN and PQ combinations. 
On the other hand, Braine’s position- 
learning proposals predict that since 
both M letters and P letters occur in the 
first position and N letters and Q letters 
in the second, Ss should learn only a 
grammar XY such that the X class 
includes M and P and the Y class, N 
and Q. Because MN, PQ, MQ, and 
PN intrusions are all XY intrusions, 
Braine would predict that all four types 
of pairs will be highly probable in- 
trusions. 

The two theories may be said to 
predict the learning of two slightly dif- 
ferent grammars, only one of which 
characterizes MQ and PN combina- 
tions as “grammatical.” The present 
experiments provide a direct test of the 
adequacy of both theories in explaining 
the occurrence of rule-governed intru- 
sions in retention data. 


ExPERIMENT I 
Method 


The letter pairs were constructed from the 
following classes of letters: M: v, s, r; N: 
g, k, 1; P: z, x, d; and Q: m, f, j. Of the 
18 possible MN and PQ pairs, vg, sk, rl, 
zm, xf, and dj were omitted, leaving a set ol 
12 combinations. Randomly ordered lists 
of these pairs were read to groups of Ss at 
the rate of 20 per min. with a 1-sec 
pause between pairs. The Ss were told 
they would be required to recall the pairs at 
the end of the list. The reading of the list 
was followed by 3 min. of written free recall. 
The Ss were instructed to write the pairs 
". . . in any order that they occur to you.” 

Two presentation lists were used. List 
A was 60 pairs long and contained five 
appearances of each of the 12 letter combina- 
tions. List A was presented to a group of 18 
college students (Group A). A second list 
(B), 180 pairs long with 15 appearances of 
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TABLE 1 


Mean NUMBER.OF LETTER COMBINATIONS PRODUCED IN Exp. | 


| Group 
Type of Letter Combination 7 — 
A | B1 mE B3 
Presented Pairs 4.44 8.62 905 | 8.38 
MN and PQ Intrusions 0.56 1.18 0.89 | 0.90 
MQ and PN Intrusions 1.78 2.85 2.21 2.48 
Other Intrusions with Correct Vocabulary 2.89 1.30 0.90 0.92 
Other Intrusions with New Vocabulary 3.72 1.65 0.66 0.87 
All Combinations (excludes repetitions) 13.39 15.60 13.71 13.55 
SDs 5.45 4.68 


3.31 3.71 


* SD of the number of combinations of all types produced. 


each combination, was administered to three 
groups of college students. The three repli- 
cations will be designated Group Bl (N 
=40), Group B2 (N = 38), and Group B3 
(N —40). Group B1 had previously partici- 
pated in an experiment involving the recall 
of words. 


Results 


The mean number of letter combina- 
tions produced by Ss in five different 
classes is shown in Table 1. There 
appears to be considerable variation 
among the groups. Group A recalled a 
little more than half as many presented 
pairs as the remaining three groups 
while producing many more intrusions, 
particularly those introducing a letter 
which did not appear in the presented 
pairs, ie. new vocabulary. Among 
the three groups receiving 15 exposures 
to the pairs, Group B1 produced sig- 
nificantly more different letter com- 
binations ? (p < .05), and Groups B2 
and B3 differed significantly in mean 
presented pairs recalled (p< .05). 
These differences suggested that statis- 
tical analysis should be performed on 
each group independently. Since this 


2 All measures reported here exclude a 
small number of illegible responses as well as 
“repetitions,” pairs which merely repeated a 
letter combination already appearing on S's 
response sheet. The mean number of repeti- 
tions for Group A was 1.67; for Group Bl, 
0.95; B2, 0.79; and B3, 1.05. 


leads to identical conclusions for all 
groups, the question is not crucial. 

The t test was used to determine 
whether the mean number of MQ and 
PN intrusions was greater than would 
be expected if Ss were producing in- 
trusions by randomly combining the 
letters which appeared in the presented 
pairs. Under the latter. hypothesis, it 
is expected that 18/120 (0.15) of all 
intrusions would be MQ and PN com- 
binations, (Twelve letters can form 
144 combinations of which 12 are com- 
binations of a letter with itself. Since 
the latter were never produced in re- 
call, it seemed reasonable to exclude 
them from the set of potential intru- 
sions. Twelve other combinations are, 
of course, presented to Ss, leaving 120 
potential intrusion combinations.) An 
expected number of MQ and PN intru- 
sions was computed for each S by mul- 
tiplying 0.15 by the total number of 
intrusions involving two letters from 
the correct vocabulary. The mean dif- 
ferences between obtained and ex- 
pected numbers of MQ and PN intru- 
sions are shown in Table 2. 

All mean differences for the MQ and 
PN intrusions were significant at the 
01 level. Comparable values for MN 
and PQ intrusions, predicted by both 
mediation and position-learning theory, 
are also shown in Table 2. A direct 
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comparison of MQ and PN intrusions 
with MN and PQ intrusions is also 
presented. The number of MN and 
PQ intrusions was multiplied by three 
in the latter comparison to compensate 
for the fact that possible MO and PN 
combinations outnumbered possible 
MN and PQ combinations 18 to 6. 
None of the comparisons of the latter 
type were significant. 


Discussion 


The differences between Group A and 
the three groups receiving 15 exposures 
seem readily attributable to differential 
exposure. Not only does the number of 
presented pairs recalled increase with ex- 
posure, but intrusions with new vocabu- 
lary appear to decrease in frequency. 
The data in Table 1 suggest that while 
increased exposure does not reduce the 
number of intrusions incorporating cor- 
rect vocabulary, intrusions of this type 
become more concentrated in the MN, 
PQ, MQ, and PN classes. 

The somewhat increased production of 
all types of combinations in Group Bl 
is readily explained by the fact that this 
group had received practice in the recall 
procedure prior to participation in the 
present experiment. Group Bl consist- 
ently produced more intrusions than 
Groups B2 and B3 in every classification, 
but the relative frequencies do not appear 
to differ. 

Mediation theory, as developed by 
Jenkins and Palermo, also suggests that 
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the learning of AB combinations should 
lead to the strengthening of AA and BB 
combinations. This conclusion is a con- 
sequence of applying three-stage media- 
tion paradigms (see Jenkins, 1963, for 
details) to the structure of the letter 
pairs. When any two combinations a,b, 
and a,b» are learned, the three-stage para- 
digms predict transfer to bjbe. How- 
ever, such a hypothesis not only fails to 
account for the high proportion of AB- 
type intrusions in the earlier studies and 
for the similarly high relative frequency 
of MN, PQ, MQ, and PN intrusions in 
the present study; it also fails to account 
for the other types of intrusions involving 
correct yocabulary. Among the 537 in- 
trusions produced from letters appearing 
in the presented pairs by Ss in the three 
groups given 15 exposures, 77% were 
MN, PQ, MQ, or PN intrusions. Only 
11% were of the types MM, PP, NN, 
or QQ. Eight percent were MP, PM, 
NQ, or QN intrusions, which for reasons 
similar to those discussed for the four- 
stage paradigms are excluded by media- 
tion theory. On the basis of the number 
of potential intrusions, 20% (24/120) of 
the intrusions would be expected to be 
MM, PP, NN, and QQ intrusions, if 
Ss were randomly combining correct 
vocabulary to produce intrusions. An- 
other 30% (36/120) would be expected 
in the MP, PM, NQ, and QN classes. 
Clearly, the intrusion data do not sup- 
port the mediation-theory account of how 
Ss produce grammatical intrusions in the 
recall of structured letter pairs. 


TABLE 2 


MEAN DIFFERENCE BETWEEN VARIOUS OBTAINED AND EXPECTED 
NUMBERS OF INTRUSIONS 


Group 
Difference 
zu A Bi B2 B3 
Obtained vs. Expected 
MQ + PN Intrusions 0.99** 205778 1.61*** 1.83*** 
MN + PQ Intrusions 0.29 0.91*** 0.70*** 0.69*** 
MQ + PN vs. 3(MN + PQ) —0.11 0.68 0.47 0.23 
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TABLE 3 
LETTERS USED IN Exp. II By CLASSES 


Letter Class 
Letter 
Set 
M N P Q 

1 vsrh gklw zdxq míjy 
2 vhrs fyjm qxdz wkgl 
3 jwgy xzdv mfkl qrhs 
4 JYwg rhsq fmkl zvdx 


EXPERIMENT II 


The second experiment was under- 
taken to answer a variety of methodo- 
logical objections concerning the first 
experiment and to investigate the gen- 
erality of the results. The motivations 
for the various procedural modifica- 
tions were as follows: 

l. The opportunities for mediation 
in the first experiment are severely 
limited. Among the presented letter 
pairs there are no triplets of pairs 
(mm, mm, mnj) such that myn;, the 
pair to which mediated transfer is pre- 
dicted, is another presented pair. That 
is to say, mediation processes cannot 
assist S in producing correct pairs, or 
to use Jenkins’ (1963) phrasing, S will 
not be reinforced for mediating. This 
limitation is due to the poverty of the 
MN and PQ systems, In Exp. II the 
membership of the four classes was in- 
creased from three to four items with 
the result that every presented pair 
could be mediated on the basis of two 
different triplets of presented pairs. 
Concretely, if M consists of ma, Ms, 
n, and m, and N, of m, ny Mz, and n, 
and the presented set includes all MN 
combinations except m,n,, mn, Mang, 
and m,n, then the two triplets (mn, 
Mfg, MN) and (mun, myn, Mn) 
each predict mediated transfer to the 
presented pair m;n,. 

2. The failure to demonstrate medi- 
ated transfer might conceivably reflect 
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characteristics of the particular letter 
pairs, especially since both presented 
pairs and intrusions have preexperi- 
mental histories of exposure. In Exp. 
II, four sets of letter pairs were con- 
structed in such a way that any specific 
combination was used as a presented 
pair in only one set and would be 
treated as an MQ or PN intrusion in 
another set and as a completely un- 
grammatical pair (e.g, QM type) in 
the remaining two sets. 

3. The first experiment provided 
only fragmentary information on the 
changes in recall and intrusion meas- 
ures with increasing exposure to the 
material The second experiment 
measured retention at four different 
levels of exposure. 

4. To obtain a different, and possibly 
more sensitive measure of retention, 
Exp. II included three recognition tests 
in addition to two recall tests foreach 
S. The advantage of the recognition 
test is that a controlled set of potential 
intrusions of selected types may be 
presented and a response to this set 
obtained from every S. 


Method 


Materials.—Four sets of 24 letter pairs 
were constructed from the 16 letters shown 
in Table 3. The presented pairs for each set 
consisted of all combinations of Class M 
followed by Class N and of Class P followed 
by Class Q except pairs formed from letters 
sharing the same ordinal position within a 
class. For example, in Set 1 the presented 
MN pairs consist of all combinations of Y, 
S, r, or h followed by g, k, 1, or w except 
the Pairs vg, sk, rl, and hw. 

The composition of the recognition tests 
can be described in terms of the four classes 
of letters, as follows : Eight presented pairs— 
four MN and four. PQ; eight potential gram- 
matical intrusions—two each, MN, PQ, MQ, 
PN; eight other potential intrusions—one 
each, MM, MP, PM, PP; NN, NQ, QN, 
QQ. Recognition Tests No. 1 and 2 had 
no specific pairs in common. Because the 
design of these two tests exhausted the poten- 
tial MN and PQ intrusions, it was necessary 
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to reuse four combinations of this type in 
Test No. 3. For all other types of pairs 
used in Test No. 3, an equal number of pairs 
which had and had not been previously 
used was selected. 

Procedure.—The Ss, 28 enlisted men sta- 
tioned at Walter Reed Army Medical Center, 
were run individually. All materials for 
the experimental session were tape recorded 
on a single tape, including all instructions, 
presentation lists, recognition tests, and a 
2-min, period of silence during the recall 
tests. 

The 24 presented pairs of each letter set 

were randomly arranged in 16 different 
orders in such a way that a pair did not 
occupy the same ordinal position more than 
once. Each random order constituted an 
"exposure" of the set. There were four 
“trials” consisting of four complete exposures 
each. Pairs were read onto the tape at the 
rate of one pair every 3 sec. with a pause 
of 6-8 sec. between each of the four expos- 
ures in the trial Recognition Tests No. 1 
and 2 followed Trials 1 and 3, respectively. 
Two-minute recall periods followed Trials 2 
and 4. The second recall test (Trial 4, 16 
exposures) was followed by Recognition 
Test No. 3. Pairs in the recognition tests 
were presented in a random order at the 
rate of one every 10 sec. The S was in- 
structed to say “yes” or “no” on the basis 
of whether he had heard the pair read in 
the presentation list. He was informed in 
advance that “about half” of the pairs would 
be ones he had actually heard. During recall 
periods S was required to say back aloud as 
many as he could remember. He was not 
informed of the correctness of any pair he 
produced. No important changes were made 
in the instructions of Exp. I concerning 
recall, Some additional procedural informa- 
tion was supplied including the nature of the 
test to be conducted at the end of each pres- 
entation period. The order in which the 
pairs were presented and recalled was de- 
scribed as unimportant. 
. The Ss were randomly assigned to one of 
the four letter sets in blocks of four with one 
S in every block assigned to each letter set 
and each block completed before a new one 
was begun. 


Results 


Recall.—The. two recall tests will be 
identified by the trial on which they 
were administered. The two tests 
yielded identical conclusions. ^ The 
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number of different letter combinations 
produced was 13.43 (Trial 2) and 
16.96 (Trial 4). The Ss produced 
few repetitions of pairs already men- 
tioned within a single recall test (means 
of 1.54 and 1.64 repetitions for Trials 
2 and 4, respectively) and even fewer 
pairs incorporating a letter not occur- 
ring in the presented pairs (means of 
11 and .00). The mean number of 
presented pairs recalled was 10.00 on 
Trial 2, 14.18 on Trial 4. The two 
classes of intrusions which are the 
focus of interest showed no change 
from the second to the fourth trial (for 
both trials; MN-PQ intrusions: X 
= 54; MQ-PN intrusions: X = 1.50). 
The mean difference between the num- 
ber of intrusions of each class produced 
and the expected value based on the 
number of intrusions composed of let- 
ters occurring in presented pairs was 
Al (Trial 2) and .43 (Trial 4) for 
MN and PQ intrusions and 1.01 (Trial 
2) and 1.09 (Trial 4) for MQ and PN 
intrusions. These values are signifi- 
cantly greater than zero at or beyond 
the .02 level. The mean differences 
between the number of MQ-PN in- 
trusions and four times the MN-PQ 
intrusions were not significantly differ- 
ent from zero—.64 (Trial 2) and .71 
(Trial 4). No significant differences 
were obtained between the four groups 
exposed to different letter sets for the 
following measures : number of different 
combinations produced, number of pre- 
sented pairs recalled, difference be- 
tween obtained and expected number of 
MN-PQ intrusions, the parallel differ- 
ence for MQ-PN intrusions, and the 
difference between MQ-PN intrusions 
and four times the MN-PQ intrusions. 

Recognition tests —Recognition tests 
will be identified by number, as fol- 
lows: No. 1 following Trial 1, No. 2 
following Trial 3, and No. 3 following 
the second recall test (Trial 4). Again 
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TABLE 4 


MEAN Proportion OF "Yes" RESPONSES 
IN RECOGNITION TESTS 


Type of Pair 
No. Intrusions 
Presented 
Pai | MN-PQ | MQ-PN | Others 
1 71 99 .46 23 
2 .88 39 30 12 
3 .89 .30 28 an 


identical conclusions were reached in 
each test. The mean proportions of 
"yes" responses given to four classes 
of intrusions are shown in Table 4. 
There appears to be a trend of increas- 
ing proportions for presented pairs as 
exposure increases and decreasing pro- 
portions for all classes of intrusions. 
With each test treated separately, there 
was a significant effect due to type of 
pair tested and neither a significant 
main effect due to letter sets nor a sig- 
nificant interaction in all three tests. 
The component of the main effect at- 
tributable to the difference between 
MN-PQ and MQ-PN intrusions did 
not reach significance. A Student 
Newman-Keuls range test indicated 
that for all three recognition tests the 
proportions of “yes” responses to pre- 
sented pairs, grammatical intrusions 
(MN-PQ and MQ-PN intrusions com- 
bined), and other intrusions were sig- 
nificantly different from each other. 


Discussion 


The results of the second experiment 
confirm those of the first and suggest 
that the prominence of MQ and PN in- 
trusions holds over a somewhat wider 
range of conditions. Mediated transfer 
as an explanation of grammatical intru- 
sions appears to be ruled out completely 
for the early stages of learning. How- 
ever, mediation cannot be excluded as a 
secondary process operative perhaps at 
some later stage of learning, but prior 
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to complete mastery of the presented 
pairs. If this were the case, a decline 
in the number of MQ and PN intrusions 


‘would be expected before a decline in MN 


and PQ intrusions began, The present 
evidence does not extend beyond 16 ex- 
posures, at which point the mean number 
of presented pairs recalled was 14.18, 
roughly 6095 of the total. Furthermore, 
no S achieved complete mastery of the 
presented pairs. 

A pilot study using 24 exposures per 
pair with frequent recall tests failed to 
reveal any new trends with greater ex- 
posure, In this sample (N = 12) only 
one S produced all 24 pairs follqwing the 
twenty-fourth exposure. The mean for 
the other 11 Ss was 11.63 presented pairs. 
The one atypical S produced five intru- 
sions of the MN-PQ type and one of the 
MQ-PN type on a test following 12 
exposures, at which time he produced 
only 15 presented pairs. He continued 
to produce five MN and PQ intrusions 
(but not always the same five) on every 
subsequent test (following 16, 20, and 24 
exposures); no other MQ or PN intru- 
sions appeared thereafter. His perform- 
ance may be compared with that of 
another S in the 24-exposure study who 
produced 23 presented pairs and no in- 
trusions of any kind on the final recall 
test. The two best performances in the 
present experiment appeared on Trial 4 
where one S produced 23 presented pairs 
without intrusions and another S, 22 pre- 
sented pairs and one intrusion of the 
MN-PQ type. These individual results 
suggest that the intrusions predicted by 
mediated transfer (MN-PQ type) may 
be the last to be removed from Ss’ recall 
productions. The practical difficulty in 
using the verbal reconstructive memory 
technique to investigate the problem lies 
in motivating Ss sufficiently to insure 
complete learning for all Ss without 
altering some fundamental aspect of the 
procedure. (For example, the most ob- 
vious procedural change, the introduction 
of specific reinforcement of pair produc- 
tion, has not been investigated to deter- 
mine whether rule-governed creativity 
would be affected by this alteration.) 
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The question of how class formation 
occurs with classes having identical posi- 
tional restrictions is an important one 
because the process occurs in natural 
language, even if it is not demonstrable 
in the verbal reconstructive memory situa- 
tion. Mediated generalization of the kind 
proposed by Jenkins and Palermo (1964) 
may find application in explaining word- 
class formation under such conditions. 
Braine (1965b) has recently discussed 
two other mechanisms which would ac- 
count for the discrimination of two or 
more classes of morphemes sharing the 
same position in natural languages. 
Neither is applicable to the present study, 
however. In the first, a frequently oc- 
curring morpheme may serve as a marker 
or function word. If each class were 
associated with a different marker (e.g., 
a different affix), then the language would 
provide an overt index to extrapositional 
regularities. For example, if the struc- 
ture of the letter pairs had been mN and 
pQ (where lower-case letters indicate a 
single entry or “function letter"), the 
markers would provide a basis for the 
discrimination of the N and Q classes 
and combinations such as mq; and pn; 
would not be expected to occur, although 
both N and Q items occupy the second 
position. However, in the present ex- 
periments all letters have an equal fre- 
quency of appearance in the presented 
pairs. Hence, none of them may be ex- 
pected to form index markers. A second 
mechanism proposed by Braine involves 
semantic correlates and obviously can- 
not apply to semantically empty systems. 
Semantic support would be available to 
the MN and PQ pairs if the two types 
of pairs were associated as names for two 
different types of stimuli. 

Evidence from a quite different experi- 
mental technique corroborates the present 
findings concerning the predictions of the 
four-stage stimulus and response equiva- 
lence paradigms.’ Braine (1963) taught 
Ss to complete a set of utterances in a 
nonsense language. The utterances may 


3 The writer is indebted to M. D. S. Braine 
for calling this to his attention. 
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be characterized as ab, cb, ad, and cd. 
The Ss were given training on problems 
such as a— with response alternatives 
b and c until all utterances had been 
mastered. Braine then presented the 
problem 4— and found that Ss chose b 
and d in preference to a and c. A suit- 
able four-stage mediation paradigm con- 
cluding in xb or xd cannot be constructed 
from the training utterances. 

The account of free-recall learning pro- 
posed by Garner (1962) is implicitly a 
position-learning analysis of what Ss 
learn in a verbal free-recall experiment 
in that the "variables" used by Garner 
to describe verbal material are the left- 
to-right positions occupied by the letters, 
words, or other units from which the 
material is composed (cf. Garner & Whit- 
man, 1965). Hence, the production of 
grammatical intrusions in the recall of 
pairs having a simple AB structure (as 
in Smith, 1963) can be explained as the 
learning of which letters belong to which 
variables, i.e., positions. However, Gar- 
ner (1962, p. 168) has also suggested 
that simple contingencies between varia- 
bles making up the material should result 
in a structure which is relatively easy to 
learn. The MN and PQ letter pairs 
contained a contingency between classes 
of letters in the two positions rather than 
a contingency between individual letters. 
The evident difficulty of these pairs sug- 
gests that a class contingency may be 
more difficult to learn than a simple con- 
tingency between letters. The present 
data, which do not involve a comparison 
of the MN and PQ pairs with other types 
of internal structure, cannot be used to 
determine whether class contingencies 
are in fact more difficult to learn. 
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The effects of word-position and stress-nonstress experimental condi- 
tions on performance in serial-verbal learning were investigated for 
high- (HA) and low- (LA) anxiety college males. The same 12- 
syllable CVC list (42.7% Glaze association value) was learned by Ss 
in Neutral and Stress conditions; in the latter, Ss were told “speed of 
learning is strongly related to intelligence.” Significant differences 
were found only in the Stress condition in which the performance of 
HA Ss was inferior to that of LA Ss early in learning, and superior 
later in learning. For words at the extremes of the serial list, the 
performance of HA Ss exceeded that of LA Ss at an earlier stage in 
learning than for words embedded in the list. It was concluded that 
the effects of anxiety on serial learning depend upon experimental 
stress, characteristics of stimulus materials, serial-position phe- 
nomena and, in some cases, on individual differences in intelligence. 


According to Spence’s (1958) theory 
of emotionally based drive, noxious or 
aversive stimuli arouse a hypothetical 
emotional response, re and drive level, 
D, is a function of the strength of this 
response, The Taylor (1953) Mani- 
fest Anxiety scale (MA scale) was 
developed as an operational measure of 
individual differences in re and the 
construct validity of the MA scale as 
an index of D has been consistently 
demonstrated in classical defense condi- 
tioning experiments (Spence, 1964). 

Complex learning tasks typically do 
not involve noxious stimuli, and evi- 
dence bearing on whether or not Ss 
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with high anxiety (HA) as measured 
by the MA scale have higher D on such 
tasks than Ss with low anxiety (LA) is 
inconclusive. Spence (1958) has pro- 
posed two alternative hypotheses con- 
cerning the relation between D and 
MA scale scores: (a) the "chronic" 
hypothesis posits that HA Ss manifest 
higher D than LA Ss in all situations, 
whether stressful or not; (b) the 
“emotional reactivity" hypothesis posits 
that HA Ss react with higher D than 
LA Ss to situations containing some 
degree of stress. The findings in re- 
cent studies have favored the reactive 
hypothesis (Spence, 1964). 

Regarding the implications of drive 
theory for complex learning tasks, 
Spence and Spence (1966) have re- 
cently stated, 


If the correct, to-be-learned response is 
... initially weaker than one or more of the 
competing response tendencies, then the 
higher the drive level, the poorer the per- 
formance during the early stages of learning. 
However, as learning of the correct responses 
increases over trials, the habit strength of 
these responses would be expected to equal 
and then exceed those of competing re- 
sponses. Thus, while the performance of a 
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high drive group would be expected to be 
inferior to that of a low drive group in early 
stages of learning, it should become superior 
in later stages [1966, p. 204]. 


The findings in experiments which 
compared the performance of HA and 
LA Ss on serial-learning tasks have 
been only partially consistent with pre- 
dictions derived from drive theory 
(eg., Lazarus, Deese, & Hamilton, 
1954; Lucas, 1952; Montague, 1953). 
In these studies, HA Ss were superior 
to LA Ss on serial tasks in which it 
could be reasonably assumed that cor- 
rect responses were dominant relative 
to incorrect responses, and inferior on 
tasks in which competing responses 
were considered stronger. However, 
the prediction that HA Ss would be 
inferior to LA Ss early in learning and 
superior later in learning was either not 
evaluated or not confirmed, 

Investigations of the effects of 
anxiety (drive) on serial-verbal learn- 
ing have largely ignored serial-position 
phenomena, On the assumption that 
words in the middle of a serial list 
evoke more competing responses than 
those at the extremes, drive theory 
would predict that high D would begin 
to facilitate performance sooner for 
words at the extremes of a serial list 
than for words embedded in it. The 
evidence available from serial-verbal 
learning studies in which the perform- 
ance of Ss who differed in anxiety 
level were compared do not support 
drive theory. Malmo and Amsel 
(1948) reported that psychiatric pa- 
tients with “severe anxiety” made more 
errors on words at the beginning of a 
serial list than did “normal” (presum- 
ably less anxious) controls. Kalish, 
Garmezy, Rodnick, and Bleke (1958) 
found, in response to failure stress, that 
HA Ss made fewer errors than LA Ss 
for words im the middle and at the end 
of a serial list. However, the results 
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of two experiments using serial maze 
tasks were generally consistent with 
drive theory (Farber & Spence, 1953; 
Taylor & Spence, 1952). 

The purpose of the present study 
was to investigate the effects of word- 
position and stress-nonstress experi- 
mental conditions on  serial-verbal 
learning for Ss who differed in anxiety 
(drive) level as measured by the MA 
scale. Assuming that HA Ss react 
with higher D than LA Ss to threaten- 
ing situations, it was expected that 
differences in the performance of HA 
and LA Ss would be found only in 
the stress condition. ^ Furthermore, 
on the basis of drive theory, the per- 
formance of HA Ss was expected to be 
inferior to that of LA Ss in the early 
stages of learning, and relatively better 
later in learning. | Taking serial-posi- 
tion phenomena into. account, it was 
also expected that the facilitative effects 
of high drive would occur earlier in 
learning for words at the extremes of 
a serial list than for words embedded 
in it. 


METHOD 


Subjects—The Ss were 44 undergraduate 
males enrolled in the introductory psychology 
course at Duke University who obtained 
extreme scores on the MA scale. |The Ss 
with scores of 21 or higher were designated 
the HA group; Ss with scores of 9 or lower 
were designated the LA group. These de- 
fined, approximately, the upper and lower 
20% of the total MA scale distribution after 
Ss with MMPI Lie scale scores of 8 or 
higher had been eliminated. Twelve HA and 
12 LA Ss were run in a Neutral condition; 
10 HA and 10 LA Ss were run in a Stress 
condition. The random assignment of.HA 
and LA Ss to experimental conditions took 
place before each S reported for the experi- 
ment so that E, a female senior undergradu- 
ate honor student, would not know. whether 
S was considered anxious or nonanxious: 

Stimulus materials and apparatus.—The 
stimulus materials consisted of a practice list 
and a test list typed in capital letters on end- 
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less white tapes and presented on standard 
Hull-type memory drums. Each syllable ap- 
peared in the drum aperture for 23 sec. with 
a i-sec. interval between syllables and a 5- 
sec. intertrial interval. The practice list 
contained 8 CVC nonsense syllables of 90% 
Glaze (1928) association value and low 
intralist similarity selected from Montague’s 
(1953, p. 92) List III. The test list con- 
sisted of 12 CVC nonsense syllables of 42.7% 
association value and low intralist similarity.* 
Montague (1953) found no significant differ- 
ences in the performance of HA and LA 
Ss on this list. The words in the test list 
and the order in which they were presented 
are given in Table 1. X^ 

Neutral condition procedure.—These Ss 
were given standard instructions for the 
serial-anticipation method of verbal learning, 
followed by 6 trials on the practice list. 
Upon completion of the practice trials, Ss 
were told to: “Spell out all the syllables 
aloud as they appear on the first time through 
the list, and after that, try to spell out the 
syllable which follows the one you see in the 
opening.” The test list was repeatedly pre- 
sented until S either attained the learning 
criterion of 2 successive perfect trials, or re- 
ceived a maximum of 25 trials. 

Stress condition procedure.—In order to 
induce an “ego-stress,” Ss were given a brief 
concept-attainment task (Denny, 1963) after 
which they were told, “The part of the ex- 
periment which you have just done is one 
measure of thinking and is related to one 
‘kind of intelligence. The rest of the ex- 
periment has to do with a somewhat different 
method of thinking in which your task is to 
memorize a list of nonsense syllables. We 
have done some research on the relationship 
between nonsense syllable learning and in- 
telligence. Here is a graph of the relation- 
ship we found." A graph in which “Num- 
ber of Correct Responses” increased rapidly 
and dramatically as a function of “1.Q.,” was 
shown to Ss who were told, “you see, speed 
of learning increases with intelligence.” The 
Ss were then given the practice and test lists 
exactly as these were presented in the 
Neutral ‘condition. 


»8 Phe words in the test list, and the order 
in. which-they were presented, were exactly 
the same as for Montague’s List II, except 
that the syllable "wrP" was replaced with 
“ug”; pilot data showed that “mrp” had 
a very high association strength (“mP”) for 
the,population of male Ss used in this study. 
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‘RESULTS 


The number of correct anticipations 
given by each S on each block of 5 
trials served as the dependent variable 
in all of the statistical analyses. The 
Ss who reached the learning criterion 
prior to the twenty-fifth trial were 
given credit for perfect performance on 
all trials subsequent to the criterion 
trial. The mean number of correct 
responses given on successive trial 
blocks by HA and LA Ss in the Neu- 
tral condition is presented in Fig. 1A 
and comparable data for the Stress 
condition is presented in Fig. 1B. The 
Neutral and Stress conditions were 
evaluated separately by analyses of 
variance in which Anxiety (drive level) 
was the between-Ss variable, and Trial 
Blocks the within-Ss variable (Lind- 
quist, 1953). In the Neutral condition, 
only the Trial Blocks effect was sta- 
tistically significant, F (4, 88) = 
353.51, p<.001. For the Stress 
condition, the Anxiety X Trial Blocks 
interaction, F (4, 72) = 2.97, p < .05, 
and the main effect of Trial Blocks, F 
(4, 72) = 1387, p < .001, were both 
significant. These findings, when con- 
sidered along with the learning curves 
presented in Fig. 1, indicated that there 
were no differences in the performance 


i2 —— nee Was (Wiad High MAS (N10) 


[-- Lo MAS (N*I0) 


MEAN NUMBER OF CORRECT RESPONSES 
-ow €» o 6 «4 0 o 5 


A Neutral Condon 


B. Stress Condition 


1 ym mu x 


Ee ee a ES 
‘mocks OR FiVETTAALS BLOCKS OF FIVE TRIALS 


Fic. 1. Learning curves for HA and LA 
Ss: Fig. 1A. The mean number of correct 
responses given over trial blocks by Ss in the 
Neutral condition; Fig. 1B. The mean num- 
ber of correct responses given over trial 
blocks by Ss in the Stress condition, 
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TABLE 1 


MEAN NUMBER OF Errors GIVEN BY HIGH- 
ANXIETY AND Low-ANXxiETY SS IN THE 
STRESS CONDITION ror Each WORD 
IN THE TesT List 
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of HA and LA Ss over trial blocks in 
the Neutral condition, and that in the 
Stress condition the performance of 
HA Ss was inferior to that of LA Ss 
early in learning but superior later in 
learning. 

Effects of anxiety and serial position 
on performance.—Since performance in 
the Neutral condition was unrelated to 
anxiety, further analysis of the data for 
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Fic. 2. Mean number of correct responses 
given by HA and LA Ss on successive trial 
blocks for EASY and HARD words. The EAsv 
words appeared in Serial Positions 1, 2, 3, 
and 12; the HaRD words appeared in Serial 
Positions 6, 7, 8, and 9. 
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TABLE 2 


ANALYSIS OF VARIANCE OF THE EFFECTS OF 
ANXIETY LEVEL, HARD-EASY WORDS, AND 
TRIAL BLOCKS ON THE PERFORMANCE 
or HIGH- AND LOW-ANXIETY 
Ss IN THE STRESS 


CONDITION 
Source af MS F 
Anxiety (A) 1 15.68 16 
Error (b) 18 | 100.17 
HARD-EASY Words 1 | 1800.00} 73.17*** 
(W) 
Trial Blocks (Tr) 4| 976.71 | 136.19*** 
A XTr 4| 19.39 2,70* 
W X Tr 4| 51.66 8.22*** 
AXW 1 7442 3.03 
AXW X Tr 4 25.38 4.04** 
Error (1)—W, 18 24.60 
AXW 
Error (2)—Tr, 72 747 
AX Tr 
Error (3)— 72 6.29 
W X Tr, 
AXW X Tr 
Total 199 
«.05. 
**p <01. 
FD < .001 


this condition was considered unwat- 
ranted. In order to evaluate the joint.» 
influence of anxiety and serial position 
on performance in the Stress condition, 
the mean number of errors made by 
HA and LA Ss was determined for 
each word in the test list. These 
means are given in Table 1 in which it 
may be noted that the data reflect the 
well-known, bow-shaped, serial-posi- 
tion curve, and that this curve was 
more stable for the combined data of 
the HA and LA Ss than for the data 
of these groups taken separately. For 
the combined data, the words in Serial 
Positions 1, 2, 3, and 12 individually 
elicited fewest errors in contrast to 
words in Serial Positions 6, 7, 8, and 
9 which elicited the largest number of 
errors; the words in these serial posi- 
tions were, respectively, designated 
EASY and HARD. 

The mean number of correct re- 
sponses for Easy and Harp words 
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given on successive trial blocks by the 
HA and LA Ss in the Stress condition 
are presented in Fig. 2. These data 
were evaluated in an analysis of vari- 
ance, summarized in Table 2, in which 
the most important finding was the sig- 
nificant triple interaction involving 
Anxiety, HARD-EASY words, and Trial 
Blocks. This finding indicated that 
the influence of serial position on per- 
formance differed for HA and LA Ss 
at different stages of learning, as may 
be noted in Fig. 2. On Trial Block I, 
the performance of HA Ss was inferior 
to that of LA Ss on both rasy and 
HARD words; on Trial Blocks II and 
III, the performance of HA Ss was 
superior to that of LA Ss for Easy 
words but inferior for HARD words; by 
Trial Block IV, the performance of 
HA. Ss was superior to that of the 
LA Ssfor both types of words, and this 
superiority was maintained in Trial 
Block V. 


Discussion 


In this study, the performance of HA 
and LA Ss showed differences attrib- 
utable to drive level in an “ego-stress” 
condition, but not in a neutral condition. 
The amount of stress induced in the 
Neutral condition, in which E was an 
undergraduate female regarded by her 
peers as pleasant and relaxed (and ex- 
ceedingly considerate of the needs of 
others), was evidently not sufficient to 
produce differences in the D level of HA 
and LA Ss. However, when Ss in the 
Stress condition were informed by the 
same E that they were participating in 
an intelligence-testing situation, the per- 
formance of HA and LA Ss suggested 
that they reacted to these “ego-stress” 
instructions with differential levels of D. 
These findings were generally consistent 
with results obtained in previous studies 
using neutral and ego-stress or failure 
instructions (e.g., Lucas, 1952; Nichol- 
son, 1958; Sarason, 1960; Spence & 
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Spence, 1966), and provide empirical 
support for Spence's (1958) “emotional 
reactivity" hypothesis which posits that 


HA Ss react with higher D than LA Ss 


to situations containing some degree of 
stress. 

The performance of HA Ss in the pres- 
ent study was inferior to that of LA Ss 
early in learning and superior later in 
learning. Assuming that correct response 
tendencies are weaker than competing 
(error) tendencies in the early stages of 
learning and gradually become stronger 
as learning progresses, these findings 
were consistent with predictions derived 
from Spence and Spence's (1966) theory 
of emotionally based drive. Further 
support for the theory was observed when 
error tendencies associated with serial- 
position phenomena were taken into ac- 
count. For words at the beginning and 
the end of the serial list (EAsv words), 
the performance of HA Ss exceeded that 
of LA Ss at an earlier stage in learning 
than was the case for words in the middle 
of the serial list (mard words). 

At what stage in learning does anxiety 
(drive) begin to facilitate performance? 
In Fig. 1B, it may be noted that the 
performance of HA Ss began to exceed 
that of LA Ss at a point in learning at 
which the mean number of correct re- 
sponses given by both groups was just 
over 50%. Similarly, in Fig. 2 the 
performance curves for the HA Ss 
crossed those of the LA Ss at different 
stages in learning for the Easy and the 
HARD words, and correct responses were 
given approximately 50% of the time by 
both groups at these crossover points. 
On the basis of the consistency in these 
findings, it is tempting to generalize that 
high drive will begin to facilitate per- 
formance when the probability of occur- 
rence of a correct response just exceeds 
5096. But the performance of HA Ss 
has been observed to be ‘superior to that 
of LA Ss in seriallearning studies in 
which correct responses were given at 
well below the 50% level (e.g., Deese, 
Lazarus, & Keenan, 1953; Kalish et al., 
1958; Montague, 1953). Such findings 
serve to emphasize that the relative 
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strengths of correct and competing re- 
sponse tendencies determine the effects of 
high drive on performance, rather than 
the strength of correct response ten- 
dencies alone (Taylor, 1956). 

There is also evidence that intelligence, 
like task complexity, may interact with 
anxiety to influence performance on learn- 
ing tasks. For example, in a serial- 
verbal learning study Deese et al. (1953) 
reported that the performance of HA Ss 
was superior to that of LA Ss at the 
Johns Hopkins University, an institution 
noted for the quality of its students. 
They (Deese et al, 1953) also refer to 
unpublished data indicating that, “with 
the identical design and task used in the 
present experiment, but with a population 
of lower intelligence . . . high-anxious Ss 
performed more poorly than low-anxious 
Ss [Footnote 2, p. 59.]” Thus, HA ap- 
parently facilitated the performance of 
Ss with high intelligence while leading 
to performance decrements for less able 
Ss. Similarly, it has been observed that 
very bright HA college students achieve 
higher grades than LA Ss of comparable 
ability, whereas the academic achieve- 
ment of less able HA Ss is inferior to 
that of LA Ss of comparable ability 
(Spielberger, 1962; Spielberger & Katz- 
enmeyer, 1959). 

Viewed within the context of drive 
theory, the observation that HA facili- 
tates the performance of bright Ss and 
leads to performance decrements for less 
intelligent Ss suggests that task difficulty 
is an inverse function of intelligence. 
For bright Ss, a given learning task 
may elicit only correct responses whereas 
the same task may evoke a large number 
of strong competing (error) tendencies 
in less intelligent Ss. Denny (1963) has 
recently demonstrated that while the per- 
formance of HA-high intelligence Ss was 
superior to that of LA-high intelligence 
Ss on a concept-formation task, the per- 
formance of HA-low intelligence Ss was 
inferior to that of LA-low intelligence S¥ 
on the same task. Thus, it would seenr 
especially important to take individual 
differences in intelligence into account 
when deriving predictions from drive 
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theory concerning performance on com- 
plex learning tasks. 

On the basis of the findings of the 
present study, it may be concluded that 
the performance of HA and LA Ss will 
reflect differences in drive level only 
where experimental conditions contain 
some degree of stress. When differences 
in D are induced, performance in serial 
learning would seem to depend upon 
serial-position phenomena, the inherent 
characteristics of stimulus materials that 
determine the relative strengths of cor- 
rect and competing response tendencies 
(e.g. intraserial duplication and associa- 
tion value), and other individual differ- 
ence variables such as intelligence. Fur- 
thermore, these variables are likely to 
interact in different ways at different 
stages of learning. Surely such a state of 
affairs affirms Spence's (1958) conten- 
tion that, “in order to derive implications 
concerning the effects of drive variation 
in any type of complex learning task, it 
is necessary to have, in addition to the 
drive theory, a further theoretical net- 
work concerning the variables and their 
interaction that are involved in the par- 
ticular learning activity [p. 137.]" 
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EFFECTS OF ANXIETY 


AND INTELLIGENCE ON 


CONCEPT FORMATION * 


J. PETER DENNY 


University of Western Ontario 


Spence’s theory of anxiety and performance was extended to include 
individual differences in intelligence, and the extended theory was then 
employed to predict the effects of anxiety on concept formation. 4 
groups (14 male undergraduates each), representing the 4 possible 
combinations of high and low anxiety (defined by the Taylor MA 
scale) with high and low intelligence (defined by CEEB aptitude 
scores), were given a modified Hovland-Bruner type concept-forma- 
tion task. The measure of concept-formation proficiency was the 
number of erroneous conclusions about whether or not attributes were 
included in the concept. The effects of anxiety and intelligence upon 
this measure were interactive: among high-intelligence Ss those with 
high anxiety made fewer errors than those with low anxiety; whereas, 
among low-intelligence Ss those with high anxiety made more errors 
than those with low anxiety. These results were consistent with 


expectations derived from the extended anxiety theory. 


When considering the effects of 
anxiety upon complicated processes, 
Spence (1958) has emphasized the 
importance of specifying additional 
variables relevant to the process under 
investigation. Therefore, in order to 
make predictions about the effects of 
anxiety on concept formation, anxiety 
theory (Spence, 1958) was extended to 
include individual differences in intel- 
ligence. This extension seemed neces- 
sary for two reasons. First, investi- 
gations of the effects of anxiety on con- 
cept formation that have not taken in- 
telligence into account have reported 
contradictory results. For example, 
Seigman (1956) found a decrement 

1Based on a dissertation, supervised by 
Charles D. Spielberger, and submitted to the 
Graduate School of Duke University in 
partial fulfillment of the requirements for 
the PhD degree. Thanks are also due to 
C. Alan Boneau, David T. Wells, and Lou 
Ella Smith for their assistance with this 
research, which was supported in part by a 
grant, OM-362, awarded by the National In- 
stitute of Mental Health to C. D. Spielberger 
and H. Weitz. This experiment was re- 
ported at the Midwestern Psychological As- 
sociation Meetings, Chicago, May 2-4, 1963. 


in concept-formation proficiency re- 
lated to anxiety, whereas Wesley 
(1953) obtained a nonsignificant in- 
crement. Second, intelligence is a 
variable known to be related to concept- 
formation proficiency (e.g., Baggaley, 
1955; Hoffman, 1955; Osler & Fivel, 
1961). 

Previous investigators (Holbrook, 
1954; Lipman & Griffith, 1960; Roma- 
now, 1958) attempted to resolve these 
inconsistent findings by introducing the 
variable of task complexity. They 
assumed that complex tasks would 
elicit stronger incorrect than correct 
habits, which when energized by in- 
creased anxiety drive, would lead to 
increased excitatory potentials for in- 
correct compared to correct responses. 
Thus, they expected high-anxious Ss 
to make more incorrect responses than 
low-anxious Ss, a prediction which was 
supported in all these studies. On the 
other hand, they assumed that simple 
tasks would elicit stronger correct than 
incorrect habits, which when energized 
by anxiety would lead to more correct 
responses. However, only Holbrook 
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demonstrated this expected facilitation 
of performance as a function of anxiety ; 
Romanow and Lipman and Griffith 
failed to do so. 

These contradictory findings con- 
cerning the effects of anxiety on con- 
cept formation, even when task com- 
plexity was manipulated, suggest that 
the relative strengths of correct and 
incorrect habits in concept formation 
are not wholly determined by task 
complexity. Another possible deter- 
minant is S’s capacity for learning. 
Given a task of fixed complexity, Ss 
of high capacity should learn correct 
habits more readily than those of low 
capacity. Capacity for learning con- 
cept-formation habits may be regarded 
as an aspect of intelligence and may be 
assessed by intelligence tests. This 
procedure is justified since many 
models of intelligence include con- 
ceptual processes (e.g., Guilford, 1959) 
and many intelligence tests seek to 
assess them (Wechsler, 1958, pp. 
72-74). 

The present study sought to demon- 
strate the relevance of Ss' intelligence 
levels to the effects of anxiety upon 
concept formation. High- and low-in- 
telligence Ss were given the same con- 
cept-formation task? ^ The level of 
task complexity was fixed by empirical 
pretesting so that the task would be 
fairly easy for high-intelligence Ss and 
fairly difficult for low-intelligence Ss. 
Therefore, a high level of anxiety was 
expected to yield more proficient con- 
cept formation than low anxiety for 
these Ss. On the other hand, it was 
assumed that the task would elicit 


?Tn the original study (Denny, 1963) two 
tasks were used. However, only the data for 
the first task are reported here because dif- 
ferences between the tasks in complexity 
were confounded with order effects, and also 
because no interactive effects of the tasks 
variable with anxiety or jointly with anxiety 
and intelligence were obtained. 
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stronger incorrect than correct habits 
in low-intelligence Ss. Therefore, a 
high level of anxiety was expected to 
yield Jess proficient concept formation 
than low anxiety for these Ss. This 
hypothesized ^ interaction between 
anxiety and intelligence has not been 
obtained in other studies of the effects 
of these variables on concept formation 
(Holbrook, 1954; Williams & Corder, 
1957) because an appropriate orthog- 
onal experimental design has not been 
employed. 


METHOD 


Subjects and design—The Ss were male 
college students enrolled in introductory psy- 
chology at Duke University. Four groups 
of 14 Ss each represented the four possible 
combinations of high and low anxiety with 
high and low intelligence. Anxiety level was 
defined by scores on the Taylor (1953) 
Manifest Anxiety scale (MA scale), and in- 
telligence level by summed scores on the 
Verbal and Mathematical aptitude tests of 
the College Entrance Examination Board 
Scholastic Aptitude Test (CEEB). After 
Ss with scores of 8 or above on the Lie scale 
of the Minnesota Multiphasic Personality 
Inventory were eliminated, Ss with MA 
scale scores of 20 or above (approximately 
the upper quartile) were designated high 
anxiety, and those with scores of 8 and be- 
low (approximately the lower quartile) were 
designated low anxiety. Volunteer Ss were 
solicited from these groups for the experi- 
ment. Following the experiment, the high- 
and low-anxiety groups were each split into 
high- and low-intelligence groups by dividing 
them at the median CEEB score (Mdn= 
1156) for the total experimental sample. 
The mean MA scale and CEEB scores for 
the four S groups are shown in Table 1, in 


TABLE 1 


MEAN MA SCALE AND CEEB SCORES FOR THE 
Four S Groups 


MA 
Scale 


High Anxiety—High Intelligence | 23.9 
High Anxiety—Low Intelligence | 23.7 
Low Anxiety—High Intelligence | 5.4 
Low Anxiety—Low Intelligence 4.1 


Group CEEB 


1270 
1033 
1270 
1042 
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TABLE 2 
CoxcEPT-FORMATION TASK AND CORRECT CONCLUSIONS 


Task 


Conclusions 


Instance 
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which it can be seen that the corresponding 
anxiety and intelligence groups were well 
matched. 

Task and procedures.—The concept-forma- 
tion task is shown in Table 2. The left half 
of this table shows the task as seen by Ss 
(the right half shows the correct conclusions 
about the concept as explained below). 
Each of the rows, 1 through 9, was defined as 
an instance. Each instance consisted of eight 
positions, lettered A through H. At each 
position a symbol appeared, either line (/), 
star (*), or colon (:). The lettered posi- 
tions were defined as the attributes, and the 
symbols were the values of the attributes. 
Thus, each instance contained one value for 
each of the eight attributes. Familiar sym- 
bols and letters were used for the values and 
attributes so that the task would require 
minimal perceptual skills. Each instance 
was also marked at the left margin to indi- 
cate whether it was a positive instance (+ 
sign) or a negative instance (— sign). 

Pilot work was done to select a task of 
appropriate complexity by varying: (a) total 
number of attributes, (b) the proportion of 
attributes included in the concept, and (c) 
the number of attributes in each instance 
whose values differed from those in the first 
instance. The levels of these variables 
chosen for the experimental task, 8, .5, and 
2, respectively, were those which yielded 
fairly high proficiency scores for high-intel- 
ligence pilot Ss and fairly low proficiency 
scores for low-intelligence pilot Ss. 

The instructions to Ss began with state- 
ments designed to induce a mild stress by re- 
lating the investigation to success in college 
courses, since Spielberger and Smith (1966) 
had shown that anxiety measured by the MA 
scale influenced performance in a serial- 
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verbal learning task only under such condi- 
tions. The remainder of the instructions 
contained definitions and examples of at- 
tributes, values, positive and negative in- 
stances, and concepts. In addition, Ss were 
told that only one value of each attribute was 
included in the concept, but not how many 
attributes were included. Jj 

The task was initially presented to S with 
the first instance exposed and Instances 2 
through 9 hidden by a cover card. The 
first instance was designated a positive in- 
stance by E. After S had inspected the 
first instance as long as he wished, E ex- 
posed the second instance by sliding down the 
cover card. Each successive instance was 
presented in this manner. In order to 
reduce S’s memory load (Cahill & Hovland, 
1960), previously presented instances were 
left in view. 

Task information and dependent variables. 
—The single concept to be formed by all Ss 
contained four attributes with one value 
specified for each attribute; this concept was 
B (/), E C), F (*), H (c). The task 
provided information enabling S to discover 
the specific attributes which were in the 
concept? This information was of two 
kinds; (a) each instance was marked either 
positive or negative, and (b) a particular 
value for each attribute appeared in each 
instance. The specific aspect of the latter 
information which was useful for concept 
formation was the difference in the values 
of attributes between each instance and the 
first instance. In the present task, the 


3 The value of each attribute in the concept 
could be readily identified from any positive 
instance. The Ss’ proficiency in doing $0 
was not studied in this experiment. 
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values of two attributes differed between 
the first instance and each successive in- 
stance. For example, it can be seen in the 
left half of Table 2 that Instance 2 differed 
from the first instance with respect to the 
values of attributes D and E. Instance 3 
differed from the first instance with respect 
to the values of two other attributes, B and 
G, but did not differ with respect to At- 
tributes D and E. The rest of the task was 
similarly constructed. 

From these two kinds of information it was 
possible to derive correct conclusions about 
the concept by deductive reasoning. There 
were three kinds of ‘correct conclusions: (a) 
that an attribute was included in the concept 
(symbolized I); (b) that an attribute was 
not included in the concept (symbolized N) ; 
and (c) that insufficient information had 
been given to determine whether an attribute 
was or was not included (symbolized ?). 
Each instance contributed relevant informa- 
tion so that, following the exposure of any 
given instance, correct conclusions for each 
attribute were derivable from the informa- 
tion contained in all of the exposed instances. 
These correct conclusions are indicated in 
the right half of Table 2. Each row of this 
table shows the correct conclusions derivable 
jointly from the information provided by the 
instance shown to its left and all preceding 
instances. It can be seen that in the early 
instances insufficient information was pro- 
vided to enable S to deduce either that an 
attribute was included or not included in the 
concept; therefore the correct conclusion 
was (?). In later instances each attribute 
was shown to be either included (I) or not 
included (N) in the concept. 

The Ss were required to decide, for every 
attribute after the presentation of each in- 
stance except the first one, either: (a) that 
the attribute was included in the concept (1) ; 
(b) that it was not included in the concept 
(N) ; or (c) that they did not know whether 
or not it was included (?). The Ss wrote 
their conclusions on a record sheet using I, 
N, and ? as responses. Each conclusion 
formed by S' was scored correct or incorrect, 
by virtue of its correspondence to the correct 
conclusion derivable for the attribute in 
question following the presentation of the 
particular instance. The number of correct 
conclusions made by S was the main de- 


*'The use of differences in the values of 
two attributes was suggested by Bruner's 
focus gambling strategy (Bruner, Goodnow, 
& Austin, 1956; pp. 89-90.) 
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pendent variable for the-study. These were 
counted for the whole task, excluding the 
first instance, yielding scores which could 
range from 0 to 64. An additional dependent 
variable was S’s best guess about the correct 
concept at the end of the task; the number 
of Ss in each group who formed the correct 
concept was counted. 


RESULTS 


Figure 1 shows the results for num- 
ber of correct conclusions. The points 
on the graph indicate the mean number 
of correct conclusions drawn by each of 
the four groups. It can be seen that 
the expected interactive influence of 
anxiety and inteligence on concept 
formation was obtained: the high- 
anxiety-high-intelligence group made 
more correct conclusions than the 
low-anxiety-high-intelligence group, 
whereas the high-anxiety-low-intelli- 
gence group made fewer correct con- 
clusions than the low-anxiety-low-in- 
telligence group. Expressed more 
simply, intelligence was a crucial vari- 
able only for high-anxious Ss. These 
data were evaluated by an analysis of 
variance in which the Anxiety X In- 
telligence interaction was significant, 
F (1, 52) — 7.09, p « .01. These find- 
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sions formed by each of the four experimental 
groups. 
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TABLE 3 


FREQUENCY OF Ss IN Each Group FORMING 
CORRECT AND INCORRECT CONCEPTS 


Low Intelligence | High Intelligence 


Cor- | Incor- Cor- Incor- 

rect rect rect rect 
High Anxiety o 8 13 1 
Low Anxiety | 12 2 li 3 


ings may be interpreted as showing 
that the effects of anxiety upon concept 
formation depended upon S's intelli- 
gence level: high anxiety facilitated 
concept formation for high-intelligence 
Ss and impaired it for low-intelligence 
Ss. The only other significant result 
was that high-intelligence Ss formed 
more correct conclusions about the con- 
cept than did those of low intelligence, 
F (1, 52) = 7.74, p < .01, a finding 
consistent with many previous studies. 

The above results which were based 
upon a novel measure of concept-for- 
mation proficiency were compared to 
those for a more conventional measure, 
the number of Ss forming the correct 
concept. The results given in Table 3 
show an interaction of anxiety and in- 
telligence similar to, but not so clearcut 
as that obtained for number of correct 
conclusions. The difference between 
the two high-intelligence groups was 
quite small, and the results, when statis- 
tically evaluated by a general partition 
of x? (Sutcliffe, 1957), yielded an in- 
teraction of anxiety and intelligence at 
a lower level of significance, x*(1) 
= 6.09, p < .025, which should be in- 
terpreted conservatively since 50% of 
the cells had expected frequencies of 
3.5. These data yielded no other sig- 
nificant effects. It was concluded that 
this conventional dependent variable 
was less sensitive to variations in pro- 
ficiency as a function of anxiety and 
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intelligence than was the novel depend- 
ent variable developed for this study. 


Discussion 


In the present study, anxiety and intel- 
ligence had interactive effects upon con- 
cept-formation proficiency. This interac- 
tion was interpreted to show that high 
anxiety facilitated concept formation for 
high-intelligence Ss and impaired it for 
low-intelligence Ss, The relationship be- 
tween anxiety, intelligence, and concept 
formation was reflected in its most dra- 
matic form in this study: Anxiety had 
opposite effects for high- and low-intelli- 
gence Ss. However, it must be recog- 
nized that the relation is conditioned by 
another variable, task complexity, which 
in this study was held constant at a 
level designed to elicit the opposing effects 
which were obtained. 

An integrated theoretical analysis of 
task complexity and intelligence as they 
influence concept-formation habits is sug- 
gested by the findings of the present 
study. It would seem that task complex- 
ity specifies the stimulus demands placed 
upon the organism, whereas intelligence 
refers to the organism’s capacity to 
learn habits useful for responding to the 
stimuli of the task. These two variables, 
along with others such as amount of 
practice in forming concepts, determine 
the relative strengths of the correct and 
incorrect concept-formation habits elicited 
within S in a particular experimental 
context. In general a lower level of 
task complexity and higher intelligence 
will favor the development of correct 
habits, and vice versa. These ideas are 
helpful in relating the findings of the 
present experiment to previous research, 
in that the facilitating effects of anxiety 
suggested by at least one investigator 
(Wesley, 1953), and the debilitating ef- 
fects found by others (e.g, Seigman, 
1956), might be expected if the concept- 
formation tasks used in these studies 
varied in complexity, or if the populations 
of Ss used differed in intelligence. The 
fact that most anxiety and concept-forma- 
tion studies have reported debilitating ef- 
fects of anxiety suggests that the tasks 
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used were probably relatively difficult for 
the experimental Ss employed. 

The application of anxiety theory to 
concept formation in this study was 
couched primarily in behavior-theory 
terms. Certain limitations of such a 
theory for studies of concept formation 
were discovered when subsidiary analyses 
were performed, based upon an informa- 
tion processing model (Hovland, 1952; 
Hunt, 1962), according to which the task 
provides information input which is 
transformed by S into conclusions about 
the concept. Following this model, Ss’ 
erroneous conclusions were divided into 
two classes: (a) errors from complete in- 
formation processes, which occurred when- 
ever one conclusion relating an attribute 
to the concept (an I or N conclusion) had 
been established as correct by the infor- 
mation given, and another such conclusion 
had been drawn erroneously by S (ie. 
correct conclusion, I, and S's conclusion, 
N, or the reverse) ; (b) errors from in- 
complete information processes, which 
were all others. 

An analysis of variance for errors from 
complete information processes was per- 
formed, following a square-root trans- 
formation of scores to improve homo- 
geneity of variance. This revealed a sig- 
nificant interaction of anxiety and intel- 
ligence, F (1, 52)= 892, p «.05, very 
similar to that found for number of cor- 
rect conclusions: high anxiety was asso- 
ciated with fewer errors for high-intelli- 
gence Ss and more errors for low-intelli- 
gence Ss. However, the corresponding 
analysis of errors from incomplete infor- 
mation processes revealed no such inter- 
action, nor a main effect of anxiety. 
Thus, anxiety effects consistent with the 
extended anxiety theory used in this 
paper were found for complete informa- 
tion processes but not for incomplete ones. 
Spencian anxiety theory was conceived 
for situations which could be described 
as complete information processes, in that 
stimuli useful for eliciting a particular 
response were presented and definite re- 
sponses to the stimuli were required from 
S. However, in many concept-formation 
tasks, incomplete information processes 
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also occur because stimuli are given 
which contain no information useful for 
forming the concept, or Ss are permitted 
to make noncommittal responses such as 
the ? response in this experiment. 
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THE von RESTORFF EFFECT IN SERIAL LEARNING: 


SERIAL POSITION OF THE ISOLATE AND 
LENGTH OF LIST* 


JOHN P. McLAUGHLIN 
New York University ? 


Nature of effects in serial learning of an isolated item (I) were 
examined as list length and serial position of I were varied. 180 
soldiers learned series of 3, 7, 11, and 15 items with single Is and 
corresponding control items (CC) in varying serial positions to 3 
errorless repetitions. Significant facilitation (lower errors) of I 
relative to CC occurred, but did not vary in magnitude with 
independent variables. Items immediately following I and all other 
items in I’s series were facilitated by penultimate Is. The concept of 
generalization interference reduction does not adequately account 


for isolation effects. 


The von Restorff effect is the facili- 
tation of the learning or retention of 
an item due to its difference on some 
dimension from surrounding items 
(von Restorff, 1933). The different, 
or isolated (1), item was thought by 
von Restorff to act as a figure on the 
ground of surrounding, or nonisolated 
(NI), items. Gibson (1940), on the 
other hand, accounted for the phenom- 
enon by postulating that the I item is 
removed from generalization interfer- 
ences due to its unique properties, and 
therefore is learned more quickly than 
NI items. 

Newman and Saltz (1958) attempted 
to test Gibson’s hypothesis in a serial- 
learning paradigm. The isolating di- 
mension in their study was meaning- 
fulness, defined by the m values re- 


1 Based on a dissertation submitted to the 
faculty of the Graduate School of Arts and 
Sciences of New York University in partial 
fulfillment of the requirements for the PhD 
degree. The author wishes to express his 
thanks to Edith Neimark for her encourage- 
ment and direction during the entire research 
effort. A portion of these data was re- 
ported at the 1965 Convention of the Eastern 
Psychological Association at Atlantic City, 
New Jersey. 

? Now at the University of Delaware. 
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ported by Noble (1952). A low mean- 
ingful item was placed in the seventh 
position of a 13-item list. The re- 
maining 12 items had high 77 values, as 
did all 13 items of a control list. They 
hypothesized that reduction of response 
generalization for the I item should 
result both in facilitation of the I item 
as a correct response to its effective 
stimulus, and in reduction of the fre- 
quency with which the I item is given 
as a response to a wrong stimulus, i.e., 
an intrusion error. Newman and 
Saltz (1958) found facilitation of the 
I item as a response, but reported that 
the I item occurred more frequently as 
an intrusion error than a control item 
in the corresponding position of the 
control list. 

They also predicted facilitation of 
the item immediately following the I 
item in the series (FI). Gibson’s hy- 
pothesis suggests that the I item, as a 
stimulus, is less affected by generaliza- 
tion, and therefore the response to that 
stimulus would be facilitated. This 
prediction was not strongly supported 
by the data (Newman & Saltz, 1958). 
The difference in errors between the FI 
items and their control counterparts 
reached .05 significance only in a one- 
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tailed test. The prediction of this ef- 
fect is based on the assumption that 
the I item functions as the effective 
stimulus for the FI item. The more 
recent view of serial learning is that 
the effective stimulus, at least in the 
middle of the series, is not the preced- 
ing item in the series, but rather, posi- 
tion in the series (Ebenholtz, 1963; 
Young, 1962). Since Newman and 
Saltz placed the I item in the middle 
of the series, their prediction of FI 
facilitation from Gibson’s hypothesis of 
stimulus-generalization reduction seems 
inappropriate. If, as Young (1962) 
suggested, the preceding item is an 
effective stimulus at the extremes of 
the series, and if the I item were 
located in the extreme portion of the 
series, the prediction would be appro- 
priate. 

Newman and Saltz (1958) also 
theorized that the reduction of generali- 
zation interferences for the I item 
should result in fewer errors to the NI 
items in the list, since there were fewer 
interfering tendencies in the series as 
awhole. This prediction was not con- 
firmed, i.e., no difference in total errors 
was detected between experimental and 
control NI values. 

Wishner, Shipley, and  Hurvich 
(1957) reported that the serial-position 
curve of errors for a list containing 
seven consecutive black items and 
seven consecutive red items showed 
two maxima at approximately the mid- 
dle of each half of the series. No evi- 
dence of this type of error pattern was 
obtained by Newman and Saltz (1958). 
An I item in the middle of a series did 
not have the effect of dividing the 
series into functional halves. 

There are two methodological fac- 
tors which limit the generality of the 
Newman and Saltz findings. First, 
their use of the meaningfulness dimen- 
sion to define isolation raises some 
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questions of interpretation. Since 
items that are low on the 5 scale are 
difficult to learn, their data may be 
viewed as the result of two opposing 
effects, the facilitative effects of isola- 
tion and the inhibitory effects of low 
meaningfulness. 

Secondly, the degree of isolation, i.e., 
the amount of difference between the I 
and NI items, was not varied. The 
magnitude of isolation effects seems 
dependent on the amount of difference 
between the I item and the NI items 
in paired-associate learning (Erickson, 
1963). It seems possible that the na- 
ture and extent of isolation effects in 
serial learning may be dependent on the 
degree of enhancement of the I item. 

To avoid the problems of interpreta- 
tion raised by the low meaningfulness 
of the I item in the Newman and Saltz 
studies, a different, and more common, 
isolating operation was employed. In 
the present investigation the size, 
shape, and color of the I items were 
different from those of the NI items. 
This operation provides a discriminable 
difference among I and NI items, but 
does not encounter the problems inher- 
ent in the definition of isolation used 
by Newman and Saltz. The tech- 
nique permits the use of the same item 
as I item and as control and, in addi- 
tion, permits the selection of all items 
from a narrow range of association 
values. 

To manipulate the degree of isolation 
of the I item from the NI items, two 
independent variables, (a) number of 
NI items in a series and (b) serial 
position of the T item and its control 
item, were chosen. In Gibson's theory; 
the difficulty of an item in a series 
increases directly with the number of 
response-generalization interferences 
operating at that item’s position in the 
series. An I item, however, it not 
prey to these interferences. There- 
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fore, the amount of facilitation of an I 
item in a given serial position, relative 
to a control item in the same position 
of another list will increase as the num- 
ber of interferences at that position is 
increased equally for the two lists. 
Facilitation of the I item, then, should 
be greatest when it is located in the 
middle portion of a series, and when it 
appears in a long series of items. 

Some predictions drawn by Newman 
and Saltz (1958) remain appropriate 
for the present study. Response-gen- 
eralization reduction predicts that the 
I items will be facilitated in serial 
learning and that they will appear less 
frequently as intrusion errors than con- 
trols. The concept of stimulus-gener- 
alization reduction predicts that the FI 
items will be facilitated relative to their 
controls, if the I item is located near 
either extreme in the series. The 
prediction does not apply to conditions 
in which the I item appears in the mid- 
dle of the series. The NI items in 
experimental lists should be facilitated 
over controls since the presence of an 
isolate has reduced the absolute num- 
ber of interfering associations in the 
series. The predictions added in the 
present study are that (a) the effects 
of isolation will become stronger, i.e., 
the I, FI, and NI items will be facili- 
tated to a greater degree relative to 
controls, as the number of nonisolates 
is increased, and (b) the effects of iso- 
lation on the I item will be stronger 
when the I item appears in the middle 
portions of the series than at the begin- 
ning or end portions. 


METHOD 


Learning materials—The items were 15 
consonant-vowel-consonant (CVC) trigrams 
drawn from the 27-36% association-value 
categories of Archer’s (1960) standardiza- 
tion. No two items began or ended with the 
same consonant, 
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Experimental lists of 3, 7, 11, and 15 items 
were composed so that they included one 
trigram typed in red, upper-case letters (I 
item) and an appropriate number of trigrams 
typed in black, lower-case letters (NI items). 
The I item was placed in the middle position 
of the three-item lists; in the second, middle, 
and penultimate positions of the 7- and 15- 
item lists; and in the second, fourth, sixth, 
eighth, and tenth positions of the 11-item 
list. Control lists, identical in all respects 
to the experimental lists, except for the print 
characteristics of the I item, were also 
typed. In these control lists, the I item 
was printed in black, lower-case letters, and 
thus became a critical control (CC) item. 

Thus the experiment contained 12 experi- 
mental conditions, defined by the serial posi- 
tion of the I item and the number of NI 
items in the series, and 12 corresponding 
control conditions. To control for possible 
idiosyncratic effects of specific items, five 
trigrams were used as I or CC items in each 
condition. Within a given condition, differ- 
ent lists were randomly drawn from the 
original 15 items for each specific critical 
trigram. To select the 3-, 7-, and 11-item 
lists, an I item was determined, and an ap- 
propriate number of NI items was randomly 
selected from the remaining 14 items. All 
of the original 15 trigrams were included in 
each random order of the 15-item list. 

Subjects and procedure.—One hundred and 
eighty United States Army enlisted men, 
students at an Army medical training center, 
volunteered as Ss. Each individual had 
been examined for color blindness with the 
American Optical Company test (Hardy, 
Rand, & Rittler, 1946) during induction 
physical examinations. These data, ob- 
tained from medical records, were used to 
exclude from the study those with detectable 
color anomalies. 

Each condition was assigned 10 experi- 
mental and 5 control Ss. Ten random orders 
of the experimental conditions and 5 orders 
of the control conditions were chosen. One 
control S' was tested for every 2 experimental 
Ss, to space the control-data collection over 
the entire data-collection period. The as- 
signment of individuals to conditions was 
determined by the order in which they ap- 
peared for the experiment. In each condi- 
tion, 2 experimental Ss and 1 control S were 
assigned to each specific critical trigram. 

Each S was instructed to learn his list by 
the anticipation method, and to respond by 
spelling the items. Initially, each S learned 
a practice list of four common three-letter 
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Fic. 1. Mean errors as a function of item 


serial position in 3-, 7-, 11-, and 15-item 
experimental (solid line) and control (broken 
line) lists. (The I- and CC-item positions 
are marked by arrows. The number of 
points represented on each function indicates 
the length of the series.) 


words to a criterion of three consecutive 
errorless repetitions. If S had no questions 
after the practice list, he then learned the 
appropriate experimental or control list to 
the criterion of three consecutive errorless 
repetitions. The 3-sec. cycle of the Ger- 
brands memory drum was used to present 
the items, which were typed on endless paper 
tapes. The intertrial interval was 6 sec. 


The Ss’ responses were recorded manually 
by E. 


RESULTS 


The effects of isolation on the learn- 
ing of the I items, FI items and the 
NI items will be considered separately. 
Figure 1 presents the mean number of 
errors for 3-, 7-, 11-, and 15-item lists 
as functions of the serial positions of 
the list items. On each set of axes, 
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curves for (a) an experimental list, 
and (b) its corresponding control list 
are plotted. The data were examined 
to determine (a) if the items of the 
experimental lists were affected by the 
presence of an I item, and (b) if the 
degree of facilitation varied as a func- 
tion of list length and the serial position 
of the I items. Since the means and 
standard deviations of the data were 
proportional, a logarithmic transforma- 
tion of the error scores was calculated 
by the formula: T = log (E + 1), in 
which T signifies the transformed score 
and E represents the raw error score, 

I item.—The presence of facilitative 
effects on the learning of the I item 
was clear. An analysis of variance 
of the mean transformed scores for the 
I and CC items in the second, middle, 
and penultimate positions of the 7-, 11-, 
and 15-item lists revealed a significant 
facilitative effect of isolation, F (1, 
117) = 40.16, p < 001. No signifi- 
cant first- or second-order interactions 
were obtained in this analysis. The 
absence of significant interaction be- 
tween isolation and serial position, F 
(2, 117) = 1.89, p > .05; between iso- 
lation and list length, F (2, 117) = 
0.58, p> .05; or between isolation, 
serial position, and list length, F (4 
117) = 0.38, p > .05, suggests that the 
magnitude of facilitation of the I item 
did not vary as a function of the inde- 
pendent variables. 

The indication that changes in the 
serial position of the I item did not 
affect the magnitude of facilitation. of 
the I item was pursued by examining 
separately the effects of isolation in the 
ll-item list. An analysis of variance 
of the transformed scores for the I and 
CC items located in the second, fourth, 
sixth, eighth, and tenth positions of the 
li-item lists indicated that facilitation 
of the I item occurred, F (1, 65) — 
20.77, p < .001. The interaction be- 
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tween isolation and serial position, 
however, remained insignificant, F (4, 
65) = 0.06, p > .05. 

The lack of effect of list length on 
the magnitude of facilitation of the I 
item was also further investigated. An 
analysis of variance of the transformed 
scores for the I and CC items appear- 
ing in the middle positions of the 3-, 
7-, 11-, and 15-item lists indicated that 
the I item was facilitated, F (1, 52) 
= 11.90, p< .01. The magnitude of 
the facilitative effect, however, did not 
vary detectably with list length, F (3, 
52) = 0.16, p > .05. 

The I item appeared as an intrusion 
very infrequently and was not analyzed. 
Fifty-eight percent of experimental Ss 
and 59% of control Ss did not commit 
any errors of this type. 

FI items.—The effects of isolation 
on this item were less clear than those 
on the isolated item itself. When the 
transformed scores for the items fol- 
lowing those isolates located in the 
second, middle, and penultimate posi- 
tions of the 7-, 11-, and 15-item lists 
were compared to their corresponding 
control items, the main effect of the im- 
mediately preceding I item failed to 
reach significance, F (1, 117) = 3.36, 
p> .05. The effects of isolation did, 
however, interact significantly with 
those of serial position, F (2, 117) 
= 3.09, p<.05. Inspection of the 
functions in Fig. 1 indicates that the 
FIitem was maximally facilitated when 
the I item was located in the penulti- 
mate position of the series. When 
the data for all ll-item lists were 
examined separately, the effects were 
reversed. The main effect of isolation 
did reach significance, F (1, 65) = 
4.92, p < .05, but no interaction be- 
tween isolation and serial position was 
detected, F (4, 65) = 170, p> .05. 
These data, however, indicate a trend 
toward an increased effect at the end 
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position of the list, i.e., the same trend 
in the 3 x 3 x 2 analysis. 

NI items.—The presence of an I 
item in a list did not seem to have a 
detectable effect on the distribution of 
errors for the remaining items in the 
series, but did seem to affect the 
amount of errors to those items in some 
conditions. The effects of isolation 
were first examined separately in each 
condition. Data for the I item and the 
FI item and their controls were ex- 
cluded, and the transformed scores for 
the remaining serial positions were 
analyzed. The data for each pair of 
experimental and control lists were 
analyzed separately in a Positions X 
Groups design. In these 11 independ- 
ent comparisons, only one such analy- 
sis [7-item list, critical item position 
(CIP) — 6] detected a facilitative ef- 
fect of isolation, F (1, 13) — 479, p 
<.05. No other analysis revealed any 
effects of isolation on the NI items of 
the series. With one exception in 
these analyses then, the presence of an 
I item did not result (a) in an overall 
decrease or increase in the number of 
errors to NI items, or (b) in a change 
in the pattern of errors at the NI item 
serial positions. 

Figure 1 indicates that the experi- 
mental list consistently received fewer 
errors than the control list when the I 
item was located in the penultimate 
position. While the individual analy- 
ses indicated that only one difference 
was significant alone, the apparent 
trend was pursued further. Excluding 
errors to the I and CC items, mean 
errors per item were calculated for 
the experimental and control lists in 
the conditions in which I and CC items 
occurred in the second, middle, and 
penultimate positions of the 7-, 11-, 
and 15-item lists. An analysis of these 
data indicated that there was no sig- 
nificant difference between the treat- 
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ment groups, ie. experimental and 
control lists, F (1, 117) = 0.08, p 
2.05. The interaction, however, be- 
tween treatment groups and I-item 
position groups did reach significance, 
F.(2,117) = 3.32, p< .05. The re- 
sults suggest that isolation of the pen- 
ultimate item of a serial list reduces 
errors for the remaining items in the 
series relative to a control series. 

Smith and Stearns (1949) reported 
that experimental Ss reached a cri- 
terion of two consecutive errorless 
repetitions more quickly than control 
Ss. This effect was not discovered in 
the data of the present experiment. An 
analysis of variance was performed on 
the mean trials to criterion per item 
for criteria of one, two, and three con- 
secutive errorless repetitions of the 
list for the 7-, 11-, and 15-item lists. 
No significant reduction in number of 
trials to criterion for the experimental 
groups was noted, F (1, 129) — 075, 
p> .05. 


Discusston 


The data of this experiment yield in- 
formation for two general problems: (a) 
The extent of isolation effects, and (b) 
the magnitude of isolation effects. The 
findings do not provide extensive support 
for Gibson’s (1940) explanation of the 
von Restorff effect. 

Extent of isolation effects —The nature 
of isolation effects on the FI items is 
somewhat ambiguous. The effect, how- 
ever, seemed strongest when the FI item 
ended the series. The prediction derived 
from Gibson’s (1940) hypothesis that 
the FI item would be facilitated when the 
I item appears in the extremes of the 
series, is supported by this trend, but is 
not supported by the absence of a similar 
effect when the I item appeared in the 
second position of the series. The effect 
on the FI items in the final position of 
the series, of course, may only reflect the 
observed effect of the I item’s location 


JOHN P. McLAUGHLIN 


in the penultimate position on all NI 
items. 

The facilitation of the NI items of the 
experimental series only when the I item 
is located in the penultimate position is 
difficult to explain. The prediction that 
reduction of response-generalization in- 
terference for the isolate would enhance 
learning of NI items by lowering the ab- 
solute number of such interferences in 
the series is not supported by the data. 
It might be expected that reduction of 
interferences would have greater func- 
tional significance for shorter lists, but 
no facilitation of NI items was found to 
be a function of list length in the analysis 
of mean errors per item. The effect 
seems specific to the position of the I item. 

Evidence for the type of restructuring 
of the list reported by Wishner et al. 
(1957) was not found in this study. It 
is apparent that the presence of a single I 
item does not result in a major reorgani- 
zation of the series to be learned, even 
when the I item occurs in the middle of 
the series. 

Magnitude of isolation effects on I 
items.—Some difficulty is posed for Gib- 
son's hypothesis by the absence of changes 
in degree of facilitation of the I item 
when its serial position and the series 
length is varied. If the I item is facili- 
tated because it is not subject to response- 
generalization interferences, the differ- 
ences between I and CC items should have 
increased as the number of interferences 
affecting that point in the series increased: 

It appears that Gibson's hypothesis 
does not adequately explain these results. 
Since no single model can explain the 
remainder of facts concerning the von 
Restorff effect (Wallace, 1965), it may 
be more appropriate to obtain more in- 
formation before engaging in additional 
theorizing. 
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RESISTANCE TO EXTINCTION AS A FUNCTION OF 


NUMBER OF NONREINFORCED TRIALS AND 
EFFORTFULNESS OF RESPONSE * 


A. GRANT YOUNG 
Ohio State University 


48 male hooded rats were conditioned to bar press in a modified 
Skinner box for sucrose. The design was a 3 X 2 factorial incorporat- 
ing 11-, 22-, or 44-gm. bar weightings, and 480 or 960 nonreinforced 
trials (PRF). There was also a 100% reinforcement (CRF) condi- 
tion constituting a 332 factorial incorporating the designated bar 
weightings and 960 or 1,920 total trials. Results showed that, for 
PRF-trained Ss with the lesser amount of training, resistance to 
extinction was a positive linear function of effortfulness of response. 
For PRF-trained Ss with the greater amount of training, resistance 
to extinction was a negative linear function of effortfulness of 
response, Differences in resistance to extinction between CRF-trained 
Ss were not significant. The results present difficulty not only for 
cognitive-dissonance theory but for others as well unless additional 


assumptions are made by these theories. 


Increased resistance to extinction 
following intermittent as compared to 
continuous reinforcement, termed the 
partial-reinforcement effect (PRE), is 
a well-established phenomenon and nu- 
merous theoretical explanations have 
been advanced to account for it (eg., 
Amsel 1958; Estes, 1959; Humph- 
reys, 1939). A recent theoretical for- 
mulation of the PRE is the cognitive- 
dissonance theory of Lawrence and 
Festinger (1962). This theory is of 
added interest because it offers predic- 
tions not generated by previous the- 
oretical formulations. 

Cognitive-dissonance theory main- 
tains that under conditions of partial 
reinforcement (PRF) during acquisi- 
tion a state of dissonance is aroused 
within the organism on each non- 
reinforced trial, and dissonance inten- 
sity increases with increased effort ex- 
pended in making the response. Dis- 
sonance is reduced by the organism 


1 This investigation was supported by the 
National Institutes of Health through Grant 
MH-08016 to R. C. Miles, 


610 


through the attribution to the environ 
mental situation of added values 0 
“extra attractions” which serve 
maintain responding when extinctio 
begins. Extra attractiveness beco 
attached to the environmental stimul 
as a positive function of dissonanc 
intensity and the total number 
dissonance-arousing occasions. 

Lawrence and Festinger thus p! 
dict that resistance to extinction W 
be a positive function of both the abs 
lute number of nonreinforced tria 
during PRF training, and of effort e 
pended in responding during PRF am! 
CRF training. 1 

The present study was designed. 
assess factorially the effects of numbe 
of nonreinforced trials and effortful 
ness of response on resistance íi 
extinction. 


METHOD 


Subjects—The Ss were 48 male hoode 
rats, 200-250 gm. in weight upon receipt 
the laboratory. 

Apparatus—The apparatus consisted | ‘ 
two identical operant chambers with Plexi d 
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glas walls (123 in. deep; 13 in. long; 9 in. 
wide), each enclosed in a ventilated sound- 
insulated box. Centered on the front wall 
of this box was a 1.12-w. light which was 
on at all times. A retractable metal bar (1 
in. wide; ł in. thick), adjustable so as to 
require 11, 22, or 44 gm. force to depress 
fully, was centered on the front wall 3 in. 
above the floor. The bar protruded into 
the box $ in. The liquid dipper, which dis- 
pensed .01 ml. of solution, was located 2 in. 
from the right edge of the bar, 13 in. above 
the floor, and protruded into the box $ in. 
All E-controlled events were operated by an 
electronic programming device and bar 
presses were automatically recorded on 
counters. 

Procedure—Upon arrival at the labora- 
tory, Ss were placed in individual cages and 
kept on an ad libitum food and water sched- 
ule for 4 days, after which Ss were placed 
on a food-deprivation schedule consisting of 
12 gm. Purina chow every 24 hr. This 
feeding schedule continued throughout the 
experiment. 

After a 5-day adaptation period to the 
daily feeding schedule, Ss began magazine 
training on a VI-30 sec. schedule. Experi- 
mental periods were 15 min. in length and 
continued for 4 days. Reinforcement was a 
40% liquid sucrose solution, On the four- 
teenth day all Ss were conditioned to bar 
press. Those Ss which were to receive 
training on the ll-gm. bar were allowed 
to make 50 reinforced responses with the 
bar weight set at 11 gm.; Ss which were to 
receive training on the 22-gm. bar were 
allowed to make five reinforced responses 
with the bar weight set at 11 gm. and 45 
reinforced responses with the bar weight 
set at 22 gm.; Ss which were to receive 
training on the 44-gm. bar were allowed to 
make five reinforced responses with the 
bar weight set at 11 gm., five reinforced 
responses with the bar weight set at 22 gm., 
and were finally conditioned and allowed to 
make 40 reinforced responses with the bar 
weight set at 44 gm. On the following day 
each S was allowed to make 160 continuously 
reinforced bar presses. 

On the sixteenth day Ss began acquisition 
trials on the assigned bar weight and rein- 
forcement schedule. Twelve different se- 
quences of the 50% reinforcement schedule 
were used and were varied daily. The Ss 
were randomly assigned to training condi- 
tions and each daily experimental session 
consisted of 160 bar presses. The bar was 
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never retracted except following the last bar 
press of the experimental session. 

On the day following completion of acqui- 
sition trials, extinction was started. During 
extinction the stimulus conditions were ex- 
actly the same as during acquisition with 
the exception that the dipper was inopera- 
tive. Extinction sessions were 15 min. in 
length and continued for 3 days. 

Design —The basic design was a 3x2 
factorial incorporating 11, 22, or 44 gm. 
required force on the bar to operate the 
dipper, and 480 or 960 nonreinforced trials. 
All Ss which received nonreinforced trials 
were on a 5096 reinforcement schedule. In 
addition, there was a 100% reinforcement 
condition constituting a 3X 2 factorial in- 
corporating bar weightings above and 960, 
or 1,920 total trials (the number of trials 
required under 50% reinforcement to achieve 
480 or 960 nonreinforced trials, respectively). 
The Ss were run in squads of 12, with a 
squad consisting of one S from each cell in 
the experimental design. 


RESULTS 


The total responses made by each 
S in extinction were recorded and the 
data for PRF-trained Ss were sub- 
jected to an analysis of variance, the 
results of which are summarized in 
Table 1. As shown therein, the Bar 
Weighting X Nonreinforced Trials in- 
teraction was significant beyond the 
.001 level, and therefore the data were 
subjected to analyses of variance for 
simple effects. 

Results of the analysis of variance 
for bar weighting at the two levels of 
nonreinforced trials (Table 2) indicate 


TABLE 1 


ANALYSIS OF VARIANCE OF RESPONSES 
IN EXTINCTION 


Source df MS F 

Bar Weighting (A)| 2 | 2409.50 
Nonreinforced 

Trials (B) 1 | 1584.37 
AXB 2 |43218.50 | 16.987*** 
Within 18 | 2544.18 
Total 23 

+p «c .001. 
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TABLE 2 


ANALYSIS OF VARIANCE OF EXTINCTION 
RESPONSES ron Bar WEIGHTING 


Source df MS E 


Bar Weighting for 
480 Nonrein- 


forced Trials 2 |25615.75 | 10.068** 
Bar Weighting for 

960 Nonreiu- 

forced Trials 2 | 20012.25| 7.865** 
Within 18 | 2544.18 

** p <.01, 


that bar weighting was significant be- 
yond the .01 level under both condi- 
tions. These data are presented in 
Fig. 1, which shows that at 480 non- 
reinforced trials the number of re- 
sponses in extinction was a positive 
linear function of bar weighting, 
whereas at 960 nonreinforced trials 
the number of Tesponses in extinction 
was a negative linear function of 
bar weighting. Comparisons between 
groups at 480 nonreinforced trials 
showed significant differences attribu- 
table to bar weighting—11 gm. vs. 22 
gm.: £ (6) = 2.070, p< .01; 22 gm. 
vs. 44 gm.: t (6) — 1.974, p < .05. 
Comparisons between groups at 960 
nonreinforced trials showed that only 
the 11-gm. vs. 22-gm. difference was 
significant—t (6) = 3.964, p < .01. 
Results of the analysis of variance 
for nonreinforced trials at the three 


Summen 9560 NONREINFORCED TRIALS 
79 480 NONREINFORCED TRIALS 


400 


Pe 


GRAMS BAR WEIGHTING 


Fic. 1. Mean responses in extinction 
for PRF-trained Ss. 
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TABLE 3 


ANALYSIS OF VARIANCE OF EXTINCTION 
RESPONSES FOR NONREINFORCED 
TRIALS 


Source df MS F 


Nonreinforced 

Trials for 11 gm. 

Bar Weighting 1 
Nonreinforced 

Trials for 22 gm. 

Bar Weighting 1 
Nonreinforced 

Trials for 44 gm. 

Bar Weighting 1 
Within 18 


57630,12 | 22.651*** 
10.12 


30381.12 
2544.18 


11.941** 


levels of bar weighting (Table 3) indi- 
cate that the effect of nonreinforced 
trials was significant beyond the .001 
level for the ll-gm. bar weighting, and 
significant beyond the .01 level for the 
44-gm. bar weighting. The effect was 
not significant for the 22-gm. bar 
weighting. These data are also pre- 
sented in Fig. 1. 

The data for CRF-trained Ss were 
subjected to an analysis of variance 
and the results showed that both main 
effects and the Bar Weighting x Total 
Trials interaction were not significant. 
These data are presented in Fig. 2, 
where responses in extinction are 
plotted against bar weighting by total 
trials. 


GRAMS BAR WEIGHTING 


Fic. 2. Mean responses in extinction 
for CRF-trained Ss, 
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Discussion 


An examination of the extinction data 
for the three groups of Ss which experi- 
enced 480 nonreinforced trials during 
acquisition clearly indicates that resist- 
ance to extinction was a positive linear 
function of the effortfulness of the re- 
sponse, This result is in agreement with 
the findings of Lawrence and Festinger 
(1962, Exp. 15), and supports one pre- 
diction of cognitive-dissonance theory. 

Not rigorously explained is the man- 
ner in which “extra attractiveness” is 
attributed to environmental stimuli asso- 
ciated with nonreinforcement, however 
Lawrence and Festinger maintain that 
these attractions build up slowly and con- 


.tinuously as a function of the number of 


dissonance-producing experiences. Yet 
inspection of the extinction data for Ss 
which experienced 960 nonreinforced 
trials during acquisition indicates that 
resistance to extinction was a negative 
linear function of effortfulness of the 
response, 

Cognitive-dissonance theory further 
predicts that increased work required in 
making the response during CRF train- 
ing will result in increased resistance to 
extinction. Inspection of Fig. 2 shows 
that the differences between CRF-trained 
Ss were small, and none of these differ- 
ences was significant. These results, 
again opposed to predictions of cognitive- 
dissonance theory, are in agreement with 
other findings (e.g., Aiken, 1957). 

The finding in this study that, for 
PRF-trained Ss, resistance to extinction 
was a positive linear function of effort 
with smaller amounts of training, and a 
negative linear function of effort with 
larger amounts of training is of critical 
importance in the evaluation of theoreti- 
cal treatments of the PRE, Clearly, 
cognitive-dissonance theory is unable to 
account for these results. Likewise, any 
hypothesis that relies on S’s ability to 
discriminate between acquisition and ex- 


tinction conditions (cf. Bitterman, Fed- 
derson & Tyler, 1953; Humphries, 1939) 
could not predict these results. Estes’ 
(1959) explanation of the PRE would 
not predict this effect nor would vari- 
ous other contiguity-type interpretations 
(eg. Hulse & Stanley, 1956). Specific 
predictions generated by conditioned 
frustration theory (Amsel, 1958; Spence, 
1960) would likewise fail to include these 
results. 

It appears that these data present dif- 
ficulty not only for cognitive-dissonance 
theory but for others as well unless addi- 
tional assumptions are made by these 
theories, 
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CONSTANT ORDER OF PAIRS IN THE PRESENTATION AND 
TESTING OF PAIRED ASSOCIATES 1 


LANCE CARLUCCIO anp ROBERT G. CROWDER 
Yale University 


60 Ss received 8 presentations of a list of 10 paired associates, On 
the 9th trial they attempted to anticipate the response terms. The 
design was a factorial combination of 2 levels of intralist stimulus 


competition and 3 conditions of presentation. 


Of 2 groups (for each 


level of competition) which saw the pairs in the same order on all 8 
presentation trials, one group was tested in that same order [S(S)] 
and the other group was tested in a different order [S(R)]; a third 
group received a different order on each of the 9 trials [R(R)]. The 
results showed a significant effect of stimulus competition but no 


effect of presentation condition and no interaction, 


These findings 


were generally consistent with those of prior investigations, 


Newman and Saltz (1962) and Martin and 
Saltz (1963) have investigated the effects 
of constant serial ordering of paired asso- 
ciates during a series of study trials upon a 
subsequent recall test. In these experiments 
S(S) groups were given five study trials 
with invariant serial order of pairs, then a 
recall test with the stimuli preserving the 
same order; S(R) groups differed from 
S(S) groups only in that the recall test 
did not preserve the order of pairs during 
study trials; and R(R) groups were given 
random orders during the study trials and 
still a different order during the test. Martin 
and Saltz (1963) showed significant transfer 
from the S(S) and S(R) conditions to a 
derived serial list of the response terms 
which maintained the constant order of the 
paired-associate study trials, Although these 
authors concluded that serial associations 
had therefore been learned during paired- 
associate training, Battig, Brown, and Nel- 
son (1963) failed to replicate such positive 
transfer among roughly comparable condi- 
tions. It is, however, with the effects of 
constant serial presentation upon paired-asso- 
ciate learning itself that the present study is 
concerned. Both Newman and Saltz (1962) 
and Martin and Saltz (1963) found no 
superiority of S(S) to S(R) groups, for 
example. Newman and Saltz had predicted 
a difference in this direction based on their 
use of highly confusing, unfamiliar stimulus 
terms, which “. , . would lead to consider- 
able difficulty in the learning of S-R pairs, 

>This investigation was conducted by the senior 


author, under the direction of the junior author, as 
part of a course requirement, 


and a consequent increase in the tendency for 
serial position to serve as a cue [Newman & 
Saltz, 1962, p. 103]. ^ Given a reliance 
on serial-position cues, their reasoning con- 
tinued, Ss would tend to reproduce the 
response terms in serial order during the 
test trials and the S(R) groups, for whom 
this serial order had suddenly become in- 
appropriate, would show increased errors 
over the S(S) groups. When this predic- 
tion failed, Newman and Saltz proposed that 
both groups actually had been able to use 
serial-position cues; during the test, they 
suggested, the S(R) group had followed a 
mediated association from the stimulus term 
to its former serial position and used this 
serial-position cue to arrive at the correct 
response. Newman and Saltz considered the 
10-sec. anticipation interval in their experi- 
ment to be adequate time for such mediation 
to occur and they predicted that an experi- 
ment with a shorter anticipation interval 
might be sensitive enough to detect the extra 
mediational step for S(R) groups. Thus, in 
a subsequent study, Martin and Saltz (1963) 
used a 2-sec. anticipation interval in recall. 
At the same time, however, Martin and Saltz 
introduced several other departures from the 
Newman-Saltz procedures in order to in- 
crease degree of paired-associate learning. 
Their decision to employ noncompeting stim- 
ulus terms was, in particular, a direct viola- 
tion of the “stimulus selection" logic of 
Newman and Saltz. The failure of Martin 
and Saltz, with the shorter anticipation in- 
terval and the altered stimulus materials, to 
find a significant difference between S(S) 
and S(R) groups must thus be regarded 


614 


ha | 


p^ 


l 
| 
| 
| 


SUPPLEMENTARY REPORTS 615 


as inconclusive since the low-stimulus com- 
petition would have reduced Ss' tendencies 
to use serial position as a cue. The present 
study was intended as a more adequate test 
of the Newman-Saltz hypothesis, through the 
comparison of S(S) and S(R) groups at a 
2 sec. recall-anticipation interval and with 
variation in stimulus-term competition. 

Another purpose of the present study was 
to resolve empirical inconsistencies among 
the Newman and Saltz (1962), Martin and 
Saltz (1963), and Battig et al. (1963) 
articles. In agreement with the earlier 
literature Newman and Saltz found constant 
serial-presentation conditions—S(S) and 
S(R)—to yield better recall than the condi- 
tion with random presentation and recall 
orders, (R(R)). They explained this su- 
periority by supposing that both serial-pres- 
entation conditions enjoyed a complex of two 
stimuli associated with each response—the 
S-term itself and its serial position. The 
presence of both stimulus components at 
recall would then allegedly produce higher 
recall probability than that of either one 
alone. (It should be kept in mind that 
serial-position cues were considered available 
to S(R) Ss through backward mediate as- 
sociations.) Yet Martin and Saltz found 
no superiority of S(S) and/or S(R) groups 
over R(R) groups; indeed, recall was non- 
significantly higher in the latter condition 
than in either of the former. In a series of 
five experiments comparing varied with con- 
stant ordering of paired associates, Battig 
et al. (1963) explored a wide range of 
methodological variables; they concluded that 
constant conditions generally facilitate 
paired-associate learning, but primarily late 
in learning and only for certain experimental 
procedures (see below). Battig et al 
favored a compound-stimulus explanation for 
this facilitation, as had Newman and Saltz, 
and suggested that intralist stimulus compe- 
tition should be directly related to the in- 
volvement of serial-position cues; however, 
their experiments included no deliberate vari- 
ation of competition. Thus, even the gross 
effects of presentation order and their 
dependence upon intralist competition are not 
yet entirely clear. 

Method.—Sixty Yale undergraduates 
served in six independent groups of 10 Ss, 
as defined by two levels of intralist stimulus 
competition and three schedules of presenta- 
tion—S(S), S(R), and R(R). All Ss re- 
ceived eight study trials followed by a single 
recall test. During study trials the list of 
10 pairs was projected onto the wall of the 


experimental room by a Graflex compact 
filmstrip projector, each pair appearing for 
2 sec., with no appreciable interpair interval 
and with a 5-sec. intertrial interval. Ap- 
proximately 75 sec. after the eighth study 
trial, the stimulus terms were projected alone 
at a 2-sec. rate and S was told to call out 
the associated response term before the ap- 
pearance of the next stimulus. Before the 
experiment began Ss were instructed on the 
nature of the task and told to pay no atten- 
tion to the order in which pairs occurred (as 
was the explicit instruction of both Newman 
& Saltz and Martin & Saltz). 

The response terms were the same for all 
Ss, and were 10 of the 12 dissyllabic nouns 
with the highest m value from Noble’s list 
(Underwood & Schulz, 1960). Stimuli in 
the high-competition conditions were CVC 
nonesense syllables containing only the letters 
X, V, F, Z, U, and Y; all had Glaze asso- 
ciation values of from 0% to 20% (Under- 
wood & Schulz, 1960). Stimuli in the low- 
competition lists were CVCs ranging in asso- 
ciation value from 67% to 87% with no 
duplication among either initial or terminal 
letters and with each of the five vowels used 
exactly twice. Within each constant-order 
presentation group, two different orders of 
items were used, while orders for the R(R) 
groups were chosen from the rows of a 
balanced Latin square so as to avoid repeti- 
tion of adjacent pairs across trials, 

Results—Table 1 displays the results of 
the present study along with those of New- 
man and Saltz (1962) and Martin and Saltz 
(1963). Although the pattern of means in 
the present data bear some similarity to the 
earlier data, a 2X3 factorial analysis of 
variance showed that only Stimulus Competi- 
tion was a significant source of variance, F 
(1, 54) = 90.09, p < .001. 

When percentage of total correct responses 
was calculated for each pair as a function 
of its "serial position" during presentation 
trials in S(S) and S(R) conditions, there 
was, in agreement with the report of Battig 
et al, no similarity to the classical serial- 
position curve under either level of competi- 
tion. The actual observed values were: 
9.56, 12.50, 9.56, 9.56, 11.03, 11.76, 10.29, 7.35, 
8.82, and 8.56 for low-competition Ss and 
8.57, 14.29, 14.29, 5.71, 11.43, 5.71, 14.29, 5.71, 
8.57, and 11.43 for high-competition Ss. 

Discussion.—The failure to show signifi- 
cant differences between the S(S) and S(R) 
conditions, particularly with high-stimulus 
competition, tends to argue against the 
original Newman-Saltz logic—that high com- 
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a Orders A and B combined and first retention-test data combined for Groups S(S)S(S) and S(S)S(R). 


petition would favor the "selection" of serial- 
position cues. This is a surprising conclu- 
sion, for the overall effect of stimulus compe- 
tition was very large and Underwood’s 
(1963) characterization of stimulus selection 
as dependent upon the discriminability of 
various elements of compound stimuli would 
seem to fit the present paradigm admirably. 
One possibility is that the backward media- 
tion proposed by Newman and Saltz and by 
Battig et al. can take place very much faster, 
in S(R) Ss, than was thought and that a 
briefer interval is necessary to detect it. To 
accept the equivalence of S(S) and S(R) 
conditions is to embrace the null hypothesis, 
of course, and in this case calls for more than 
usual reluctance, especially until further re- 
search determines whether S(S)-S(R) dif- 
ferences obtain under those experimental pro- 
cedures which Battig et al. found most 
conducive to serial-presentation effects. 

With respect to serial-presentation effects 
(ie. the comparison of S(S) and S(R) con- 
ditions with R(R) conditions) the present 
study supports the failure of Martin and 
Saltz to find significant differences and thus 
contrasts with the data of Newman and 
Saltz and also with certain subexperiments 
of Battig et al. where constant serial order 
facilitated learning. The strongest evidence 
Battig et al. presented for serial facilitation 
came from (a) their Exp. III, where CVC- 
CVC lists were learned to criterion by the 
“recall” method (alternating study and test 
trials), and (b) their Exp. V, where either 
CVC-CVC or shape-number lists were 
learned by the same method. Although these 
authors found borderline facilitation with 
certain other procedures, the present results, 
like those of Martin and Saltz, tend non- 
significantly towards the other direction, 
Battig et al. suggest that failure to show 


superiority of constant over varied-presenta- 
tion order may be attributed to (a) the use 
of a block of study trials with a single termi- 
nal test, or (b) the inadvertant choice of a 
particularly difficult serial order for constant- 
order conditions, or both. It is difficult to 
assess the importance of Factor b in the 
present experiment, although two different 
serial orders were used. Factor a is pre- 
sumably related to the finding of Battig et al. 
that facilitative effects occur primarily late 
in learning. Since procedures employing a 
fixed number of trials can hardly be expected 
to reveal processes active late in learning, 
this methodological consideration would ap- 
pear especially important. Generally, there- 
fore, the present data are quite consistent 
with those from comparable conditions in the 
Battig et al. study and with the empirical 
generalizations these authors draw concern- 
ing which experimental procedures militate 
against serial-presentation effects. The only 
qualification must be that the Newman and 
Saltz data violate these generalizations, since 
they found significant facilitation under con- 
ditions which have subsequently proved in- 
adequate to do so. 
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EFFECT OF INTERSTIMULUS INTERVALS AND REST-PERIOD LENGTH 


UPON HABITUATION OF TH 


JAMES H. 


E ORIENTING RESPONSE? 
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University of Pennsylvania 


The effects of 3 different interstimulus interval (ISI) lengths and 2 
different rest-period lengths upon habituation of the Orienting response 
(OR) were studied in a factorial design employing 84 Ss. GSRs toa 


series of tones were employed as the 


measure of the OR. Rest-period 


length did not effect OR habituation, while increased ISI length was 
associated with an increase in resistance of the OR to habituation. 
Latency, duration, and amplitude measures of the GSR were employed 


with amplitude being the most sensitive index of change. Spontaneous 


GSRs during rest were positively correlated with the number of ORs. 


Tt appeared that the relationship of 
increased with lengthened ISI. 


In recent years investigators in this coun- 
try have become more interested in what is 
generally called the Orienting response 
(OR). This interest stems in part from 
reports of Soviet investigators (e.g., Sokolov, 
1963) suggesting that many important rela- 
tionships exist among the OR, conditioning, 
discrimination, and other basic psychological 
phenomena. In this study two basic pro- 
cedural variables were examined using GSR 
as the index of the OR. One variable was 
the length of time S rested prior to institu- 
tion of experimental procedures. Two levels 
(5 and 15 min.) of rest-period length were 
employed; these values include the range of 
rest periods generally used in GSR studies. 
The second independent variable was inter- 
stimulus interval (ISI). Schaub (1965) and 
Coombs (1938), using independent groups, 
report that the GSR to simple stimuli habitu- 
ates more rapidly with decreased ISIs. This 
study attempts to extend the range and 
generality of their findings. 

The study also deals with measurement 
problems in GSR. Three measures of the 
GSR component of the OR will be evaluated ; 
(a) amplitude, (b) latency, and (c) dura- 
tion of the response. Further, the relation- 
ship of spontaneous GSRs to the OR will be 
investigated. 

Method.—Ninety-eight undergraduates from 
the State University of New York at Buffalo 
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the OR and spontaneous GSRs 


served as Ss. Of this group, 8 were dis- 
carded for procedural or equipment problems, 
and 6 were dropped from the analysis as 
fhey failed to produce any GSRs to the 
stimuli. 

The Ss were seated in an armchair in an 
8x13 ft sound-deadened room. The S 
wore a Lafayette Model F-767 stereo head- 
phone through which the stimuli—tones— 
were presented. A 15-w. ceiling light re- 
mained on at all times. Beckman biopoten- 
tial electrodes with Beckman electrode paste 
were attached to S's right palm and the 
ventral side of S's right wrist. GSRs were 
obtained from a Yellow Springs Instrument 
Company's dermohmmeter with its output 
recorded on an Esterline-Angus milliameter 
recorder. The stimuli were produced by a 
Huston Instrument Corporation Model H-1 
signal generator. Length of rest intervals, 
and ISIs were regulated by a Gerbands 
Model 1A program timer, and length of 
stimulus presentation was determined by a 
Hunter timer. Stimuli were 2-sec., 1,000-cps., 
square wave, 65-db. tones. 

The S was seated in the chair, recording 
electrodes were attached, a headset was put 
in place, and he was told that he was to 
relax but not fall asleep. The S was told 
that a rest period would precede the presenta- 
tion of some tones through the headset. The 
E then left the room and S remained alone 
throughout the study. A period of either 5 
or 15 min., elapsed following which the stim- 
uli were presented. Stimuli were presented 
until S yielded no GSR to the tone for three 
consecutive presentations or until a total of 
20 tones had been presented. 

The experiment was a 2 X 3 factorial with 
two levels of rest periods and three levels of 
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ISIs. One half of the Ss had a rest period 
of 5 min. while the remaining Ss had a rest 
period of 15 min. For one third of the Ss 
the ISIs were either 15, 20, or 25 sec. with 
a mean of 20 sec. For a second third of the 
Ss the ISIs were either 50, 60, or 70 sec. 
with a mean of 60 sec.; and for the remain- 
ing Ss the ISIs were either 95, 105, or 115 
sec. with a mean of 105 sec. 

Resulis—An OR was considered to have 
occurred if S's skin resistance showed a 
measurable decrease between 1 and 5 sec, 
after stimulus onset. Three measures of the 
GSR were obtained and analyzed. The first 
was the “latency” of the GSR and was the 
time, in seconds, from stimulus onset until 
the beginning of the GSR deflection. The 
second was the “duration” of the GSR and 
was the time, in seconds, from the beginning 
of the GSR until the point of maximum de- 
crease in skin resistance. Since Ss in the 
20-sec. ISI group had some ISIs of only 15 
sec., a maximum of 15 sec, was placed upon 
the duration measure. The final measure 
was the "amplitude" of the GSR. To 
obtain this measure the reciprocal of the skin 
resistance at point of stimulus onset was 
subtracted from the reciprocal of the skin 
resistance at the point of maximum deflec- 
tion. The resultant was multiplied by 10* 
and +1.0 was added. The log of that figure 
was then used as the amplitude measure, 
This provided a log conductance score for 
the measure of GSR amplitude. Each S's 
data was summed in successive blocks of 
four trials each. The value of 0 was as- 
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signed to any trial on which S did not re- 
spond. These sums were then analyzed by 
a2X 3X 5 analysis of variance with the five- 
level variable being trial blocks. For both 


the duration and the latency measures, only — 


the trials variable was significant. Latency 
increased over trials, F (4, 312) — 39.66, 
~<.001, and duration decreased over trials 
F (4, 312) = 1347, p < .001. 

The amplitude measure yielded a signifi- 
cant trials effect, F (4, 312) = 6675, p < .001, 
a significant ISI effect, F (2, 18) = 3.27, 
p <.05, and a significant interaction of Trials 
X ISI effect, F (8, 312) 2 251, p<.02, 
Figure 1 displays the data relating to the 
interaction of ISI with trials. It may be 
clearly seen that the mean amplitude of the 
GSR decreased over trials and that the 
decrease was more rapid with shorter ISIs. 
Rest-period length did not effect the GSR. 
measures employed in this study. 

The relationship between spontaneous 
GSRs and the OR was also investigated in 
this study. Spontaneous GSRs that occurred 
during the last 3 min. of the rest period were 
tabulated. A spontaneous GSR was defined 
as having two characteristics; (a) a change 
in skin resistance that exceeded 500 ohms, 
and (b) the skin resistance returned to the 
predeflection level. The number of spon- 
taneous GSRs was correlated with the num- 
ber of ORs that each S yielded. The corre- 
lation for all Ss was r = .22, p < .05. When 
the correlations were performed separately 
on the ISI groups, it was found that in- 
creased ISI length apparently was associated 
with increased correlation between spontane- 
ous GSRs and number of ORs. For the 
20-sec, ISI group the correlation was r 
= .001, for the 60-sec. IST group the corre- 
lation was r=.31, and for the 105-sec. ISI 
group the correlation was r= 44, p<.05. 
There were no Significant differences between 
the number of spontaneous GSRs among the 
three IST groups, 

Discussion —The data of this study demon- 
strate that within the limitations of the 
procedures employed, the length of rest inter- 
val employed prior to eliciting the OR has 
no demonstrable effect on OR habituation. 
It might have been expected that a longer 
rest period would have resulted’ in higher 
basal Tesistances as Ss relaxed or settled 
down in the situation. This was assessed by 
computing a # test between the mean pre- 
stimulus skin resistance on Trial 1 for the 5 
vs. the 15 min. rest groups. The resultant ¢ 
was less than 1 indicating that for the two 
different rest Periods no significant differ- 
cnces were found between mean basal skin 


SUPPLEMENTARY REPORTS 


resistances prior to the introduction of stim- 
uli, This finding might vary with different 
electrodes or different electrode paste. The 
data clearly corroborate other findings that 
shorter ISIs are associated with increased 
habituation of OR amplitude. Similar trends 
are noted in duration and latency measures of 
the GSR; however, they fail to attain ac- 
cepted levels of statistical significance. This 
argues for the use of the amplitude measure 
as the more sensitive index of the GSR 
component of the OR. 

The best-known theoretical model relating 
to the OR is Sokolov’s (1960) neuronal 
model. This model does not appear to pre- 
dict the findings of this study. By the same 
token, the results of this study are not in 
opposition to any of the model’s formulations. 
It would seem that any theoretical concep- 
tion of the OR should include an inhibitory 
concept implying that the OR is inhibited 
as a function of time since last stimulus 
presentation. Until a more complete analy- 
sis of the ISI effect is performed, any 
further specification of this inhibiting factor 
would be premature. It does suggest, how- 
ever, that the OR has properties that 
resemble conditioned responses. The data 
relating to ISIs resembles the finding that 
extinction of an OR occurs more rapidly 
under massed than distributed extinction 
trials (Reynolds, 1945). To determine what 
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extent if any OR extinction curves are 
confounded by OR habituation will have 
to await further study. 

The relationship of spontaneous GSR 
activity to the number of ORs elicited is of 
interest. While the relationship for com- 
bined S’s was low, separation into ISI groups 
suggested that the size of the correlation 
increased as the length of the ISI increased. 
At a simple descriptive level, it would appear 
that the factor that inhibits the OR with 
decreasing ISI lengths also attenuates the 
relationship of spontaneous GSRs to OR 
production. To the extent that both meas- 
ures (OR and spontaneous GSR) reflect 
activation, their relationship is consistent 
with theoretical predictions. 
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STIMULUS RECALL AND EXPERIMENTAL PARADIGM 1 


JOHN P. HOUSTON 
University of California, Los Angeles 


Following paired-associate learning conforming to 1 of 4 experimental 
paradigms (A-B, A-C; A-B, C-D; A-B, C-B; A-B, Rest) Ss were 


presented Ist-list responses and asked to recall lst-list stimuli. 


The 


results indicated that recall in the A-C paradigm did not differ from 
the Rest condition, that recall in the Rest condition was better than 
in the C-D condition, and that recall in the C-D paradigm was better 


than recall in the C-B paradigm. 


a significant variable. 


This experiment attempted to replicate, 
under slightly modified conditions, some of 
the findings reported by Keppel and Under- 

1This work was supported by research grants (MH 


10612-01, MH 11199-01) from the Nati itute 
of Mental Health, Public Health pee ue 


Stimulus meaningfulness was not 


These results are very similar to those reported 
by Keppel and Underwood (1962). 


wood (1962) in their study of first-list 
stimulus recall as a function of experimental 
paradigm. Following the learning of PA 
lists conforming to one of four paradigms 
(A-B, A-C; A-B, C-D; A-B, C-B; A.B, 
Rest) Ss were provided with first-list re- 
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sponses and asked to recall the first-list 
stimuli. Correct S-R pairings were not 
required. Two levels of stimulus meaning- 
fulness were used in each paradigm. Keppel 
and Underwood (1962) have examined each 
of these paradigms for the existence of 
extinctive A-B, A-C relationships which 
could contribute to losses in stimuli recall. 
The A-B, Rest paradigm is the control 
against which losses in the remaining para- 
digms were judged. In the A-B, C-D 
paradigm successive associations between the 
context and the stimuli (C-S) conform to 
the extinctive A-B, A-C relationship. 
Neither forward (S-R) nor backward (R-S) 
associations in this paradigm form the ex- 
tinctive relationship. The associations be- 
tween the experimental context and the re- 
sponses (C-R) do hold an A-B, A-C 
relationship to one another but, because first- 
list responses were provided during the re- 
tention test, this source of unlearning may be 
assumed to have been minimal in this study. 
Thus it appears that there is one factor, the 
A-B, A-C relationship between successive 
C-S associations, which should contribute to 
a decrement in first-list stimulus recall in 
the A-B, C-D paradigm. In the A-B, C-B 
paradigm the C-S associations once again 
conform to the extinctive relationship. In 
addition, successive R-S associations fit the 
A-B, A-C pattern, Because of these two 
sources of unlearning it was predicted that 
stimulus recall would be poorer in the A-B, 
C-B paradigm than in the A-B, C-D. In 
the A-B, A-C paradigm there appears to be 
one source of retroaction and one source of 
facilitation. Successive C-S associations in 
this paradigm form an A-B, A-B relation- 
ship. This might be expected to increase 
first-list stimulus recall. On the other hand 
successive S-R associations form an A-B, 
A-C relationship. To the extent that extinc- 
tion of forward first-list associations will 
affect recall of the first-list stimuli the facili- 
tative effect of identical successive C-S asso- 
ciations will be reduced. It was predicted 
that stimulus recall in this paradigm would 
be superior to recall in the control condition 
because S's were given unlimited recall time 
and were not required to pair the stimuli 
with the correct responses, In summary, the 
paradigms were expected to arrange them- 
ie i um of decreasing first-list stimu- 
us recall, in the followin, - 
C-D, C-B. g order, A-C, Rest, 
Method.—The Ss were 128 University of 
California. undergraduates whose participa- 
tion was in fulfillment of a course require- 
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ment. The eight conditions (two levels of 
stimulus meaningfulness in each of four 
paradigms) were unsystematically arranged 
16 times such that each occurred once in 
each successive block of eight conditions. 
The Ss were assigned to this arrangement 
as they arrived in the laboratory. Half the 
Ss in each condition learned PA lists com- 
posed of 8 nonsense syllables as stimuli 
paired with eight two-syllable adjectives as 
responses while the other half learned lists 
composed of eight paired adjectives. Two 
different sets of syllables and four sets of 
common adjectives were employed in the 
construction of the lists. The Glaze (1928) 
meaningfulness values of the syllables ranged 
from 33 to 73. Within each set of syllables 
no vowel was used more than twice while 
no consonant was used more than once, 
None of the syllables in the second set began 
with a letter which served as a first letter in 
the first set. None of the syllables in the 
second set had more than one letter in 
common with any of the syllables in the 
first set, and if the duplication involved a 
consonant it occurred in a different position, 

The adjectives were taken from those used 
by Houston (1966). Two different pairings 
of the first-list stimulus syllables with the 
first-list response adjectives and two pairings 
of the first-list adjective stimuli with the 
Tesponse adjectives were developed. Each 
Pairing was assigned equally frequently in 
the appropriate conditions, For each first 
list three different second lists were con- 
structed such that each corresponded to one 
of the three experimental second lists (A-C, 
C-B, C-D). Inter- and intralist meaning 
similarity was as minimal as close inspection 
would allow. All units within each set of 
stimuli and responses began with different 
letters. The two members of each pair 
always began with different letters. Four 
presentation orders of each list were devel- 
oped to minimize serial learning and each 
order was used equally frequently as the 
starting order. — AII learning was by the 
anticipation method at a 2:2-sec. presenta- 
tion rate on a Stowe drum with a 4-sec. 
intertrial and a 1-min. interlist interval. All 
Ss learned the first list to a criterion of one 
perfect trial. The Experimental groups 
were then presented the appropriate second 
lists for 20 anticipation trials. The Control 
Ss cancelled digits for a time equivalent to 
that required by the Experimental Ss to 
learn the second list. Following completion 
Of the second task Ss were given a blank 
sheet of paper and asked to recall and record 
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the first-list stimuli. The first-list responses 
were provided on an index card but Ss were 
told they were not required to match the 
correct stimuli with the responses and that 
the responses were merely provided as an aid 
in recalling the stimuli if S felt he needed it. 
The Ss were given as much time as they 
wished to recall the stimuli. 

Results and discussion—The mean num- 
ber of trials required by the High Meaning- 
fulness control, A-C, C-D, and C-B groups 
to reach criterion on the first list were 11.19, 
13.56, 10.31, and 10.56, respectively. The 
means for the corresponding Low Meaning- 
fulness groups were 10.88, 10.69, 9.00, and 
1231. An analysis of variance indicated no 
differences attributable to stimulus meaning- 
fulness, F <1, paradigm, F = 142, or their 
interaction, F < 1. 

The mean numbers of correct responses 
produced by the High Meaningfulness A-C, 
C-D, and C-B groups during second list 
learning were 124.88, 128.31, and 134.44, 
respectively. The corresponding low mean- 
ingfulness means were 137.75, 141.63, and 
137.00. Unexpectedly, more correct re- 
sponses were given to the items containing 
nonsense syllables as stimuli than to the 
items containing adjectives as stimuli, F (1, 
90) —10.76, p < .01. Also somewhat sur- 
prising was the fact that the paradigms did 
not differ in terms of second-list learning, 
F<1. The interaction was not significant. 

The mean numbers of stimuli recalled by 
the High Meaningfulness control, A-C, C-D, 
and C-B groups were 6.69, 7.13, 5.13, and 
2.50. The low meaningfulness means were 
6.33, 7.19, 5.06, and 2.75. The meaningful- 
ness variable was not associated with signifi- 
cant differences in recall, F<1. The 
paradigm main effect was highly significant, 
F (3, 120) = 54.58, p < .01. The interaction 
was not significant, F < 1. Subsequent indi- 


vidual comparisons of the paradigm totals 
indicated that the A-C paradigm did not 
differ from the Control, that recall in the 
Contro! was significantly better than in the 
C-D condition (p<.05), and that recall in 
the C-D condition was, in turn, significantly 
better than in the C-B condition (p<.01). 

These data are very similar to those ob- 
tained by Keppel and Underwood (1962). 
They extend the findings of these authors 
to two levels of stimulus meaningfulness. 
They lend further support to the general 
position that overall unlearning effects may 
be determined by the extinction of a number 
of discrete associations within the two-list 
RI situation. The significant difference be- 
tween recall in the Control and the C-D 
condition suggests that first-list C-S asso- 
ciations were extinguished during second-list 
learning. The additional loss in the C-B 
paradigm is consistent with the fact that R-S 
as well as C-S associations conform to the 
A-B, A-C relationship in this paradigm. 
The A-C recall means were, as predicted, 
greater than the control means but the dif- 
ferences were not significant. In the Keppel 
and Underwood (1962) study the control 
mean was greater than the A-C mean. This 
might possibly mean that the recall task in 
the present experiment may have been 
more dependent upon C-S strength than the 
task used by Keppel and Underwood in 
which S was asked to pair the stimuli with 
the correct responses. 
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REVERSAL AND NONREVERSAL SHIFT LEARNING IN RETARDATES 
AS A FUNCTION OF OVERTRAINING? 


ELIZABETH S. OHLRICH anp LEONARD E. ROSS 
University of Wisconsin 


Conflicting results have been reported with respect to the reversal (R) 
and nonreversal (NR) discrimination shift performance of retarded 


children. 


The present study tested the possibility that these results 


were due to degree of training on the original problem, with over- 


training (OT) leading to R, NR differences. 


Groups of retarded 


children were given R and NR training to criterion (C), or to 
criterion plus 125 trials of OT. As predicted the C groups did not 


differ, but OT led to an R, NR difference, with R easier. 


It was 


suggested that retardates are deficient in the mediating verbal and/or 
attentional processes which result in faster R shifts, but that they can 
be brought to a level comparable to that of normals through OT. 


Conflicting data have been reported with 
respect to the Reversal (R) and Nonreversal 
(NR) shift learning of retarded children. 
House and Zeaman (1962) found R easier 
than NR for retardates of MA 6-8, a finding 
similar to the results of studies using normal 
children of comparable age, or adults. In 
contrast Sanders, Ross, and Heal (1965) 
reported a study in which R was easier than 
NR for normal children, but in which re- 
tardates, approximately matched on MA and 
having almost identical original learning 
(OL) performance to that of the normals, 
showed small and nonsignificant R-NR dif- 
ferences, Sanders et al, suggested that the 
discrepant retardate results might be due to 
the overtraining (OT) procedure of House 
and Zeaman, an obvious Possibility in view of 
the general finding of a facilitating effect of 
OT on R with normal children, In the 
present study the Sanders et al, experimental 
Procedures were used to compare the per- 
formance of retardates under two shift condi- 
tions (R and NR) and two degrees of 
learning, criterion (C) and OT. On the 
assumption that OT was the variable that 
was responsible for the previous conflicting 
results, it was predicted that the C training 
groups would replicate the Sanders et al. 
finding of no R-NR difference, while the R 
superiority results of the House and Zeaman 
study would be obtained with overtraining. 

Method.—The Ss were 77 institutionalized, 
mentally retarded Children. Thirteen Ss 


1 Support was provided by Qj 
from the National Institute of Mental ee 
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were eliminated from the study before being 
assigned to experimental groups because of 
failure to learn the original problem. MA 
and IQ were determined by the Peabody 
Picture vocabulary test. The apparatus was 
that used by Sanders et al, 

The design was a 2X2X2X2x2x2 
factorial design involving two shift conditions 
(R and NR), two degree of training groups 
(C and OT), two dimensions (color, form), 
two cue sets within each dimension, two cue 
valences (each cue was positive once and 
negative once), and two sexes, One S filled 
each of the 64 cells, so that half of the S's in 
each shift degree of training group had each 
dimension relevant in OL. Shift condition 
and degree of training subgroups were bal- 
anced on the basis of set, MA, IQ, and OL 
errors. 

After original learning to criterion, half 
the Ss were immediately assigned to one of 
the two following shift problems: Reversal 
(R) : the original negative cue became 
positive and the original positive cue be- 
Came negative. "The cues on the irrelevant 
dimension were replaced with new cues from 
the same dimension, Nonreversal (NR): 
the original irrelevant cues became relevant. 
The original relevant cues were replaced by 
new cues from the same dimension, which 
now became irrelevant. The other half 
of the Ss were given 125 OT trials. At the 
completion of OT, these Ss were also as- 
signed to one of the two shift problems. 

The Procedure, including special training 
for nonlearners, was similar to that of the 
Sanders et al. study, except that after reach- 
ing the OL criterion S was either immedi- 
ately Started on the shift probem or OT 
trials, depending on his group assignment. 
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On the first day that OT was given, S 
received 80 OT trials. On the following 
day he received the last 45 OT trials and 
then immediately started his shift problem. 
As in the previous study, training on the 
shift problem was continued for 80 trials or 
until S reached a criterion of 15 consecutive 
correct responses. 

Results—The mean MA (in mo.) and 
IQ for each group was: C-R, 116.9 and 68.6; 
C-NR, 117.1 and 68.9; OT-R, 121.1 and 70.5; 
and OT-NR, 117.2 and 69.1. The Ss re- 
quiring special training numbered 5, 6, 2, and 
2, respectively. 

Table 1 presents the mean errors made in 
the OL and shift phases by the four experi- 
mental groups, with form and color designa- 
tions referring to the relevant shift dimension 
in each phase of training. As would be ex- 
pected from the matching procedure, the OL 
performances were very similar. The shift 
results are in obvious accord with prediction, 
the C-R, C-NR difference being quite small, 
in agreement with the Sanders et al. (1965) 
study, while the OT groups are considerably 
different, with R superior, as in the House 
and Zeaman (1962) experiment. Mann- 
Whitney Us were used to evaluate the R-NR 
effects due to considerable nonnormality in 
the distributions. The OT-R, OT-NR dif- 
ference reached the .048 level (s=1.98, all 
tests two-tailed), while the C-R, C-OT 
difference did not approach significance (s 
=.47, p=.6348). A U test analysis of the 
trials to criterion data showed a similar 
pattern with the OT-R, OT-NR and C-R, 
C-NR group differences reaching the .009 
and .5620 levels, respectively (z's— 2.60 
and .58). 

The fact that there were nonlearners in the 
present study raises the possibility that a 
majority of the final population of Ss had 
their preferred dimension relevant in OL, a 
factor that can lead to R-NR differences 
since a majority of R Ss are then reversing 
within their preferred dimension while a 
majority of NR Ss are shifted to their non- 
preferred dimension. This does not appear 
to have been the case in the present study, 
however, since both C and OT groups should 
have been affected, and the C-R, C-NR dif- 
ference was nonsignificant and in the opposite 
direction from that expected from such an 
OL preferred-dimension effect. Neither does 
it appear plausible that the C and OT groups 
differed greatly in the number of Ss as- 
signed to their preferred dimension since the 
groups were balanced with respect to OL 


TABLE 1 


MEAN ERRORS IN ORIGINAL AND 
SHIFT LEARNING 


Original Learning Shift Learning 
Group 

Form Color Form Color 
C-R 18.75 16.62 11.10 20.125 
C-NR 8.75 25.12 1.5 28.125 


OT-R 12.50 21.12 12.75 5.75 
OT-NR 12.88 23.13 13.25 24.38 


errors, and learning difficulty would be ex- 
pected to reflect the preferred-dimension 
factor. 

Discussion.—The C condition replication of 
the Sanders et al. retardate findings, and the 
replication of the House and Zeaman results 
after OT, provide strong support for the 
suggestion that the length of training on the 
original problem was responsible for the 
discrepant results of the two studies. The 
present results also indicate that although 
retardates may not show the normal Ss R 
superiority to NR effect under certain ex- 
perimental conditions, they can be brought 
to a similar R, NR performance pattern 
through OT on the original problem. It is 
interesting that similar effects of OT on R 
and NR have been reported for second- and 
third-grade normals by Furth and Youniss 
(1964), who used a different and probably 
more complicated task; and by Tighe and 
Tighe (1965) for normal Ss approximately 
21 mo. younger than the third-grade normals 
of the Sanders et al. study. A similar 
general pattern of R, NR effects, related to 
age rather than degree of training, has been 
found by Milgram and Furth (1964) for 
retarded children of MA 6 and 9. While 
NR was easier than R for younger children, 
the reverse was true for the older Ss (R 
improved with age, NR did not), although 
a separate analysis of the retardate data was 
not reported. It appears probable that the 
mediating verbal and/or attentional processes 
which result in faster R shifts can result 
from either age related factors or specific 
OT procedures, and that retardates are de- 
ficient in these processes as compared to 
normals, but not irrevocably so. 

The present data show an apparent Shift 
X Dimension interaction, with OT appearing 
to have had its largest effects on subgroups 
which had the more difficult color dimen- 
sion relevant during OL. This might be 
expected if mediating or identifying re- 
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sponses to the difficult dimension were at a 
low level initially, which would permit 
more change than was possible with the 
easier dimension where these responses were 
probably at a higher, perhaps nearly asymp- 
totic, level at the beginning of training. Any 
interpretation of shift-dimension interactions 
is of questionable value, however, since the 
OL-shift comparisons necessary to evaluate 
the effects of OL dimension difficulty are 
complicated by the lack of comparability of 
the OL (special training) and shift phase 
(80-trial cutoff) procedures. 
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REINFORCEMENT AFTEREFFECTS AND INTERTRIAL INTERVAL : 


SANFORD KATZ, GEORGE T. WOODS,2 anD JUDITH H. CARRITHERS 
Hobart and William Smith Colleges 


A. test of the relationship between intertrial interval (ITI) and the 
development of differential response to alternating partial-reinforce- 
ment schedules was performed using a sucrose solution as rein- 
forcement, 3 groups of 10 rats received 9 trials a day in a runway for 


30 days with ITIs of 1, 2, or 20 min. 
plished by blocking S's access to the 
Starting-, running-, 
marked superiority of the i-min. gr 


reinforcement, 


response. 
20-min. group, 


The use of the hypothetical reinforcement 
aftereffects mechanism is supported by the 
fact that rats come to respond more slowly 
in a runway on the nonreinforced trials of a 
single-alternation intermittent reinforcement 
schedule. It is assumed that the reinforce- 


1 This research was Supported by Res 
MH 10075-01 from the National Institute of Meant 
Health and a grant from Hobart and William Smith 


P à 
ae E at Connecticut College, New. London, Con. 


Nonreinforcement was accom- 


reinforcement rather than omitting 


and goal-time measures indicated 
oup in developing the differential 


There was little indication of differential responding in the 


The possibility exists that in the Capaldi 
and Stanley study the presence or absence 
of reinforcement on the given trial provided 
differential stimulation, Such an interpreta- 
tion is suggested by the use of wet mash as 
reinforcement and the presence of different 
goal sections on reinforced and nonreinforced 
trials. A similar interpretation was also 
applied to a study of Capaldi and Spivey 
(1964) by Surridge and Amsel (1965) who 
aet io ae Capaldi and Spivey's dem- 
onstration of differenti i i 
ITI of 24 tre ential responding with an 


measure. A test series in which a trial 
was Just as likely to be followed by a rein- 
forced trial as a nonreinforced trial permitted 
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the separation of within-trial cues from after- 
effects. The analysis of test-trial data indi- 
cated that such within-trial cues were opera- 
tive for goal-time measures only. These 
data point up clearly the need for control of 
potential within-trial cues, such as reinforce- 
ment odors, when attempting to assess the 
duration of aftereffects. In the present ex- 
periment, nonreinforcement procedures were 
employed which insured that the presence or 
absence of reinforcement during the trial did 
not provide differential stimulation. 

Method.—The Ss were 30 male albino rats 
obtained from the Holtzman Company, 
Madison, Wisconsin. They were 79 days old 
when placed on the  water-deprivation 
schedule. 

A middle-gray runway, the inside dimen- 
sions of which were 38 in. long and 3 in. 
high and wide, was employed. Starting-box 
and goal-box guillotine doors were placed 
114 and 184 in., respectively, from the start- 
ing end. Two 6-in. long floor panels were 
situated 12 and 24 in., respectively, from the 
starting end. Two inches from the goal end 
was situated a 4 in. high barrier behind 
which was placed the caster which served as 
a water cup. A movable wooden lid hinged 
to the back wall covered the water cup and 
rested on the barrier in such a manner that 
Ss could nose up the lid to reach the water. 
An angle iron was attached to the goal end 
of the alley in such a manner that it could 
block the lid from being opened enough to 
allow S to reach the reinforcement. On re- 
inforced trials, the angle iron was simply 
swiveled out of the locking position. Time 
scores were obtained to the nearest .01 sec. 
by means of Hunter klockounters. Starting 
times were taken from the opening of the 
starting-box door to S’s stepping on the first 
floor panel, running times from S’s stepping 
on the first panel to his stepping on the sec- 
ond panel, and goal times from the depression 
of the second panel to the lifting of the lid. 
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Water was made available to Ss for 
only 1 hr. a day starting 7 days prior to the 
first training day. On the fifth and sixth 
days of preliminary training, each S was 
given three trials in the goal box to train 
them to lift the lid covering the water dish. 
On Day 5 the lid was left open 14, 1, and 4 
in. on successive trials. On Day 6 the lid 
was left open 1, 3, and 0 in. The Ss were 
allowed to drink for 15 sec. from the water 
cup which contained a solution of 10 gm. 
of sucrose to every 90 ml. of distilled water. 
On Day 7, each S was given three regular 
trials in the straight alley, all of which were 
reinforced with 15 sec. access to the sucrose 
solution. 

For the next 30 days, Ss were given nine 
trials a day on the single-alternation rein- 
forcement schedule. Ten Ss each were 
assigned at random to one of three groups 
with ITIs of either 4, 2, or 20 min. The 
two daily orders, RNRNRNRNR and 
NRNRNRNRN, were alternated for each $ 
with half of the Ss in each group receiving 
the first order on the odd-numbered train- 
ing days and the remaining Ss on the even- 
numbered days. The first trial of each day 
provided an alternating schedule with ITI 
of 24 hr. The daily order of running 
within groups was systematically varied. 
Daily order of running among groups was 2, 
4, and 20 min. Each group was watered 14 
hr. after the middle trial of each day’s 
session. Thus the mean number of hours of 
water deprivation for each group’s set of 
trials was 213 hr. On nonreinforced trials 
when the goal lid was locked, S spent 15 sec. 
in the goal box. If S failed to stop any one 
of the three clocks on any trial within 60 
sec., he was placed in the goal box and left 
there for 15 sec. The remaining time meas- 
ures were recorded as 60+ sec. 

Results and discussion —The median start- 
ing, running, and goal time of the last four 
reinforced and the last four nonreinforced 


TABLE 1 
Summary OF WILCOXON'S TESTS 


Measure 
Grane Starting Running Goal 
No. of No. of Novak 
ist day days ist day days ist day EE 
15 16 17 14 16 15 
2 30+ 0 28 1 21 8 
20 30+ 0 304- 0 30 1 
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R-N RUNNING SPEED 
è o 


o 


R-N GOAL SPEED 


r6 18 19-24 25-30 


7-12 II 
TRAINING DAYS 
Fro. 1. Mean differences in speed of response. 
trials of each day was determined for each S. 
Wilcoxon’s (1949) test for paired replicates 
was employed to compare performance on R 
and N trials for each group, day, and 
measure. Table 1 summarizes these results 


indicating the first day on which reinforced 
trials were significantly shorter at the .025 
level (one-tail test), and the number of days 
on which reinforced times were significantly 
shorter. Wilcoxon’s test was also applied to 
the daily first-trial scores for groups, meas- 
ures, and successive pairs of days (a total of 
135 tests). None of the comparisons re- 
sulted in significantly longer times on non- 
reinforced trials. (The differences were 
significant in the opposite direction for four 
of the comparisons. ) 

The median scores and first trial scores 
were subjected to a 10 divided by time in 
seconds transformation and averaged over 
blocks of 6 training days for R and N trials, 
measures, and groups separately. Figure 1 
presents these data as R minus N difference 
Scores. The 24-hr. curves are based on the 
performance of all Ss. 

The results of this experiment support the 
assumption that the aftereffects of reinforce- 
ment and nonreinforcement dissipate with 
time, There is indication that aftereffects 
are markedly less available after 2 min. 
There is a suggestion that aftereffects are 
virtually eliminated after 20 min., but more 
training trials might not have supported this 
Statement. The use of sugar water as a 
reinforcer might underestimate the duration 
of aftereffects when food is used as a rein- 
forcer. This can only be determined by 
further research employing proper controls 
for potential goal-box cues, 
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SECONDARY REINFORCEMENT IN CHILDREN ASA 
FUNCTION OF TRAINING PROCEDURES? 


JEROME L. MYERS anp NANCY A. MYERS 


University of Massachusetts 


96 preschool children were tested in a 2-button choice situation after 
previous experience with 2 lights in either a 1-button or 2-button situa- 


tion. 
produced a light previously associ 


In the test, all groups chose between a button that always 
jated with nonreward (100% NS") 


and a button that produced a light previously associated with reward 


(S5 
the light (10076 


For 3 the Ss, responses on this S" button always produced 
S"); for the other 3 of the Ss, 20% of responses 


on the S" button produced the light and all other responses on this 


button produced nothing (20% S"). 


Greater preference for the S* 


button was shown by the groups trained with 1 button and by the 100% 


S" groups. 


In a number of recent studies, Myers and 
Myers (1964, 1965) have employed both 
extinction and choice tasks to test for sec- 
ondary reinforcement (S") effects, after 
various schedules of training in an operant 
situation. In training, all Ss were required 
to make a button-pushing response which was 
reinforced by an M & M candy and a light 
following 20% of the responses. In the 
extinction task, Ss were presented the light 
only following either 100% or 20% of button- 
press responses. One hundred percent pres- 
entation resulted in significantly more extinc- 
tion responses than did 20% presentation 
(Myers & Myers, 1964). In the choice task, 
Ss were required to choose between two 
buttons, one of which always produced the 
light previously associated with reward fol- 
lowing all (100% S") or 20% (20% S") of 
the responses on that button. Pushing the 
other button always produced a light not 
previously associated with reward (100% 
NS"). The preference for the S" over the 
NS" button was greatest under the 20% S* 
schedule (Myers & Myers, 1965). 

The results of response-rate data from ex- 
tinction tests suggest that 100% S* is more 
effective than 20% S"; the preference data 
from choice tests suggest the opposite con- 
clusion. Because of the difference in meas- 
ures, a direct experimental comparison of the 
two test procedures is difficult. To meet 
this difficulty, the present experiment em- 
ploys a procedure in which Ss are both 
trained and tested with two buttons. This 
is then compared with the usual choice 
procedure, training on one button and testing. 
with two. 


l'This research was supported by funds from Na- 
tional Institutes of Health Grant HD-00898-06. 


Method.—Ninety-six children attending 
kindergartens in Northampton, Massachu- 
setts? participated in a simple push-button 
task, with M & M candy reinforcement. 
They were assigned to four groups of 24 Ss 
each, differing with respect to the type of 
training and the percentage of S" in the test. 
In training, all groups received 20% candy 
reinforcement on the reinforcing button, or a 
total of 16 M & Ms for 80 responses. The 
One-Button groups received one colored light 
stimulus whenever candy was delivered for 
a button press on the single button present, 
and a different colored light stimulus when 
no candy was received. The Two-Button 
groups received one colored light stimulus 
whenever candy was delivered for a button 
press on one of the two buttons present, but 
no light stimulus when no candy was received 
on this button. The other colored light 
stimulus was received for each button press 
of the second button; no candy was ever 
delivered for presses of this button. 

During the test, no candy was delivered, 
and all groups were required to choose be- 
tween two buttons. One button always pro- 
duced the light previously associated with 
nonreward (100% NS"); the other button 
produced the light previously associated with 
reward (S"). However, for half of the Ss, 
responses on this S* button always produced 
the light (100% S"); for the other 48 Ss, 
20% of responses on the S* button produced 
the light and all other responses on this but- 
ton produced nothing (20% S"). Testing 
was continued for 15 min., or until the chil- 

2The authors wish to thank William Barry, Super- 
intendent of Schools, and Esther Wien, Elementary 
School Supervisor, Northampton, Massachusetts for 


their cooperation in providing facilities and Ss for this 
study and Florence Mador for assistance in running Ss. 
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TABLE 1 


MEAN DIFFERENCE FOR Each OF FIVE SUCCESSIVE 
3-Min. BLOCKS or TEST 


Successive 3-Min. Blocks 


Group 
| pel igre eae 

OneButton— 29.84 31.25 | 36.04) 36.46 | 36.13 
20% Sr 

One-Button— | 36.00 | 61.13 | 62.17 | 55.62 | 52,04 
100% Sr 

Two-Button— 16.08 | 22.88 | 25.04 | 28,50 | 28.58 
20% St 

Two-Button— | 17.75 | 27.66 | 31.08 | 31.75 | 38.04 
100% St 


dren terminated the session. Any S who 
stopped before the end of the test received a 
score of zero for each minute missed. 
Results and discussion—The mean differ- 
ences between the numbers of responses made 
by each group to the S' and NS" buttons 
may be seen in Table 1. The S" button is 
consistently preferred; the relevant F ratio 
(1, 92) is 108.19, p <.001. The one-button 
training procedure yields significantly greater 
preference for the S" button; F (1, 92) 
= 6.09, p<.05. Greater preference for the 
S" button is also shown by the 100% St 
groups; F (1, 92) —3.98, p<.05. The 
trials effect is significant, F (4, 368) = 12.87, 
p< .001, reflecting an average increase in 
preference for the S" button over five 3-min. 
blocks of time for all groups. The 100% S* 
groups showed a more rapid increase in 
preference than did the 20% S" groups; the 
appropriate F ratio (4, 368) is 275, p < .05. 
The major new finding of the present study 
is that one-button rather than two-button 
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training yields greater secondary reinforce- 
ment effects (as measured by button prefer- 
ence) in a two-button choice test. These 
results are contrary to the predictions of a 
discrimination theory of secondary reinforce- 
ment which has been discussed in detail else- 
where (e.g, Myers & Myers, 1965). Es- 
sentially, the theory predicts that the S* 
effect will increase as a function of similarity 
between training and test conditions. Cer- 
tainly in this study employing a two-button 
choice test, the training and test conditions 
are more nearly identical for the two-button 
training groups than the one-button training 
groups. 

The result that greater preference for the 
S" button was shown by the 100% S" groups 
than the 20% S" groups should also be 
noted. "These data are consistent with find- 
ings of studies employing an extinction test 
of S" (e.g, Myers & Myers, 1964), but are 
not consistent with previous two-button 
choice test data. This discrepancy may be 
due to the fact that the one-button training 
schedule of this experiment involved exposure 
to both the S" and the NS" stimuli in order 
to facilitate comparison with the two-button 
training schedule. In previous experiments 
involving the choice test, only the S" was 
present during the one-button training. 
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ADAPTATION TO A ROTATED VISUAL FIELD AS A 
FUNCTION OF DEGREE OF OPTICAL TILT 


AND EXPOSURE TIME* 


SHELDON M. EBENHOLTZ? 


Connecticut College 


3 groups of Ss were exposed to optically rotated visual fields of 10, 


20, and 32°, respectively, 


magnitude of adaptation was a linear 
negatively accelerated function of time. 


per unit of time occurred within the 


for a total adaptation time of 4 hr. 


The 
function of optical tilt and a 
The highest rate of adaptation 
1st hour. Alternative theoretical 


accounts of the time rate of adaption were discussed. 


Several contemporary theoretical 
points of view have been proposed con- 
cerning the nature of perceptual adap- 
tation to monotonic transformations of 
the optic array (e.g. Harris, 1965; 
Held & Freedman, 1963; Kohler, 
1964; Rock, 1965; Taylor, 1962). 
There have been, nevertheless, rela- 
tively few parametric studies (eg. 
Hay & Pick, 1966) and as a conse- 
quence theory construction has not 
been guided by quantitatively expressed 
functional relations and their boundary 
conditions. In the present study op- 
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tical tilt and exposure time were 
manipulated and the attempt was made, 
to specify the functional relation be- 
tween these variables and the magni- 
tude of adaptation. 


METHOD 


Apparatus—The Ss viewed the 'environ- 
ment monocularly through a set of Dove 
prisms mounted in tandem so as to eliminate 
right-left reversals. The visual field was 
approximately 30° in all directions and 
could be rotated clockwise (CW) or counter- 
clockwise (CCW) up to 45° from vertical. 
The prisms were mounted in cylinders and 
were held in place over each eye by means of 
a head gear. 

Adaptation was measured in a light-free 
room with prisms removed, by setting an 
electroluminescent rod to the apparent verti- 
cal. The rod was 12 in. in length and.4 in. 
wide and was pivoted about its center at 5 ft. 
from S. A chin and forehead rest were used 
to secure S's head in position such that the 
pivotal point of the rod was at eye level. 
Settings of the rod to the apparent vertical 
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were taken prior to the adaptation period 
and after each hour for 4 hr. during adapta- 
tion. At each test period the rod was set 
initially at 25° CW or CCW and S was 
instructed to direct E to move the rod until 
it appeared to be in the same direction as 
gravity, ie, upright The procedure was 
repeated four times, alternating CW and 
CCW starting positions, with the average of 

` the four measures representing S’s apparent 
vertical, Each eye was tested separately 
"with the right eye exposed first at each 
session. The level of adaptation was defined 
as the difference between the average setting 
at any given hour and the average pre- 
adaptation measure for the appropriate eye. 
Adaptation was judged positive provided the 
deviation of apparent vertical from the pre- 
adaptation level was in the same direction 
as the tilt produced by the prisms. Thus, 
under a 20° CW tilt condition complete 
adaptation would be indicated by a rod set- 
ting at 20° CW from the preadaptation meas- 
ure. Under this condition the rod should 
appear vertical to S. 

Design.—Three groups of eight Ss each 
were exposed to optically tilted visual fields 
of 10, 20, and 32°, respectively. The Ss 
were assigned to groups alternately in the 
order of arrival at the laboratory. The en- 
vironment was viewed through the right eye 
with the left eye occluded for a total adapta- 
tion period of 4 hr. exclusive of the test 
periods. Half the Ss in each group wore 
prisms that shifted the optic array CW with 
the remaining Ss exposed to a CCW shift. 
There were a total of four test sessions 
each lasting approximately 6 min. These 
took place after each hour of adaptation with 
the right and left eyes tested individually in 
the order given. The data for CW and 
CCW conditions were pooled and were 
treated in terms of a mixed-model analysis 
of variance with the left and right eye, and 
the four exposure intervals providi, ig the two 
within variables while the three degrees of 
optical tilt served as the between factor. 

During adaptation S's activity consisted 
primarily in walking through long corridors, 
and up and down steps. For approximately 
equal portions of each exposure interval Ss 
were also occupied with a jigsaw puzzle, card 
playing, and dart throwing. For a 7-min. 
period of the first and again at the fourth 
hour Ss read several Pages of printed text. 
The choice of activities was determined 
largely by the assumption that self-produced 
movement and the coincident afferent (vis- 
ual) feedback (eg, Held & Rekosh, 1963) 
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are significant antecedent conditions for the 
production of adaptation. Accordingly, Ss 
were encouraged continually to move their 
heads from side to side (viz, to look to 
their right and left). With such head 
movements, the flow pattern of images across 
the retinal surface normally is along the 
axis of head movement, When tilting prisms 
are worn, the identical head movement co- 
incides with a flow pattern whose axis of 
movement is parallel to the tilt in the optic 
array produced by the prisms. Thus a 
horizontal head movement correlated with 
a diagonal retinal flow pattern was assumed 
to represent a change from the preadaptation 
motor-sensory feedback relation to which the 
moving S would adapt. 

Subjects—Twenty-four paid Ss who were 
female undergraduates took part. All Ss 
had either normal vision or wore fully 
correcting contact lenses. None had prior 
knowledge of the experimental variables nor 
had they taken part in prior adaptation 
studies. 


ReEsutts 


The means and variances of the three 
groups tended to be correlated thereby 
indicating heterogeneity of variance. 
Consequently a logarithmic transfor- 
mation was performed and the analysis 
of variance was based upon the trans- 
formed scores, 

Exposed vs. covered eye.—The eye 
that was exposed (E) during adapta- 
tion produced significantly higher levels 
of adaptation than the covered (C) 
eye, F (1, 21) =21.00, p< .005. 
Since E vs. C did not interact with tilt 
conditions, F (2, 21) = .33, p> .05, 
data for the three degrees of tilt were 
pooled and differences between E and 
C were tested at each of the four time 
intervals. In each case E was signifi- 
cantly greater than C at p < .01, indi- 
cating that interocular transfer was less 
than perfect. . This result contrasts 
with that of Hajos and Ritter (1965) 
who found complete interocular trans- 
fer of adaptation to spatial displace- 
ment. The ratio of the level of adapta- 
tion of the covered eye to the exposed 
eye was determined for each exposure 
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condition and test interval. The ratios, 
averaged over the four time intervals 
were .41, .66, and .64 for optical tilts 
of 10, 20, and 32°, respectively. The 
magnitude of transfer at the 10° condi- 
tion was probably attenuated by the 
relatively high incidence of negative 
or zero C:E ratios due to the low 
absolute levels of adaptation in this 
condition. The values at 20 and 32° 
are therefore more reliable indicators 
of the extent of interocular transfer. 
It may also be noted that the covered 
eye was always tested after the exposed 
eye. Since testing required approxi- 
mately 3 min., if adaptation is assumed 
to decrease with time in the dark 
(Hamilton & Bossom, 1964), then 
measurements taken with the covered 
eye would tend to be lower than if they 
were taken immediately. Conse- 
quently, estimates of interocular trans- 
fer given above may be conservative. 
Adaptation time.—Adaptation time 
(t) was not a significant source of vari- 
ance, F (3, 63) = 1.00, p > .05; how- 
ever the interaction of t with C vs. E 
was significant, F (3, 63) = 5.00, p 
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« O01. This result may be interpreted 
to mean that the differences between 
E and C were not of the same magni- 
tude but varied as a function of the 
hour at which adaptation was tested. 
Accordingly, the mean differences be- 
tween E and C at each of the four test 
periods were compared with each other 
using ¢ tests. The differences tended, 
to increase with time. However, only - 
the differences between Hr. 3 and 1 
and 3 and 2 were significant in favor of 
Hr. 3 at p < .05. 

The failure to find a significant main 
effect of time indicates that either in- 
creases in the level of adaptation 
beyond the first hour were extremely 
small or that the adaptation process 
reached asymptotic levels within the 
first hour of exposure. Figure 1 rep- 
resents the level of adaptation as a 
function of exposure time for the three 
degrees of tilt. The smooth curves 
represent hyperbolic functions fitted to 
the data by the method of least squares. 
The functions fit the data reasonably 
well, yielding standard errors of esti- 
mate of .58, .12, and .82° for the ex- 
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Fic. 1. Magnitude of adaptation as a function of exposure time for the exposed and 


covered eye with optical tilt as parameter. 
method of least squares.) 


(Smooth curves represent the best fit by the 
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posed eye and .38, .40, and 46° for 
the covered eye at 10, 20, and 32°, re- 
spectively. All curves have y asymp- 
totes well below the level required for 
complete adaptation. This indicates 
that continuous wearing of the prisms 
beyond the 4 hr. of the present experi- 
ment would not increase the level of 
adaptation appreciably. Similar nega- 
tively accelerated functions have re- 
cently been reported by Hay, Pick, 
and Rosser (1963) for adaptation to 
chromatic fringes produced by pro- 
longed exposure to right-angled prisms, 
and by Wertheimer and Arena (1959) 
in a study of adaptation to lateral 
displacement. 

Optical tilt—Degree of optical tilt 
proved to be a significant source of 
variance, F (2, 21) = 347, p < .05. 
The mean level of adaptation averaged 
over the four time intervals was for 
the exposed eye 3.50, 5.59, and 8.24? 
at 10, 20, and 32° of optical tilt, re- 
spectively. The comparable data for 
the covered eye were 2.26, 3.94, and 
5.56°. Figure 2 represents the level 
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of adaptation as a function of optical 
tilt! With some minor variation, the 
functions, for each hour of exposure 
and for the exposed and covered eye, 
respectively, clearly are linear. How- 
ever, since most of the curves appear to 
have y intercepts slightly greater than 
zero, there is the possibility that the 
function would be nonlinear for optical 
tilts near zero. It is also possible to 
attribute the positive y intercepts to 
errors of measurement. Similar linear 
relationships have been reported by 
Morant and Beller (1965) using opti- 
cal tilts of 15, 45, and 75° with a 15- 
min. exposure interval, 


Discussion 


The rate of adaptation per unit time 
was greatest during the first hour of ex- 
posure and decreased rapidly thereafter. 
Assuming the hyperbolic functions shown 
in Fig. 1 adequately represent the course 
of adaptation over time, it follows that 
adaptation would never be complete, re- 
gardless of the total time spent in con- 
tinuous wearing of the prisms. On the 
unlikely assumption that the slope of 
the function beyond the first hour was 
actually constant, a straight line was 
fitted to the data of Hr. 1-4 and the 
amount of time required for full adapta- 
tion was computed. The results showed 
that 38.6, 46.2, and 81.7 hr. of continuous 
adaptation would be required to adapt 
completely to optical tilts of 10, 20, and 
32°, respectively. Thus, even a liberal 
estimate of the time rate of adaptation in- 
dicates that adaptation proceeds at an 
extremely slow rate beyond a relatively 
small initial time interval. 

The rate of growth of the adaptation 
Process requires explanation. One pos- 
sibility is suggested by the assumption 
that the time rate of adaptation is deter- 
mined by the retinal rather than the opti- 
cal tilt at any given instant in the course 
of adaptation. The distinction between 
the terms arises out of the fact that the 
retinal direction corresponding to the ap- 
parent vertical changes over time in the 
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direction of the optical tilt as a conse- 
quence of the adaptation process. Thus 
the angular measure of optical tilt im- 
posed by the prisms stays constant rela- 
tive to the true direction of gravity but 
diminishes relative to the retinal direction 
corresponding to the apparent vertical. 
The fact that the tilt to which S is ex- 
posed may be computed according to the 
latter procedure, designated as retinal tilt, 
together with several additional assump- 
tions, provides one basis for a theoretical 
account of the observed growth function. 
For example, consider an S$ exposed to a 
20° optical tilt and assume 5° of adapta- 
tion at the end of 1 hr. The above 
analysis suggests that at the beginning 
of the second hour S would be exposed 
to a retinal tilt of 15°. If adaptation is 
assumed to be a linear function of retinal 
tilt, and if the slope of this function re- 
mains constant over time? then during 
the second hour S would adapt by an 
amount equivalent to a constant fraction 
(e.g. .25) of 15° or 375?. At the begin- 
ning of the third hour the new retinal tilt 
would be given by 20— (5 +3.75) or 
11.25? and the level of adaptation would 
be expected to increase by 2.81? (i.e. 
11.25 X .25), Through successive appli- 
cations of this procedure the change for a 
given unit of time (#) in the magnitude 
of adaptation (A) as a function of optical 
tilt (T) can be represented by the formula 
AA — TK (1—K)^ where K is the 
slope of the linear function relating A to 
T, and t represents positive units of time 
greater than zero. The rate of adapta- 
tion can thus be shown to be a negatively 
accelerated exponential function of time 
for with K<1, AA decreases as t 
increases. 

It may be concluded that it is possible 
to deduce the negatively accelerated func- 
tion describing the time rate of adaptation 
through iterative application of the linear 
relation assumed to hold between magni- 
tude of adaptation and retinal tilt. 

An alternative hypothesis stems from 
the proposition that the magnitude of 


3 This is a simplifying assumption only, for 
the slope is free to vary within limits without 
altering the conclusion which follows, 
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adaptation is a linear function of optical 
tilt (eg. see Fig. 2) and, in contrast 
with the previous analysis, this quantity 
is assumed to be independent of the in- 
stantaneous state of adaptation of the eye. 
In this context, optical tilt may be defined 
as the angular difference between respec- 
tive directions of two successive retinal 
flow patterns (e.g., the directional differ- 
ence between the "normal" preadaptation 
flow pattern and that produced by the 
wearing of prisms). Since adaptation is 
assumed to vary linearly with this differ- 
ence angle, it follows that the adaptation 
process entails the use of a memory 
mechanism whereby the direction of a 
previously encountered flow pattern may 
be retained and, in some fashion, com- 
pared with that of an ongoing pattern. 
The terms of the present model of the 
adaptation process may be coordinated 
with those of the feedback theory of von 
Holst (1954) by assuming the comparison 
to be between the efferent copy of the pre- 
adaptation motor-sensory loop and the 
new reafferent signal engendered by the 
tilting prisms. The present view further 
assumes that the organism is maximally 
sensitive to such differences early in the 
adaptation period with sensitivity falling 
off in time as a result of the gradual 
decline in the efficiency of the memory 
mechanism. The decline may, of course, 
be due either to a general decay of the 
original memory process or to interfer- 
ence from the deposition of memory traces 
of the direction of the ongoing flow 
pattern. In either case the negatively 
accelerated function relating degree of 
adaptation to time (see Fig. 1) may be 
understood as a consequence of the rela- 
tively rapid loss over time of memory 
traces associated with the angular direc- 
tion of a previously exposed retinal flow 
pattern. Accordingly, the poorer the 
sensitivity to differences in direction be- 
tween the two patterns, the slower should 
be the rate of adaptation. It is apparent 
that both formulations may, in principle, 
account for the present data, and that only 
further tests of the empirical consequences 
of these views can determine their 
validity. 
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The linear relation between magnitude 
of adaptation and degree of optical tilt 
(see Fig. 2) which was first reported 
by Morant and Beller (1965) was again 
observed to hold in the present experi- 
ment over a different range of time and 
tilt values. The simplicity of this relation 
is quite surprising in relation to other 
psychological functions. It is of particu- 
lar interest to note that in contrast with 
acquisition data from studies of verbal 
learning, there appears to be no apparent 
need for a concept of capacity, viz, the 
magnitude of adaptation in any given unit 
of time is dependent only upon the degree 
of optical tilt to which S is exposed. It 
is probable that absolute magnitude of 
adaptation is not the appropriate analogue 
to capacity as is amount learned. Rather, 
the percentage of adaptation per unit time 
does appear to have definite limits and to 
produce marked inter-§ differences as 
well. 

A final comment concerns the fact that 
one study of adaptation to a 20° optical 
tilt (Mikaelian & Held, 1964) has shown 
complete adaptation within 2 hr. Pre- 
selected fast-adapting Ss were used in this 
study whereas the present data reflect the 
average of slow and fast adapters. The 
implications of such individual differences 
remain to be drawn. 
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SPEED SCORES OBTAINED AT RANDOM VERSUS 
STANDARD TIMES UNDER A CONTINUOUS 
PERCENTAGE WEIGHT LOSS* 
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When Ss are maintained on a continuous weight-loss schedule, time of 
data collection bears no constant temporal relationship to time of feed- 
ing. Thus, tests of temporal conditioning need not be confounded with 
changes in drive. Comparisons between groups run at standard vs. 
random times each day failed to reveal any differences with respect to 
performance strength and variability. Since drive was controlled and 
constant, we conclude that no temporal conditioning occurred. 


When Ss are white rats and motiva- 
tion is defined in terms of a 23-hr. 
food-deprivation interval, Es almost 
always collect data at the same time 
each day, The reasoning behind this 
procedure lies in the assumption of an 
activity cycle, biological clock, depriva- 
tion drive, or some combination. 
Whatever advantages this procedure 
may have, it is obvious that the time 
of data collection is rhythmically yoked 
to the daily meal (i.e., rigidly cyclic) 
and provides opportunity for the de- 
velopment of an antedating response 
(rg) to the time interval of 23 hr. (i.e., 
temporal conditioning). We have not 
found any reports of temporal condi- 
tioning of the running response per se. 
Studies of the influence of feeding 
rhythm on subsequent activity or run- 
ning measures would lead one to ex- 
pect temporal conditioning of running 
on the basis of the Bolles and deLorge 
(1962) and Birch et al. (1958) reports 
but not from the data of the Brown 
and Belloni (1963) study. A test for 
the degree of temporal conditioning to 
the 23-hr. interval requires that data 
be collected at an hour other than the 
23-hr. interval. The data, however, 


1 Supported by Grant MHO7157-04 from 
the National Institute of Mental Health to 
David Ehrenfreund. 


are influenced jointly by a change in 
drive from 23 hr. to some other hour 
and generalization of temporal condi- 
tioning. Under the typical hours of 
deprivation procedure it is difficult to 
design an adequate experiment that 
would isolate the joint effects of drive 
and temporal conditioning. If Ss run 
at the same time each day are com- 
pared with Ss run at a random time, 
any difference might be due to daily 
variations in drive or drive stimulus 
rather than the preclusion of temporal 
conditioning in the random group. If 
Ss are run for a large number of days 
at the same hour and then changed to 
a different hour, we are again faced 
with the confounding of joint effects 
of drive and temporal conditioning. 
However, if, in terms of time, data 
collection were entirely independent of 
the drive maintenance schedule, it 
would be easy to devise a test to deter- 
mine the extent to which temporal con- 
ditioning developed to the 23-hr. inter- 
val (or any other) between data- 
collecting sessions. One way out of 
this difficulty is to redefine drive in 
terms of percentage of weight loss and 
then provide a feeding technique that 
will maintain Ss constantly, continu- 
ously, and automatically at the experi- 
mental weight level. Under such con- 
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Fic. 1. Constant weight a 
apparatus is presently being m 
Maryland.) 


PParatus mounted inside of Sound. 
anufactured by Foringer & Comp: 


-attenuating cubicle. (This 
any, Incorporated, Rockville, 


ditions data could be collected under 
any temporal distribution without the 
confounding effects of changes 

either D or Sp simply because Ss w 


always be under the stated drive 
conditions, 


In our laboratory we have for the 
past several years been manipulating 


in 
ill 


TEMPORAL CONDITIONING AND VARIED TEST TIME 


drive by having each S live in a 
constant-weight apparatus (see Fig. 1) 
which automatically brings S to a 
specifiable percentage of ad libitum 
weight and maintains it at that level 
continuously and automatically for as 
long as is desired. These devices as 
previously described (Ehrenfreund, 
1960; Ehrenfreund & Allen, 1964) are 
sensitive to small losses in weight. 
We have set them so that a loss of 1 
gm. in the animal’s experimental 
weight will activate a feeding mecha- 
nism which delivers $ gm. amounts of 
granulated Purina chow at 15-sec. in- 
tervals until the loss is recovered. 
The animals are thus fed throughout 
the day on an aperiodic schedule con- 
trolled by their random periods of 
weight loss. Water is available ad 
libitum. 

Since our Ss are maintained con- 
tinuously on a constant drive, time of 
data collection is entirely independent 
of considerations of drive, i.e., Ss may 
be run at any time and not necessarily 
the same time each day, In the pres- 
ent study two sorts of observations 
were made to ascertain the effect of a 
fixed interval (23 hr.) vs. a random 
interval between daily running ses- 
sions: (a) animals receiving daily ses- 
sions at random times in a straight 
alley were compared with animals 
which were run at a standard or fixed 
time each day; (b) after 12 days one 
half of each of these groups contin- 
ued unchanged, but the other half 
was switched to the schedule of the 
other group. Comparisons were made 
among the four subgroups. In respect 
to the first observation it was assumed 
that, if temporal conditioning occurred, 
the group run at a standard daily time 
would have a lower day-to-day vari- 
ability and a faster alley speed than 
would a group run at random daily 
times, In respect to the second ob- 
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servation the group, changed from a 
fixed to a random schedule, would suf- 
fer a decrement in running-speed 
scores and an increment in running- 
time variability. 


METHOD 


Subjects—Eighteen male Sprague-Dawley 
rats, 100 days old, were reduced to 80% 
of their respective free feeding weights in 
individual constant weight cages, and main- 
tained at that level for 10 days prior to 
the first running session and throughout the 
experiment. 

Apparatus—The Ss were run in a straight 
cloudy plastic alley (Ehrenfreund & Badia, 
(1962) measuring 5 ft. X 23 in. X4 in. Infra- 
red light sources above the maze were di- 
rected to Hunter photocells below the maze, 
which were arranged to provide a 2-ft. run- 
ning speed measure in the middle of the 
alley and an 18-in. terminal speed measure 
at the end of the alley. 

Procedure.—All Ss received five trials 
per day with approximately 8 min. between 
trials for a total of 17 days. During the 
first 12 days, a standard group of 10 Ss 
were run at 3 pm. A random group of 
8 Ss were run over an 11-hr. range from 
8 am. to 7 E.M. with no two consecutive 
daily times closer than 2 clock hr. apart. 
After the twelfth day half of each group 
remained on the same schedule and half 
shifted to the other schedule. 


RESULTS 


The running speed curves, as shown 
in Fig. 2, illustrate smooth and almost 
equal approaches by the fifth day to 
an asymptotic level of 4 ft/sec for both 
groups. A simple analysis of variance 
between the two groups over preshift 
Trial Blocks 6 through 12 yielded a 
between groups F <1 for both run- 
ning and terminal speeds. 

Analysis of the data for the four 
subgroups (Days 13-17) also failed 
to reveal any differences (F <1 for 
both running and terminal speeds). 

To check for a possible temporal 
conditioning effect in terms of differ- 
ential variability, the nonchange sub- 
groups were analyzed separately from 
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Fic. 2. Speed curves for groups run at standard and random daily times. 


(To 


the right of the vertical line half of each group was switched. Each point is the 


mean of Ss' median scores.) 


the change groups. Standard-deviation 
scores obtained from reciprocal times 
over Blocks 8 through 17 for each S 
in the two nonchange groups yielded 
group mean o values of 5.22 and 7.43 
for the random and standard interval 
groups, respectively. A £ of 1.84 be- 
tween the two groups did not reach an 
accepted significance level (5:20, 
df — 3). The difference in mean o 
scores between the two change groups 
was analyzed for order effect (5 blocks 
preshift vs. 5 blocks postshift), group 
effect, and Group x Order interaction, 
All yielded an F « 1. 


Discusston 


All the evidence presented here leads 
to the conclusion that, with constant 
weight maintained, animals may be run 
on a random daily schedule without ef- 
fecting either the rate of approach to 
asymptote, the final asymptotic level ob- 
tained, or the stability of the performance, 

Thus, under these conditions running 
sessions may be Spread out over the day 


on any desired schedule while the cor- 
rect deprivation level is automatically 
and closely maintained. Such a tech- 
nique is amenable to parametric studies 
involving widely spaced trials on the de- 
velopment of FE, PRE, etc. Further- 
more, animals may be run on more than 
one study at a time which in addition to 
possible economies allows for a variety of 
designs for ascertaining interaction ef- 
fects. Some, at least, may be interested 
in convenience factors, i.e., that of free- 
ing E from a rigid daily running sched- 
ule and also allowing him (in certain 
studies) to leave the lab for a few days 
at a time (Ehrenfreund & Allen, 1964) 
while assuring the undisturbed perform- 
ance of Ss upon the resumption of 
running. 
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Vision can adapt for some optical distortions that vary with the direc- 
tion of gaze, in particular for the variable image stretching and shear- 
ing caused by prism spectacles. These 2 adaptations were studied 
using both vertically and horizontally oriented adaptation prisms, 
thereby untying the type of image distortion from the type of cor- 
related eye movement. Experiment I showed the mechanisms of the 
2 adaptations to differ, the shearing adaptation being critically de- 
pendent on the orientation of the correlated eye movements. In Ex- 
periment II, the shearing adaptation was found not to be the exact 
complement of the optical shearing: the apparent tilting of some image 
lines was reduced (positive adaptation), while the apparent counter- 
tilting of other image lines was increased (negative adaptation). It 
was concluded that the adaptation to prismatic image shearing is a 
rotation, most probably a change in the Patterning and registration of 


torsional eye movements. 


Among the more remarkable types 
of adaptation to distorting spectacles 
are the gaze-contingent adaptations dis- 
covered by Kohler (1963). They 
were first observed in the case of prism 
spectacles. Prism spectacles distort 
the shape of the retinal image in a way 
that varies somewhat with the direction 
of gaze through the spectacles ; that is, 
prismatic distortion has a gaze-contin- 
gent component. Figure 1 shows this 
gaze-contingent image distortion for 
the case of base-right prism spectacles. 
When the cross-and-square target of 
Fig. 1 moves from right to left relative 
to S’s head—either because the head 
or the target is moving—and $'s gaze 
thus shifts from right to left, the image 


1 This research was supported by Research 
Grant MH 07588-02 from the National Insti- 
tutes of Health to the University of Minne- 
sota. Special thanks are due to our Ss; 
to our assistants Karren Ikeda, Ann Mosher, 
Bill Frayer, and Mary Viola; and to the 
Psychology Department of Cornell Univer- 
sity for providing space and facilities. Iyo 
Kohler and James J Gibson provided invalu- 
able. advice, and we owe the term “gaze- 
contingent” to Celeste McCollough. 


of the target is subjected to a horizontal 
stretch. When the target moves from 
down to up relative to S’s head, and 
S’s gaze consequently shifts from down 
to up in Fig. 1, the image of the target 
is subjected to a vertical shear. Sup- 
posing that S fixates the target through- 
out its relative movement, this means 
that his foveal image is subject to a 
variable distortion. No fixed adapta- 
tion at the fovea, such as a figural after- 
effect, could compensate for this vary- 
ing distortion. Yet Kohler found that 
when he wore prism spectacles for a 
long period (up to 60 days), the gaze- 
contingent distortion of objects became 
less visible. The adaptation was con- 
firmed by negative aftereffects when 
the prism spectacles were removed: 
then objects moving from right to left 
appeared to contract horizontally, and 
objects moving from below to above 
the head sheared in the direction oppo- 
site that originally imposed by the 
spectacles. Kohler inferred from these 
observations that his visual system had 
developed a gaze-contingent adaptation, 
an adaptation that varied in a sys- 
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Fic. 1. 


Gaze-contingent distortions pro- 
duced by 20-diopter prism spectacles, base 


right. (The target is square. Off-center 
views are 20° from center. The drawing 
was traced from photographs of the test 
target used in this study.) 


tematic fashion with the position of his 
eyes in his head. 

Pick and Hay (1964, in press) con- 
firmed Kohler's observations using 
eight Ss and a variety of tests which 
verified the generality of the gaze-con- 
tingent adaptation. Aside from show- 
ing that the adaptation appeared to 
reside somewhere in the visual system, 
since targets of all kinds showed the 
adaptation, whether the eyes scanned 
the target or remained fixed while the 
head moved, the only clue found to the 
nature of gaze-contingent adaptation 
was that adaptation to image shearing 
was greater than that to image stretch- 
ing. In Pick and Hay’s study, all S’s 
wore horizontally oriented spectacle 
prisms; that is, the prism bases were 
either both to S’s right, or both to his 
left. (To see the effects of base-left 
prisms, rotate Fig. 1 by 180°.) This 
meant that image stretching was always 
associated with horizontal movements 
of the eyes in the head, while image 
shearing was always associated with 
vertical eye-head movements. The dif- 
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ference in adaptation might be due 
either to the difference in optical stimu- 
lation (shearing vs. stretching) or to 
the difference in the correlated motor 
activity (vertical vs. horizontal eye- 
head movements). 

The present study compared these 
optical and motor hypotheses as to the 
critical factor in gaze-contingent adap- 
tation by having half the Ss wear ver- 
tically oriented prisms; that is, specta- 
cles with the prism bases either both 
up or both down. (The resulting dis- 
tortions can be seen by turning Fig. 1 
90° counterclockwise or clockwise, re- 
spectively.) For these Ss, shearing 
was associated with horizontal eye- 
head movements, and stretching with 
vertical eye-head movements. If the 
optical hypothesis were correct, the 
adaptation to shearing should continue 
to exceed that to stretching. If the 
motor hypothesis were correct, the 
adaptation to stretching should now ex- 
ceed that to shearing. 

A secondary purpose of the present 
study was to make a direct compari- 
son of the gaze-contingent adaptations 
with the more familiar “gaze-invariant” 
adaptations to prism spectacles : curva- 
ture (Gibson, 1933), color fringes 
(Hay, Pick, & Rosser, 1963), and dis- 
placement (Held & Hein, 1958). 
(These prism distortions are only rela- 
tively invariant with gaze ; whether the 
adaptation to them has a gaze-contin- 
gent component is as yet uncertain. 
See Pick & Hay, in press.) 


EXPERIMENT I 
Method 


Adaptation conditions—Two groups of 
eight Ss were employed, one group wearing 
binocular horizontal prisms, the other wear- 
ing binocular vertical prisms. Prism ex- 
posure was 6 days, a period that induced 
the larger share of adaptation in an earlier 
42-day study (Pick & Hay, 1964). During 
this period Ss carried on their normal activi- 
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ties as students, never removing the specta- 
cles except when their eyes were shut. The 
prisms were 20 diopters in strength, made of 
crown spectacle glass with flat surfaces. 
They were mounted in 40-mm. round frames 
with black side occluders, and afforded an 
approximately 90° field of view. All Ss had 
normal acuity, stereoscopy, color vision, and 
lack of astigmatism. There were equal 
numbers of men and women in each group, 
with ages ranging from 18 to 32 yr. In the 
horizontal-prism group, half the Ss wore 
base-left prisms, half wore base-right prisms; 
in the vertical-prism group, half wore base- 
down, half wore base-up prisms. Reversing 
the bases, of course, reverses the left-right 
or up-down direction of the distortions, and 
thus serves as a control for accidental biases 
in the test procedure. 


Adaptation Tests 


Prism-series test —The principal test was 
modified from a measure found to agree with 
other indexes of adaptation (the “spectacle 
series” in Pick & Hay, in press). The S 
removed his adaptation spectacles and was 
given a graded series of weaker test spec- 
tacles, for each of which he described the 
distortions on a cross-and-square target like 
that of Fig. 1. To check for gaze-contin- 
gent distortions, S was told to fixate the 
center of the target and to turn his head 
both horizontally and vertically, thus produc- 
ing in his foveal image the Stretching and 
shearing of Fig. 1. 
invariant distortions, he was told to describe 
the straightness of the cross limbs, and 
whether there were any color fringes along 
their edges. Each $ was given two runs 
through the test just prior to his adaptation 


one-diopter strength prisms, he was dropped 
from the experiment, Tests were given at 
48, 72, 96, and 120 hr. of prism exposure 
(counting the time S Spent sleeping). The 


one-diopter steps, including a Zero-power 
Each prism 
SE Meena a three times during a 
‘est, in random order, followin the 

of constant stimuli, k aM 


added at the end of the regular test series, 
The test spectacles were identical to the 
adaptation spectacles, except that they oc- 
cluded the left eye, so that transient con- 
vergence errors induced by changing prisms 
would not disturb S's vision. 

The cross-and Square target stood 2.5 m. 
from $ in his frontal-parallel plane, its cen- 
ter at eye-level, and its limbs spanning 45? of 
visual angle. The cross was white (10 
mL.), against a black background (0.5 mL.), 
with a 2,500* K. color temperature. 

Displacement tests—A test of eye-hand 
coordination was used to measure adaptation 
to the displacement effect of the prism spec- 
tacles (Held & Hein, 1958). The S's head 
was fixed by a biteboard, and empty specta- 
cle frames were worn. The targets were 
mounted behind a vertical glass plate 50 cm. 
from his nose, in the frontal-parallel plane. 
After looking at a target briefly, S shut his 
€yes and marked its direction on the glass 
with his right forefinger. Vertical and hori- 
zontal marking errors were read from the 
other side of the glass by E. On each test, 
three target positions were used, each being 
marked four times. After two pretests, this 
test was given at 12, 24, 48, and 72 hr. of 
prism exposure. 

A test of nonvisual, “head-hand” coordina- 
tion was given on the same test occasions, 
to measure the nonvisual coordination 
changes Previously found to be induced by 
prism exposure (Harris, 1963; Pick, Hay, & 
Pabst, 1963). With his eyes shut, S used 
his right forefinger to mark the spot directly 
in front of his nose on the glass plate. Once 
again, vertical and horizontal marking errors 
could be measured. Ten such markings were 
made on each test Occasion, prior to the eye- 
hand test. 


Results 


Two thresholds were computed from 
the prism-series data: The spectacles- 
distortion. threshold was the prism- 
Strength required to make $ see the 
distortion in the direction characteristic 
of his adaptation Spectacles. The op- 
bosite-distortion threshold was the 
prism Strength needed to make him see 
the distortion in the direction opposite 
that of his adaptation | spectacles. 
These two thresholds define the region 
of indifference to the distortion that is 
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plotted in Fig. 2 for the gaze-contin- 
gent distortions and in Fig. 3 for the 
gaze-invariant distortions. Positive 
ordinate values indicate prisms with 
the same base side as the adaptation 
spectacles; negative values indicate 
prisms with the opposite base side. 
Figures 2 and 3 were analyzed in 
terms of two visual functions: the 
visual-null point, or center of the 
region of indifference, and the jnd, or 
half-width of the indifference region. 
Gaze-contingent adaptations.—The 
gaze-contingent adaptation observed in- 
volved only changes in null point; none 
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Fic. 2. Gaze-contingent adaptation during 
exposure to 20-diopter prism spectacles. 
(The upper boundary of the shaded area for 
each prism-orientation group is the group's 
mean spectacles-distortion threshold; the 
lower boundary is the group's mean opposite- 
distortion threshold. Each data point is the 
mean of eight Ss.) 
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Fic. 3. Gaze-invariant color fringe and 
curvature adaptation to 20-diopter prism 
spectacles. 


of the jnd changes approached sig- 
nificance. 

Neither the optical nor the motor hy- 
pothesis was supported by the results 
in Fig. 2. The optical hypothesis, that 
prismatic shearing is especially effective 
in inducing adaptation, is disproved by 
the absence of a shearing adaptation in 
the vertical-prism group: the mean 
null-point change for 48 through 
120 hr. was —0.23+0.50 diopters. 
(Ninety-five per cent confidence limits 
will be given throughout.) The pre- 
vious finding of a strong shearing 
adaptation for horizontal prisms is, on 
the other hand, confirmed: +1.53 
Æ 0,85 diopters, The motor hypoth- 
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esis, that vertical eye-head movements 
induce stronger gaze-contingent adap- 
tation, implies that stretching adapta- 
tion should be greater in the vertical- 
prism group. The data in Fig. 2 show 
just the opposite trend: +0.53 + 0.40 
diopters for the vertical-prism group, 
and +0.74 + 0.35 diopters for the hori- 
zontal-prism group. 

One inference from Fig. 2 might be 
that prism orientation, per se, some- 
how controls the extent of gaze-con- 
tingent adaptation. This is supported 
by a significantly greater overall null- 
point shift in the horizontal-prism 
group: F (1, 14) = 16.54, p < .005. 
This inference is cast in doubt, how- 
ever, by the significant interaction be- 
tween prism orientation and type of 
distortion: F (1, 14) = 9.70, p<.01. 
This interaction is attributable to the 
fact that only the difference in shearing 
adaptation reaches significance: t(14) 
= 429, p < 01; for the difference in 
stretching adaptation, £( 14) = 91, 
(In this connection, it should be noticed 
that much of the difference between 
prism groups shown on the graph of 
stretching adaptation is present before 
prism exposure begins, ) 

Gaze-invariant adaptations.—In con- 
trast to the results for shearing adapta- 
tion, the prism-orientation groups 
Showed similar records for curvature 
and color fringes (Fig. 3). The mean 
null-point change for curvature was 
+1.29 + 0.49 for the horizontal-prism 
group, +1.52 + 0.61 for the vertical- 
prismgroup. The same means for color 
fringes were +2,32 + 0,57 and +2.42 
+ 0.42, respectively. This similarity 
implies that no overall difference in 
prism exposure can account for the dif- 
ference in shearing adaptation, 

A second difference from the gaze- 
contingent data is that jnd increases 
appeared to occur. For curvature 
the horizontal- and vertical-prism 
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changes were +0.77 + 0.52 and + 0.42 
=Æ 0.47 diopters, respectively. For 
color fringes, the same values were 
+0.26 + 0.28 and +0.56 + 0.54 
diopters. 

Displacement adaptation.—Figure 4 
shows the results for the eye-hand and 
head-hand coordination tests. The 
ordinate shows changes in the mean 
direction of pointing relative to the 
preexposure values. (The diopter 
measure is 100 times the tangent of 
the displacement angle.) 

The horizontal-prism results are con- 
sistent with a previous study which 
found that eye-hand adaptation is ac- 
companied by a transient change in 
nonvisual localization (Hay & Pick, 
1966). The vertical-prism group 
shows less adaptation on both tests: 
F (1, 14) 257, p<.05. The eye- 
hand test shows greater changes for 
both prism groups: F (1, 14) = 60, 
b «.05. There is no reliable interac- 
tion: F (1, 14) = 0.85. 

The most striking feature of the 
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Fic. 4. Eye-hand and head-hand coordi- 
nation changes during exposure to 20-diopter 
Prism spectacles. (Each data point is the 
mean of eight Ss.) d 
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data in Fig. 4 is the complete absence 
of a head-hand coordination change in 
the vertical-prism group, at least on the 
test occasions employed. (It is pos- 
sible that a change occurred which was 
far more transient than that in the 
horizontal-prism group.) In terms of 
the analysis presented by Hay and 
Pick (1966), this finding indicates 
that an adaptation specific to the visual 
system occurs more readily for vertical 
displacement than for horizontal dis- 
placement. 


Discussion 


The principal finding for the gaze-con- 
tingent adaptations is that shearing adap- 
tation is critically dependent on prism 
orientation, while stretch adaptation is 
not. Although these results do not agree 
with either the optical or motor hypoth- 
esis, they can be interpreted in a number 
of ways. 

Ocular-torsion hypothesis—A revision 
of the original motor hypothesis may be 
made, based on the fact that shearing 
adaptation did conform to that hypothesis’ 
prediction, even though stretching did 
not. In this connection, it is notable that 
eye movements by themselves could theo- 
retically compensate, in a fashion, for 
shearing, while they could not do so for 
stretching. Inspection of Fig. 1 shows 
that the shearing consists, for base-right 
prism spectacles, of a tilting of horizontal 
lines when the eyes move in a vertical 
fashion; the same would be true for base- 
left spectacles. If those vertical eye 
movements were to develop a torsional 
component, the net tilting of horizontal 
lines on the retina could be reduced. 
(Vertical-image lines would, of course, 
be equally affected by the ocular torsion.) 
But if ocular torsion can produce adap- 
tation for horizontal prisms, why would 
it not also do so for vertical prisms? On 
this point it may be noted that both the 
muscular and neural control of vertical 
and horizontal eye movements are quite 
different (Alpern, 1962, pp. 26 f., 76 f.). 
For example, the muscles mainly re- 
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sponsible for horizontal eye movement 
are relatively incapable of exerting tor- 
sional forces on the eyeball (Alpern, p. 
34). Hence a torsional component might 
be less readily added to horizontal eye 
movements than to vertical eye move- 
ments, and it is the former which are 
associated with shearing when vertical 
prisms are worn. 

Shearing-orientation hypothesis —A re- 
vision of the original optical hypothesis 
can also be made, based on the fact that 
the retinal orientation of shearing (and 
of stretching) differs for horizontal and 
vertical prisms. Horizontal shearing, 
produced by vertical prisms, and vertical 
shearing, produced by horizontal prisms, 
may stimulate distinct mechanisms, and 
adaptation may be possible for the latter 
but not the former. 

Binocular-interaction hypothesis.—The 
fact that the two eyes are horizontally 
separated, and their horizontal move- 
ments, unlike their vertical movements, 
are sometimes opposed, opens the way 
to interactions between the orientation of 
the prisms and adaptation. Several pos- 
sible hypotheses arise. For example, 
opposite shearing will occur for the two 
eyes of a vertical-prism S whenever he 
changes the convergence of his eyes, 
whereas shearing will never be binocu- 
larly disparate for a horizontal-prism S. 
Binocular disparity in the prism distor- 
tion might well inhibit adaptation to it. 


Experiment II 


To test the alternative explanations 
of the effect of prism orientation on 
shearing adaptation, the procedure of 
Exp. I was repeated, with three major 
changes, and several minor ones. 

The first major change was that Ss 
wore monocular adaptation spectacles, 
their left eyes being occluded. Tf 
binocular disparity of the shearing dis- 
tortion is responsible for the lack of 
shearing adaptation in the vertical- 
prism group of Exp. I, such adapta- 
tion should now occur. 
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Ss were asked to judge additional fea- 
tures of the gaze-contingent shearing 
caused by prisms, In the first experi- 
ment, only the primary shearing effect 
of prisms was tested. Examination of 
Fig. 1 shows that there is a secondary 
kind of shearing, which is actually due 
to the curvature effect of prisms. As 
the gaze shifts from down to up in Fig. 
1, the vertical limb of the cross tilts 
counterclockwise, opposing the clock- 
wise tilt of the horizontal limb. This 
secondary effect will be termed counter- 
shearing. Countershearing permits a 
test of the shearing-orientation hypoth- 
esis: Vertical prisms should induce 
adaptation to countershearing, since it 
has the same retinal orientation as 
primary shearing for horizontal prisms. 

The new judgment of countershear- 
ing also tested the ocular-torsion hy- 
pothesis. A torsion of the eyes could 
not simultaneously compensate for 
primary and countershearing, since the 
two involve opposite tiltings, Indeed, 
an ocular torsion that diminished pri- 
mary image shearing on the retina 
would increase the image movement 
due to countershearing. Negative 
adaptation is therefore predicted for 
countershearing in the horizontal-prism 
group. To further test the ocular-tor- 
sion hypothesis, $ was also asked to 
judge the scissors movement of the 
target, which can also be seen in Fig. 1. 
That is, as the gaze shifts from down 
to up, the quadrants of the cross change 
their sizes. Adaptation to scissors 
movement could clearly not be ex- 
plained by a torsion of the eye. 

The third major change was the iso- 
lation of the gaze-contingent distor- 
tions from the gaze-invariant distor- 
tions. A new target was used to pre- 
vent curvature and color fringes from 
being visible during the prism-series 
test. This insured that S's judgments 
of the gaze-contingent distortions were 
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not influenced by his observations of 
gaze-invariant distortions. 


Method 


Adaptation conditions—Sixteen new Ss 
were used in the same design as before. The 
exposure period was cut to 8 hr., since shear- 
ing adaptation seemed to occur quickly in 
Exp. I. To make the exposure as uniform 
and intense as possible, Ss were escorted 
out of doors on walks, rides, and a picnic, 
four at a time (divided equally among the 
prism-orientation groups). The same spec- 
tacles as before were used, with a foam- 
rubber occluder substituted for the left-eye 
prism. 

Adaptation tests—The same prism series 
was used, with a modified target. The lat- 
ter was a projected 35-mm. transparency of 
the target used in Exp. I, the photograph 
having been taken through a 20-diopter prism 
(matching the “downward view” in Fig. 1). 
Deep red filters were fixed over the test 
spectacles. This resulted in S being able to 
see only the brightly illuminated target, and 
b observe only gaze-contingent distortions 
of it. 


Results 


The adaptation to primary shearing 
closely matched that obtained in Exp. 
I: +1.50 - 0.82 diopters for horizontal 
prisms, +0.12 + 0.53 diopters for ver- 
tical prisms. ^ This contradicts the 
binocular-interaction hypothesis. It 
also shows that the same level of 
shearing adaptation can be attained in 
8 hr. as in 6 days. 

The results for countershearing 
showed a "negative adaptation" for the 
horizontal-prism group: —1.23 + 0.92 
diopters. This supports the ocular-tor- 
sion hypothesis, as does the correlation 
between shearing adaptation and coun- 
tershearing adaptation: r= —.69, p 
< 10. On the other hand, the vertical- 
prism group showed no reliable adapta- 
tion for countershearing : -+0.42 + 0.80 
diopters. | This fails to support the 
shearing-orientation hypothesis. 

The results for scissors adaptation 
were consistent with the above, No 
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Fic. 5. Gaze-contingent rotation afteref- 
fects induced by exposure to prismatic image 
shearing. (The case illustrated is for 20- 
diopter base-right adaptation spectacles. The 
size of the aftereffect relative to the prism 
distortion has been exaggerated for clarity.) 


adaptation was found in either the 
horizontal- or vertical-prism groups: 
—0.10 + 0.54 and +0.20 + 0.63 diop- 
ters respectively. 

The nature of these results is sum- 
marized in Fig. 5. 

Stretching adaptation was not found 
in either prism group: +0.17 + 0.35 
and +0.23 + 0.34 diopters for hori- 
zontal- and vertical-prisms, respec- 
tively. Failure to induce detectable 
stretching adaptation in 8 hr. is con- 
sistent with the low level of this adapta- 
tion even after 42 days (Pick & Hay, 
in press). 


Discussion 


The ocular-torsion hypothesis of shear- 
ing adaptation is supported by the nega- 
tive adaptation to countershearing, and 
the lack of adaptation to the scissors 
movement. The amount of ocular torsion 
needed to account for the observed adap- 
tation to primary shearing is about 1°; 
this is well within the limits of known 
torsional effects (Alpern, 1962, p. 22 f.). 
These results may be compared to those 
of Howard and Templeton (1964) who 
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directly measured the ocular torsion 
elicited by tilted lines. A stationary tilt 
failed to elicit torsion, but a line of 
changing tilt did elicit torsion of 1° or 
more. The latter stimulus is comparable 
to the prism-parallel tilting involved in 
primary shearing. 

The ocular-torsion hypothesis is similar 
in nature to one derived from experiments 
on prismatic displacement (Hay & Pick, 
1966). In both hypotheses, an ocular 
factor is the source of adaptation to the 
prism: in the case of displacement, the 
source is a shift of ocular posture which 
restores the target image to the fovea; 
in the case of shearing, the source is a 
torsional component added to ocular move- 
ments, which reduces image motion on 
the retina. It is perhaps significant that, 
of all the prism adaptations measured in 
Exp. I, displacement adaptation was the 
only one besides shearing which changed 
with prism orientation. This is consistent 
with the view that ocular compensation is 
critically dependent on the particular ocu- 
lomotor system involved. Both hypoth- 
eses involve a further assumption: the 
central mechanism which takes account of 
ocular posture and movement in analyzing 
retinal stimulation (cf. Alpern, 1962, p. 
171 f.) must be adjusted, in order that 
the new, prism-compensating posture or 
movement is responded to in the same way 
as the old, preprism posture or move- 
ment (cf. Harris, 1965; Held, 1961). 

The results for gaze- contingent stretch- 
ing suggest that it is different in nature 
from the shearing adaptation. It is not 
sensitive to prism orientation; it is not so 
strong or so readily induced; it is clearly 
not explicable on the basis of ocular 
torsion. 
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CONCEPT IDENTIFICATION AS A FUNCTION OF 
PROBABILITY OF POSITIVE INSTANCES AND 
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Probability of positive instances (.125, .250, or .500), the number of 


relevant stimulus dimensions (1, 2, 


3, or 4), and the sex of S were 


factorially combined in a concept-identification task. Trials to cri- 
terion were found to increase linearly with the number of relevant 


stimulus dimensions and log: of the 
The number of positive instances to 


probability of positive instances. 
criterion increased linearly with 


the number of relevant stimulus dimensions, and the slope of the 
function increased as the probability of positive instances increased. 
The results indicate that negative instances serve an important role in 
the process of identifying the concept. The results are discussed in 
terms of the way in which Ss process information in the task. 


In the typical concept-identification 
task, S is required to learn to classify 
a series of stimuli according to certain 
rules (concepts) chosen by E. The 
classification is often based on two cate- 
gories, one representing examples of 
the concept (positive instances) and 
the other category representing non- 
examples (negative instances). Par- 
ticular concepts may differ in the type 
of rule (eg., conjunctive, disjunctive, 
or relational) that is used to combine 
the relevant stimulus attributes to form 
the concept. The exact proportion of 
positive instances (P) of a conjunctive 
concept in a set of stimuli composed of 
all combinations of the attributes on 
binary stimulus dimensions is given by 
P —2-À, where R is the number of 
relevant stimulus dimensions. Thus 
with conjunctive concepts, the relative 
size of the subset of positive instances 
decreases as the number of relevant 
dimensions increases. 


1'This research was supported in part by 
Predoctoral Fellowship No. MH-20,485 from 
the National Institutes of Health. The 
author is indebted to E. J. Archer and R. C. 
Calfee for their assistance in the preparation 
of the manuscript. 
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Considerable attention has been 
given to the role of positive and nega- 
tive instances in the concept-identifica- 
tion task. Smoke (1933) concluded 
that the information conveyed by posi- 
tive instances is easier for S to use in 
arriving at a solution. Whitman and 
Garner (1963) argue that the primary 
function of negative instances is to 
define the total set of stimuli from 
which the particular subset (of positive 
instances) is drawn, and that, if the 
subset itself defines the larger set, then 
negative instances are of no use to S, 
and they may make the problem more 
difficult by adding “noise” to the stimu- 
lus situation. Hovland and Weiss 
(1953) controlled the amount of infor- 
mation in various stimulus series and 
found that S is more likely to discover 
a concept conveyed by all positive in- 
stances than by all negative instances 
or by mixed positive and negative 
instances. 

In the situation where S is presented 
with the stimuli successively and in a 
random order, it is a nearly impossible 
task to store all the information in 
each stimulus for comparison with 
subsequent stimuli. In such a situa- 
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tion, there are several strategies (vary- 
ing in efficiency) that S may employ 
which are effective in spite of limited 
memory capacity. Among the more 
common strategies discussed by Bruner, 
Goodnow, and Austin (1956) and 
Bourne (1963) are: (a) wholist 
strategy—S considers all of the infor- 
mation in one positive instance (the 
focus) and eliminates irrelevant infor- 
mation when subsequent positive in- 
stances are encountered, that is, S 
takes the stimulus attributes common 
to positive instances as his hypothesis; 
and (b) hypothesis-testing strategy—S 
selects some aspect of the stimulus 
(usually a positive instance) as his 
hypothesis of the concept and proceeds 
to test the hypothesis against subse- 
quent stimuli, 

Although negative instances are of 
little use with the wholist strategy, both 
positive and negative instances may in- 
firm an incorrect hypothesis for the 
hypothesis tester. The problem is to 
determine the relative usefulness of 
positive and negative instances for 
hypothesis testing. 

Another question which has been 
the subject of investigation is the rela- 
tionship between the number of non- 
redundant, relevant stimulus dimen- 
sions and performance in a concept- 
identification task. Several studies 
have shown that performance (meas- 
ured by trials, time, or errors to cri- 
terion) is a monotonically decreasing 
function of the number of relevant 
dimensions. However, the findings are 
inconsistant as to the nature of the 
function. Walker and Bourne (1961) 
varied the number of relevant dimen- 
sions and found a Positively accelerated 
function in the range of one-three rele- 
vant dimensions, However, the num- 
ber of relevant dimensions was con- 
founded with response uncertainty 
since the number of response categories 
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increased with the number of relevant 
stimulus dimensions. Nonetheless, 
Bourne, Goldstein, and Link (1964) 
using only two classification categories, 
reported a similar function in the 
same range. In their study, the prob- 
ability of positive instances was con- 
founded with relevant information 
since the probability decreased as the 
number of relevant dimensions in- 
creased, and in fact they report that 
only the linear component of the func- 
tion is significant when the analysis 
is performed on the number of positive 
instances to solution. Bulgarella and ` 
Archer (1962), using auditory stimuli, 
held the proportion of positive in- 
stances constant over the range of one- 
three relevant dimensions and found 
that the relationship between number 
of relevant dimensions and trials to 
solution was linear. 

The nature of the effects of number 
of relevant stimulus dimensions and 
probability of positive instances on per- 
formance in concept identification 
should offer important clues about the 
way in which S processes information 
in the task. The present experiment 
was designed to determine the effects 
of these two variables and their 
interaction, 


METHOD 


Subjects—The Ss were 168 students in 
elementary psychology courses at the Uni- 
versity of Wisconsin. They were given 
class credit for participation. Each S was 
randomly assigned to one of the conditions 
in the order of his appearance for the ex- 
periment, Of the 168 Ss, 9 were discarded 
due to apparatus malfunction and 15 failed 
to reach criterion within the limit of 40 min. 
or 256 trials, whichever occurred first. Of 
the failing Ss, there were 3, 2, 5, and 5 in 
conditions with one, two, three, and four 
relevant dimensions, respectively, and the 
failures were evenly divided across the three 
levels of positive instance probability. Data 
from the 72 men and 72 women who solved 
their problems were used for all analyses. 


PROBABILITY OF POSITIVE INSTANCES AND CONCEPTS 


Design—An orthogonal design of four 
levels of number of relevant dimensions (one, 
two, three, and four), three levels of proba- 
bility of occurrence of positive instances 
(.125, .250, and .500), and the sex of S was 
employed. Two problems differing in the 
particular relevant dimensions and/or the 
sequence of patterns were nested within each 
of the 12 conditions formed by the factorial 
combination of amount of relevant informa- 
tion and probability of positive instances. 
The problems were repeated on sex. Within 
each of these 48 combinations were nested 3 
Ss to account for the total of 144 Ss. 

Apparatus.—There were four major appa- 
ratus components: (a) S’s response and 
feedback panel consisting of response levers 
labeled Examples and Nonexamples for 
positive and negative instances, respectively, 
and a signal lamp mounted above each of 
the levers; (b) a Series 80000, one-plane 
readout unit manufactured by Industrial 
Electronics Engineers, Incorporated which 
presented the stimulus patterns to be de- 
scribed; (c) two Western Union tape trans- 
mitters which programed the stimuli and 
feedback lamps; and (d) a timing and 
control device which determined the dura- 
tion of the feedback and postfeedback inter- 
vals and controlled the operation of the tape 
transmitters. 

Stimuli.—The stimulus patterns were gen- 
erated by the combinations of the presence 
or absence of each of 12 lines arranged such 
that 8 of the lines formed a square and 
the remaining 4 lines quartered the square, 
2 horizontally and 2 vertically. The entire 
set of 12 lines is shown in Fig. 1 along with 
a sample of patterns which occurred in 
various conditions of the experiment. The 
number of irrelevant lines was held con- 
stant for all conditions at 2. The lines which 
were neither relevant nor irrelevant for a 
particular condition were either on in all of 
the stimulus patterns or off in all of the 
patterns for that condition. The Ss were 
informed that the 4 lines in the center would 
be on during every trial. Combination of 
the values of the two bilevel (on or off) 
irrelevant dimensions resulted in four varia- 
tions of positive instances for all conditions. 
For particular concepts, certain lines were 
required to be on and/or others off. The 
number required to be on was balanced with 
the number required to be off across the two 
problems. 

Procedure.—The Ss were given detailed 
instructions about the nature of conjunctive 
concepts and the meaning of relevant, irrele- 
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Fic. 1, Various combinations of the lines 
in the pattern in the upper left constituted 
the stimuli. (The other three patterns are 
examples of stimuli occurring in various 
experimental conditions.) 


vant, and nonvarying (quiet) stimulus di- 
mensions. The significance of positive and 
negative instances was explained, and S was 
informed that the first pattern would be an 
example of the concept. 

The order of presentation of the stimulus 
patterns was random with the restriction on 
the proportion of positive instances which 
was met in each block of 32 patterns. 

On each trial, a stimulus was presented 
and remained in view until S’s response 
classifying the stimulus as an example or a 
nonexample. The response initiated a 2.0- 
sec. informative feedback interval during 
which the lamp above the correct classifica- 
tion lever was lighted. The stimulus pattern 
and the informative feedback terminated 
simultaneously and 2.0 sec. later, the next 
stimulus was presented. The criterion was 
32 consecutive correct classifications includ- 
ing at least four positive instances. 


RESULTS 


Trials to solution.—In Fig. 2 mean 
trials to criterion are shown as a func- 
tion of number of relevant stimulus 
dimensions and probability of positive 
instances. Analysis of variance of 
trials to criterion indicated that the 
data support the hypothesis of a linear 
relationship between trials to criterion 
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Fic. 2. Mean number of trials to criterion, 
(The dotted line connects the points which 
would occur without a restriction on positive 
instance probability.) 


and the number of relevant dimensions, 
F (1, 12) 266.06, p< 001. The 
broken line in Fig. 2 connects the 
points which would occur if the stimu- 
lus instances were drawn at random 
from a complete and nonredundant set 
of stimulus patterns. The steeper slope 
of the broken line indicates that both 
number of relevant dimensions and the 
probability of positive instances con- 
tribute to the difficulty of the task; 
however, an analysis of variance of the 
conditions with these “natural” prob- 
abilities indicates that the function is 
still linear, F (1, 30) = 26.42, t 
< .001. 

The overall effect of probability of 
positive instances is also significant, 
and an orthogonal polynomial analysis 
indicates that the trend is linear, F (1, 
12) = 27.04, p< 001. However, 
since the intervals between the three 
probability levels were not equal, the 
significant linear component should be 
interpreted as indicating that successive 
halving of the probability leads to a 
linear increment in trials to criterion, 
or, equivalently, the number of trials 
to solution is a linear function of the 
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log of the probability of positive 
instances. 

A significant interaction, F (12, 96) 
= 2.52, p « .01, of the nested prob- 
lems with the sex of S suggests that 
men and women tend to differ on par- 
ticular problems. However, the main 
effect of sex was not significant, and 
there was no consistent tendency for 
one sex to perform better than the 
other. 

Positive instances to solution.—Fig- 
ure 3 presents the mean number of 
positive instances to criterion as a func- 
tion of the number of relevant dimen- 
sions and the probability of positive 
instances. ^ Analysis of variance of 
positive instances yielded much the 
same results as the analysis of trials 
for the main experimental effects. 
However, the interaction between the 
number of relevant dimensions and the 
Probability of positive instances was 
significant, F (6, 12) = 8.89, ? < 001. 
Furthermore, an orthogonal polynomial 
analysis indicated that the linear com- 
ponents representing the three levels 
of positive instance probability have 
significantly different slopes. That is, 
the slope of the function relating posi- 
tive instances to criterion to the number 
of relevant stimulus dimensions in- 


tz 50 


MEAN NU 
INSTANCI 
[s] 


o l 2 3 
NUMBER OF 
RELEVANT DIMENSIONS 
Fic. 3. Mean number of positive instances 

to criterion. 


PROBABILITY OF POSITIVE INSTANCES AND CONCEPTS 


creases as the probability of positive 
instances increases. 


Discussion 


Walker and Bourne (1961) and 
Bourne et al. (1964) report a positively 
accelerated function relating trials to 
criterion and the number of relevant 
stimulus dimensions in a concept-identi- 
fication task. The results of the present 
study, however, support the findings of 
Bulgarella and Archer (1962) who report 
a linear relationship. Since the function 
obtained by Bourne et al. (1964) was 
not replicated in the corresponding con- 
ditions of the present experiment (the 
points connected by the dotted line in 
Fig. 2), possible sources of disagreement 
should be examined. First, the stimulus 
displays were considerably different, 
Bourne et al. using geometrical dimen- 
sions (form, size, color, etc.). Second, 
Bourne et al. required their Ss to state 
a hypothesis after each trial. It is inter- 
esting to note that in the Bourne et al. 
study the positive acceleration virtually 
disappears in conditions where several 
previously presented stimuli are avail- 
able. In these conditions Ss make fewer 
memory errors. Perhaps the stimulus 
display used in the present experiment 
facilitates encoding of stimulus patterns 
for use later in the series. However, the 
overall difficulty of the present task was 
considerably greater than in the Bourne 
et al. experiment. 

The overall linear increase in trials 
to criterion with an increasing number 
of relevant stimulus dimensions has im- 
portant implications for the way Ss 
process information in the task. A 
wholist strategy, based on the elimination 
of irrelevant information, should lead to 
a constant number of trials to solution 
regardless of the number of relevant 
dimensions since there were two irrele- 
vant dimensions in all conditions. On 
the other hand, the number of conjunc- 
tive hypotheses increases exponentially 
with the number of variable stimulus 
dimensions which should lead to a posi- 
tively accelerated function relating trials 
to the number of relevant dimensions with 
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a random process of hypothesis selection. 
While it is probably unreasonable to 
think of S as “randomly” sampling hy- 
potheses, the question remains, how does 
he sample? Initially, the selection of a 
hypothesis is probably determined to a 
considerable extent by the obviousness 
(Archer, 1962) of the individual stimulus 
dimensions, and in the context of the 
present experiment, preference for par- 
ticular line patterns. Given that S has 
an initial hypothesis, the problem becomes 
one of determining how the resampling 
process leads to the correct hypothesis. 
One possibility is that resampling is de- 
termined by the degree of success that S 
has had with an incorrect hypothesis. 
That is, S resamples on individual dimen- 
sions in an attempt to retain whatever it 
is about a particular hypothesis that leads 
to more successes than expected. In 
this way, then, S tries to converge on the 
exact dimensions which are relevant by 
"amending" the current hypothesis. 
Another result of interest is the inter- 
action between probability of positive in- 
stances and number of relevant dimen- 
sions in the analysis of positive instances 
to criterion. If Ss were focusing on a 
single positive instance and eliminating 
irrelevant information when subsequent 
position instances were encountered, a 
constant number of positive instances 
should be required regardless of their 
probability. This is apparently so with 
one relevant dimension, but it is not true 
when the number of relevant dimensions 
increases. The hypothesis-testing strat- 
egy, however, depends upon both positive 
and negative instances to infirm incorrect 
hypotheses. The overall facilitative ef- 
fect of increasing the probability of posi- 
tive instances with trials to criterion 
would indicate that positive instances are 
used more efficiently, but this does not 
explain the interaction effect for number 
of positive instances to criterion. Con- 
sider, however, a hypothesis which con- 
tains some but not all relevant dimen- 
sions and no irrelevant dimensions. This 
hypothesis would produce errors only on 
certain negative instances. Since this 
type of hypothesis is more likely with 
more relevant dimensions in the prob- 
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lem, the rate of increase in number of 
positive instances to criterion with an 
increase in number of relevant dimensions 
is greater when the probability of posi- 
tive instances is greater (Fig. 3). The 
needed information is in negative in- 
stances, and S must see a number of 
positive instances determined by their 
probability in order to encounter the 
critical negative instances. The im- 
portant point is that negative instances 
are necessary to infirm certain hypoth- 
eses. While the results indicate that the 
wholist strategy is not an adequate model 
for this experimental paradigm, consid- 
erable work is indicated to identify the 
important characteristics of the hypoth- 
esis-testing strategy. Further research 
will be concerned with the nature of 
hypothesis amending. 
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FACTORS AFFECTING TRANSFER IN CONCEPT- 


IDENTIFICATION PROBLEMS * 


PEDER J. JOHNSON 


University of Colorado 


180 Ss were required to make a reversal (R), interdimensional 
(IRD), or intradimensional (IAD) shift in the solution of a concept- 
identification problem. For 3 of the Ss the stimuli contained 2, 4, or 6 
irrelevant dimensions throughout the problem, while the stimuli for the 
remaining Ss all contained 2 irrelevant dimensions prior to solution 
change, at which time 2 or 4 new irrelevant dimensions were intro- 
duced. The R and IRD postshift data were virtually identical, both 
showing an increase in difficulty as a function of the number of irrele- 
vant dimensions, while the IAD shift data showed no such increase. 
The introduction of new irrelevant dimensions resulted in an increase 
in difficulty only in the case of an IAD shift. The data were 
interpreted as providing partial support for hypothesis testing type 


models. 


This experiment investigated the ef- 
fects of (a) number of irrelevant 
stimulus dimensions and (b) adding 
new irrelevant dimensions on perform- 
ance in concept-identification problems 
which require a solution shift. The 
study was designed to provide evi- 
dence on certain assumptions made by 
hypothesis-testing (Bower & Trabasso, 
1963; Restle, 1962) and cue condi- 
tioning (Bourne & Restle, 1959) mod- 
els.of conceptual behavior. 

Previous research (Buss, 1956; 
Isaacs & Duncan, 1962; Kendler & 
Kendler, 1962) investigating various 
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shifts in problem solution have found 
differences in shift difficulty of the fol- 
lowing order : intradimensional (IAD), 
reversal (R), and interdimensional 
(IRD). The greatest ease of shifts 
in which the same dimension is rele- 
vant during both stages of the problem 
(IAD and R) has been interpreted as 
supporting mediational theory (Kend- 
ler & Kendler, 1962). All previous 
transfer studies of this type have held 
the number of irrelevant dimensions 
constant throughout the experiment. 
In this experiment the number of ir- 
relevant dimensions was systematically 
varied. According to mediational the- 
ory there should be no increase in task 
difficulty on the transfer phase (Stage 
II) as a function of the number of 
irrelevant dimensions if the medi- 
ational responses learned during acqui- 
sition (Stage I) remains correct on 
Stage II (R and IAD). 

Central to the “hypothesis-testing” 
models (Bower & Trabasco, 1963; 
Restle, 1962) is an assumption that 
an incorrect response on any trial leads 
to resampling, at random and with re- 
placement, from a population of hy- 
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potheses. The probability of solution 
on the first trial or following any error 
trial remains equal to the proportion 
of correct hypotheses in that popula- 
tion, Therefore any change in the 
solution of a problem leading to an 
error trial should result in resampling. 
If Ss resample, the mean number of 
errors to criterion on Stage II should 
increase as a function of the number 
of irrelevant dimensions regardless of 
the type of shift. 

According to cue-conditioning mod- 
els (Bourne & Restle, 1959) learning 
is considered to be the combined result 
of the gradual development of associ- 
ations between relevant (correct) cues 
and response categories and the grad- 
ual adaptation of irrelevant (incorrect) 
cues. Once a cue is adapted it be- 
comes ineffective as a determiner of 
response. The model implies that 
when S attains solution, all (or nearly 
all) cues are adapted. A second vari- 
able in this experiment involved the 
introduction of new irrelevant dimen- 
sions at the point where solution is 
changed. According to the cue- 
conditioning model these newly intro- 
duced cues will not be adapted and 
should therefore have a greater impact 
on shift performance than irrelevant 
dimensions present prior to the shift. 
Since the newly added dimensions 
were in all cases irrelevant, the cue- 
conditioning model would predict an 
increase in errors on the second solu- 
tion relative to the situation wherein, 
the same irrelevant dimensions are 
present throughout the problem. This 
manipulation also has direct implica- 
tions for mediational theory. Accord- 
ing to mediational theory the newly 
introduced dimensions should not lead 
to increased errors in the case of R 
or IAD shifts where the mediational 
response learned during Stage I re- 
mains correct for Stage IT, 
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METHOD 


Procedure.—The Ss were required to sort 
a series of visually presented stimuli into 
two categories by pressing (on each trial) 
one of two response buttons which repre- 
sented the two conceptual categories. The 
stimuli were geometric patterns (presented 
on a translucent screen with a stripfilm pro- 
jector) that varied on up to seven binary 
dimensions, e.g., Color, with levels red and 
green. The solution to any problem always 
involved a single dimension with the re- 
maining dimensions irrelevant to the cor- 
rect classification of the stimuli. 

Each S was given detailed instruction re- 
garding the nature of the task. The E de- 
scribed all of the possible dimensions (a 
total of seven) and their levels.2 The Ss 
were told that only one of the seven dimen- 
sions mentioned was relevant to solution of 
the problem and that the patterns would not 
necessarily vary on all seven dimensions. 
It was pointed out that each of the two but- 
tons on the control panel represented a con- 
cept or conceptual grouping and that only 
one button should be pressed in response to 
each pattern. The sequence of events was 
then explained: a pattern would appear on 
the screen and remain until a button was 
pressed; after this, the pattern would dis- 
appear and a light would be activated above 
the correct button for a period of 1 sec.; 
a 7-sec. blank interval followed the feed- 
back signal and then the next pattern would 
appear. The S was told that the experiment 
would end when he had made the necessary 
number of consecutively correct responses 
(10 in both Stages I and IT). 

Apparatus.—The stimulus patterns were 
photographed in a random sequence on 
16-mm. filmstrips. Two Dunning Animatic 
strip projectors were used to present the 
stimuli upon a milk-glass screen (9 X 13 
in.). The screen was mounted in an opaque 
partition approximately 2 ft. from S and at 
eye level. The geometric patterns projected 
on the screen were .75 in. on a side for small 
and 1.5 in. for large. In Stage II of the NRS 
condition when size was relevant the patterns 


were enlarged with small 1.0 in. and large 
2.0 in. 


2 The dimensions and their respective lev- 
els were as follows: Size (large-small), 
Color (red-green), Form (square-triangle), 
Vertical Position (top-bottom), Horizontal 
Position (left-right), Orientation (upright- 
tilted), and Number (one-two). 
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A panel, containing two black response 
buttons and two corresponding feedback 
lights, was situated on the table between S 
and the viewing screen. Either button on 
the control panel activated an Esterline- 
Angus operations recorder and a cascading 
series of electronic timing units. The 
Esterline-Angus recorded the correct re- 
sponse and S's actual response on each trial. 
'The timing units and relays controlled the 
duration of informative feedback and post- 
feedback interval. Informative feedback was 
determined by two Western Union tape 
transmitters programmed in sequence with 
the film in either of two projectors. The 
two stages of the experiment were presented 
to Ss without interruption by switching 
projector-transmitter couplings. 

Subjects.—The Ss were 180 students from 
elementary psychology classes. Each S was 
randomly assigned to 1 of 15 treatment 
combinations and served individually for one 
experimental session lasting approximately 
30 min. 

Design.—The Ss were assigned to the fol- 
lowing experimental conditions: 

1, Problems. Two problems, which dif- 
fered according to the relevant dimension 
were used in Stage I. For Problem S, size 
was relevant, and for Problem C, color was 
relevant. 

2. Number of Irrelevant Dimensions. 
Different problems contained two, four, or 
six irrelevant dimensions. With two ir- 
relevant size, color and form varied, while 
the remaining four dimensions were con- 
stant. With four irrelevant the number and 
vertical position dimensions were added. In 
the case of six irrelevant horizontal posi- 
tion and orientation were added. 

3. Shift Condition. After S reached cri- 
terion in Stage I an R, IRD, or IAD shift 
was made. The R shift simply entailed re- 
versing the classification used in Stage I. 
In the IRD condition the relevant dimen- 
sion was shifted from either size to color 
or from color to size, The IAD shift in- 
volved a change in the levels of the Stage I 
relevant dimension; for Problem C the shift 
was from red-green to yellow-blue, and for 
Problem S the shift was from small-large 
(.75 and 1.5 in.) to small-large (1.0 and 
2.0 in.). 

4. Presence-Absence. In Cond. P the 
same irrelevant dimensions were present 
during both Stage I and Stage II. In Cond. 
A problems contained two irrelevant dimen- 
sions on Stage I with zero, two, or four new 
irrelevant dimensions being introduced at the 
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onset of Stage II, resulting in a total of 
two, four, or six irrelevant dimensions on 
Stage II. 

The experiment is a 3 (dimension; two, 
four, and six) X3 (shift; R, IRD, and 
IAD) X2 (P and A) factorial. The de- 
sign is incomplete, however, as a result of 
the identity of Cond. P and A in the case 
of two irrelevant dimensions. 


RESULTS 


Separate 3 X 3 analyses of variance 
were performed on Cond. P and A for 
both stages of the experiment. To 
obtain an estimate of P-A effect in 
Stage II the two irrelevant dimension 
condition was omitted resulting in a 
3 x 2 x 2 factorial analysis. 

Since the results for trials and er- 
rors to criterion were virtually identi- 
cal only the analysis on errors are pre- 
sented, For Stage I there were no 
significant differences among type of 
shift, F (1, 90) 2 18 and .37 for 
Cond. P and A, respectively. In 
Cond. A where all Ss received two ir- 
relevant dimensions the dimension ef- 
fect was also not significant, F (2, 
90) —.14. In Cond. P the dimen- 
sion effect was significant, F (2, 90) 
= 5.85, p<.01, with a significant 
linear component, F (1, 90) = 9.79, 
p < .01. There was no significant dif- 
ference between problems (size and 
color), F (1, 90) = 1.10 and 71, p 
Spo; 

The mean errors to criterion for 
two, four, and six irrelevant dimen- 
sions for each of the experimental con- 
ditions of Stage II are presented in 
Table 1. Difficulty increases generally 
as a function of the number of irrele- 
vant dimensions, with the exception of 
the IAD shift in Cond. P. Stage II 
analyses show Dimensions to be a sig- 
nificant variable for both Cond. P, F 
(2, 90) =7.82 and Cond. A, F (2, 
90) = 9.13, p< .01. Type of shift 
was significant for Cond. P, F (2, 90) 
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TABLE 1 
Means AND SDs or Postsuirt (SrAGE II) 
Error Data 
No. Irrelevant Dimensions 
Shift Condition Present | Condition 
2 4 6 4 6 
R M | 4.67 | 12.58] 13.66 | 8.58 | 15.17 
SD | 3.22} 6.75} 9.78] 6.27 | 13.65 
IRD | M | 6.33 | 12.83 | 13.42 | 13.50 | 12.33 
SD | 7.81} 7.78| 9.77| 6.36| 8.93 
IAD] M | 1.00} 1.25 67] 2,50) 7.17 
SD | 1.80] 1.65 .65| 4.91] 7.65 


= 28.09, p < 01, and Cond. A, F (2, 
90) = 9.34, p < 01. The Shift x Di- 
mension interaction in Cond. P also 
approached significance, F (4, 90) 
= 2.63, p < 05 for trials and F (4, 
90) = 2.10, p «.10, for errors. In- 
spection of Table 1, however, suggests 
that the Shift x Dimension interaction 
effect was primarily the result of the 
difference between the IAD and the 
combined R and IRD data. A subse- 
quent analysis showed no significant 
difference between the R and IRD 
conditions, F (1, 66) — .09, and no 
significant interaction between shift 
and dimension when only the R and 
IRD shifts are considered, F (2, 66) 
= 08. These results further suggest 
that the IAD shift was primarily re- 
sponsible for the significant Shift 
X Dimension interaction. 

There was no significant difference 
between Cond. P and A, F (15:32) 
= 37, p>.10. However, there is 
strong evidence from Table 1 that 
Cond. A is more difficult than Cond. 
P for the IAD shift. In this shift 
alone the P-A difference is significant, 
F (1, 44) 2 867, p <.01 for errors 
to criterion, 
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Discussion 


According to cue-conditioning and 
hypothesis-testing models the difficulty of 
a CI task varies with the proportion of 
relevant (correct) cues of hypotheses, 
which are assumed to be a direct func- 
tion of the number of irrelevant dimen- 
sions. In this experiment the prediction 
that task difficulty will increase linearly 
with number of irrelevant dimensions 
was confirmed. The finding is in accord 
with previous research (Bourne & Hay- 
good, 1959, 1960) more directly con- 
cerned with this relationship. 

The hypothesis-testing models of Bower 
and Restle assume that any error trial 
leads to resampling from the complete 
hypothesis population. Any type of shift 
leading to an incorrect response should 
then result in resampling with the proba- 
bility of a correct hypothesis correspond- 
ing to the proportion of relevant dimen- 
sions. This implies that errors to crite- 
rion should increase as the number of 
irrelevant dimensions increase, regard- 
less of the type of shift. In contrast, 
mediational theory predicted no increase 
in task difficulty for the R and IAD 
shifts, where the mediational response 
learned during Stage I responding re- 
mained correct. 

The analysis of Stage IT data for Cond. 
P showed type of shift to be significant, 
lending support to mediational theory, 
but number of irrelevant dimensions was 
also significant as predicted from the 
hypothesis-testing model. A more de- 
tailed analysis of the results revealed the 
shift effect was due almost entirely to 
the difference between the IAD and the 
other two shifts (R and IRD) which did 
not differ from one another. The sig- 
nificant dimension effect was entirely the 
result of performance trends in the R and 
IRD shifts. 

j These results suggest that the hypothe- 
sis-testing model can be extended directly 
to transfer situations involving an IRD 
or R type of shift, but not to an IAD 
Shift. In the case of an IAD shift Ss do 
not appear to resample from the complete 
population of hypotheses, It seems more 
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likely that some subset of hypotheses as- 
sociated with the previously correct at- 
tributes has increased probability of 
being sampled. However, this result 
cannot be clearly interpreted as evidence 
for mediational responding. The IAD 
shift in the present experiment was con- 
founded with the possible effects of nov- 
elty associated with the newly introduced 
levels of the relevant dimension, and 
therefore must be considered as less cru- 
cial evidence of mediational responding 
than the observation of similar results 
in an R type of shift. 

The cue-conditioning model implies 
that at the termination of Stage I all (or 
nearly all) irrelevant cues are adapted. 
The newly introduced dimensions in 
Cond. A should have a greater saliency 
or probability of being sampled at the 
onset of Stage II. Since the solution to 
all problems in Stage II involved a di- 
mension present in Stage I, errors should 
be greater in Cond. A than in Cond. P. 
Thus the cue-conditioning model pre- 
dicted an overall significant effect be- 
tween Cond. P and A, with Cond. A 
showing a greater number of trials and 
errors to criterion over all types of shift. 
On the contrary, mediational theory pre- 
dicted no difference in difficulty between 
Cond. P and A for the R and IAD shifts 
because the mediational response learned 
during acquisition was correct for Stage 
II responding. The mediational theory 
would make no explicit predictions with 
respect to a P to A difference for the 
IRD shift. 

Analyses of variance showed no reli- 
able performance difference between 
Cond. P and A over all three shifts. An 
inspection of mean scores, however, sug- 
gested a difference in difficulty for the 
IAD data, which a separate statistical 
analysis demonstrated to be significant. 
The results are ambiguous in the sense 
that they support the notion of an adap- 
tation process in the case of an IAD 
shift but do not for an R or IRD type 
of shift. An examination of assump- 
tions of the cue-conditioning model sug- 
gests a possible explanation of why the 


process of adaptation appeared to func- 
tion for one shift and not the other. 

In an early paper, Restle (1955) stated 
that any change in the validity of cues in 
a problem results in an erasure or com- 
plete elimination of previous condition- 
ing and adaptation. From the results 
on Cond. P, Stage I and II, it appears 
that Ss essentially start over when pre- 
sented R or IRD shifts, since there was 
no significant difference between Stages 
I and II for these shifts with both show- 
ing a dimension effect. Therefore, there 
should be no Cond. P-A effect on R and 
IRD shifts, since if Ss start over at the 
onset of Stage II they can be thought of 
as facing identical problems in Cond. P 
and A. There was, however, a difference 
in IAD data from Stages I to II in 
Cond. P, suggesting that such an erasure 
process perhaps does not occur for this 
type of shift. If erasure does not occur 
the adaptation of irrelevant cues during 
Stage I would be expected to have an 
effect upon Stage II responding. At the 
onset of Stage II in Cond. P all cues 
are assumedly adapted except those as- 
sociated with the new levels appearing on 
the relevant dimension for an IAD shift. 
In terms of cue validities all the cues as- 
sociated with the new irrelevant dimen- 
sions would be expected because of their 
novelty to have some validity greater than 
0.0 and therefore have the greatest prob- 
ability of being sampled or utilized. In 
the case of Cond. A, however, there are 
unadapted cues associated with the newly 
introduced dimensions that can be as- 
sumed also to have a cue validity 
greater than 0.0 and would be expected 
to compete with the cues associated with 
the new levels of the relevant dimension. 
"The difficulty of an IAD shift in Cond. A 
should then increase as a linear func- 
tion of the number of newly added dimen- 
sions as it appears from the results. 

Previous research (Kendler & Kendler, 
1962) investigating R and IRD shifts 
have consistently found the R shift to 
be less difficult for adult humans. An 
examination of methods used in experi- 
ments investigating these shift conditions 
revealed a number of procedural variables 
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that could possibly have contributed to 
the unexpected finding of no difference in 
the present experiment. An experimental 
program is currently underway to deter- 
mine the influence of some of these 
procedural variables on the relative diffi- 
: culty of R and IRD shifts. 
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17 pigeons were tested for stimulus generalization in extinction 
twice, 1st following single stimulus training and later following addi- 
tional training including a 2nd stimulus. Test I followed 12 days 
of VI-1 min. training to peck a key illuminated by 555 my light. 
Following this, Group I (n=8) received 12 days of nondifferential 
VI-1 training to randomly alternating stimuli of 555 ma and a white 
vertical line on a black surround; Group II (n=9) received discrimi- 
nation training consisting of VI-1 reinforcement to the 555 my stimulus 
and nonreinforcement to the white vertical line. It was found that, in 
comparison with the single stimulus control condition, interdimensional 
nondifferential training significantly flattens both absolute and relative 


generalization gradients, whereas 
training steepens them. 


In studies of the influence of discrim- 
ination training on stimulus generaliza- 
tion, three operationally distinguishable 
types of discrimination training have 
been employed. In the first type (“in- 
tradimensional") both the positive and 
the negative stimuli are on the same 
dimension, i.e., the dimension explored 
in generalization testing (e.g., Hanson, 
1959; Thomas, 1962). In a second 
type (“interdimensional” training) the 
negative stimulus is either the absence 
of the positive, or a stimulus orthogonal 
to the dimension along which the posi- 
tive stimulus is varied in generalization 
testing (e.g., Honig, Boneau, Burstein, 
& Pennypacker, 1963; Jenkins & Har- 
rison, 1960, 1962). A third type of 
discrimination training ^ procedure 
(“extradimensional”) involves dis- 
crimination training along one dimen- 
sion but generalization testing along a 
novel previously irrelevant dimension 


1 This research was supported in part by 
Research Grant RO 1-HD-00903-05 from 
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interdimensional discrimination 


(e.g., Perkins, Hershberger, & Weyant, 
1959; Reinhold & Perkins, 1955). 
One common effect of all three types 
of discrimination training is the steep- 
ening of generalization around the 
positive stimulus value in a subsequent 
generalization test. Following intra- 
dimensional training this effect has 
typically been related to the hypothe- 
sized algebraic summation of gradients 
of excitation and inhibition as outlined 
by Spence (1937). The steepening of 


‘the gradient following inter- and extra- 


dimensional discrimination training, on 
the other hand, has given rise to several 
different interpretations. One of these 
was suggested by Reinhold and Per- 
kins (1955) who showed that extra- 
dimensional discrimination training 
(between two floor textures) steepened 
the gradient along the brightness di- 
mension. These investigators argued 
that such training produces a set-to- 
discriminate which steepens generaliza- 
tion along even an irrelevant stimulus 
dimension. 

An alternative view has been ex- 
pressed by Restle (1955), Jenkins and 
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Harrison (1960), and others. These 
authors point out that in discrimination 
training responding in the presence of 
extraneous cues is reinforced during 
SP periods, but extinguished in SA 
periods. Presumably this weakens 
their control over the response, result- 
ing in increased control by the training 
stimulus. 

If differential reinforcement for re- 
sponding to two different stimuli en- 
courages the development of a set-to- 
discriminate which is then reflected in 
a steepened generalization gradient, it 
follows that nondifferential (equal) re- 
inforcement of responding to two dif- 
ferent stimuli should discourage the 
development of such a set resulting in 
a flattened generalization gradient. 
With equal justification, nondifferential 
training may be considered to intro- 
duce an additional “extraneous” stim- 
ulus which would reduce the control 
exercised by the original training stim- 
ulus thereby flattening the resulting 
generalization gradient, Clearly, both 
of the views described above would 
predict that interdimensional nondiffer- 
ential training (as defined above) 
would result in a flattened generaliza- 
tion gradient. The present study was 
designed to test this prediction as well 
as to replicate with a different stimulus 
dimension the finding by Jenkins and 
Harrison (1960) that interdimensional 
discrimination training steepens the 
gradient around the positive stimulus 
value. 


METHOD 


Subjects.—The Ss were 17 experimentally 
naive homing pigeons maintained at 70-75% 
of their ad libitum weight throughout the 
experiment. 

Apparatus —Two Grason-Stadler pigeon 
chambers were employed with associated 
automatic programming and recordi di 
ment. Industrial Electronics rupem 
line display cells fitted with Kodak Wratten 
filters No. 65, 74, 99, 73, and 72B provided 


monochromatic stimuli with peak wave- 
lengths at 501, 538, 555, 576, and 606 my, 
respectively. The display cells also provided 
a white vertical line à in. wide X & in. high 
on a black background. A Grason-Stadler 
white noise generator, Model 901, supplied 
masking noise to both boxes at all times. 

Procedure.—Following magazine and key- 
peck training, Ss were given 2 days of 50 
continuous reinforcements each and then 
placed on a variable interval schedule with a 
mean interreinforcement interval of 1 min. 
(VI-1). During all of this time the key was 
continuously illuminated by a light of 555 my. 
All Ss were given 12 daily 3-hr. sessions of 
VI-1 to the 555-my stimulus. Immediately 
following the twelfth VI session, a generali- 
zation test in extinction was administered to 
all Ss. The five different stimuli, 501, 538, 
555, 576, and 606 my, were randomized 
within a series and 12 different random series 
were presented to each S. Each stimulus 
presentation was for 60 sec. and was im- 
mediately followed by the next stimulus with 
no blackout intervening. 

On the day following this test the eight Ss 
in Group I started nondifferential training 
with the 555-my stimulus and the white 
vertical line randomly alternated for 1-min. 
periods with both reinforced according to the 
same VI-1 schedule, Immediately following 
the twelfth i-hr. session of nondifferential 
training, a second generalization test (similar 
to the first) was administered to each S. 

On the day following their generalization 
test, the nine Ss in Group II started on 
discrimination training. The 555 my and 
vertical-line stimuli were randomly alter- 
nated for l-min. periods with responding to 
555 my still reinforced according to the 
original VI-1 schedule, but responding to the 
line was extinguished, Discrimination train- 
ing was continued for 16 daily sessions, the 
last immediately followed by a generaliza- 
tion test similar to that administered earlier. 
Responding in the presence of the white-line 
stimulus was never reinforced and there 
was little tendency to respond to it, even at 
first. For this reason the use of criterion 
of discrimination Performance which would 
have been met very early in training seemed 
Inappropriate, and instead extensive training 
was employed to insure that Ss had consider- 
able experience with both stimuli, 


RESULTS 


In Fig. 1 are presented the mean 
generalization gradients of eight Ss in 
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TESTI e—e 
TEST X o---o 


MEAN RESPONSES 


WAVELENGTH (My) 


Fic. 1, Absolute generalization gradients 
of Group I. 


Group I on both generalization tests. 
For each S the gradient on each test 
consisted of the total number of re- 
sponses emitted in the presence of each 
of the test stimuli during the course 
of the entire test. The values plotted 
are the group means of these individual 
gradients. 

As Fig. 1 reveals, both tests result 
in decremental gradients around the 
555-mp value. The slopes of the 
gradients in Fig. 1 were analyzed in the 
following manner. A separate slope 
score was obtained from each side of 
the gradient by multiplying each S’s 
number of responses by the orthogonal 
weighting coefficients —1, 0, and +1, 
with the +1 weighting given to the 
training stimulus value. The slope 
score thus obtained is the absolute value 
of the generalization decrement, in 
number of responses. These slope 
scores were then subjected to a f test 
which yielded for the left side: t(7) 
= 972, p<.01; for the right side: 
#(7) =11.87, p« 01. Thus the 
gradient for Group I following inter- 
dimensional nondifferential training is 


reliably flatter than the gradient pre- 
viously obtained under the single stim- 
ulus (control) condition. 

Another way of analyzing slope dif- 
ferences requires the data to be con- 
verted into relative generalization 
gradients. One convenient procedure 
is as follows: for each S on each test, 
the proportion of total responses made 
to the various test stimuli is computed 
and averaged for the group on each 
test, resulting in the plot of relative 
generalization gradients shown in Fig. 
2. This transformation eliminates dif- 
ferences in response strength giving 
each S equal weight in determining 
the group average. In addition, it pro- 
vides a simple and convenient measure 
of gradient steepness; the higher the 
percentage of total responses emitted to 
the training stimulus value, the steeper 
the gradient. As is clear in Fig. 2, the 
difference in steepness is in the pre- 
dicted direction and is significant, 
t(8) = 3.85, p < .01. 

Figure 3 presents the mean generali- 
zation gradients of Ss in Group II. 
The slopes of the gradients in Fig. 3 
were analyzed in the same manner as 
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Fic. 2. Relative generalization gradients 
of Group I. 


TEST 1 oe 
TEST X 0---o 


MEAN RESPONSES 
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Fic. 3. Absolute generalization gradients 
of Group II. 


those in Fig. 1 had been. An analysis 
of the slope scores for the left side 
yields a £(8) = 10.75, p < .01; for the 
right side: £(8) = 3.77, p < .01. Thus 
the gradient for Group II following 
interdimensional discrimination train- 
ing is reliably steeper than the gradient 
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Fic. 4. Relative generalization gradients 
of Group II. 
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previously obtained under the single 
stimulus (control) condition. 

In Fig. 4 are presented the generali- 
zation gradients transformed as for 
Group I (Fig. 2). The difference in 
steepness is in the predicted direction 
and is significant, £(8) = 2.63, p < .05. 


Discussion 


The distinction between absolute and 
relative measures of stimulus generaliza- 
tion has been highlighted in the recent 
literature. Newman and Grice (1965) 
have pointed out that conclusions regard- 
ing the effect of a given experimental 
manipulation on generalization may differ 
depending upon which of these two 
procedures is utilized. Thus, for ex- 
ample, in a study by Jenkins, Pascal, and 
Walker (1958) increased drive led to an 
absolutely steeper generalization gradient 
but to a relatively flatter one. The abso- 
lute measures conform to Hullian predic- 
tion and Newman and Grice prefer them, 
quite probably for this reason, Other 
considerations, however, favor the use of 
relative generalization gradients, most 
notably their relative stability across time 
during extinction testing, and across in- 
dividuals. Thus relative generalization 
gradients based on Guttman and Kalish’s 
(1956) data show little or no change in 
slope during an extended test in extinc- 
tion, and Thomas and King (1959) have 
reported virtually identical mean relative 
generalization gradients for high and low 
responding Ss in several different drive 
level conditions, Thus it appears to be 
at the level of relative generalization that 
the stability and lawfulness of generaliz- 
ing behavior is most apparent. 

In the present study both absolute and 
relative generalization gradients were 
Presented and the results were in agree- 
ment. Nondifferential training flattens 
the gradient relative to the control con- 
dition, whereas discrimination training 
steepens it. The finding that interdimen- 
sional discrimination training steepens 
the gradient of generalization around S+ 
has Previously been teported by Jenkins 
and Harrison (1960). These authors 
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found that the frequency of a tone used 
in single stimulus training exhibited very 
little control over key-pecking rate (i.e., 
a very flat decremental gradient). The 
introduction of interdimensional discrimi- 
nation training with the reinforced tone 
as S+ and silence as S— was adequate 
to generate a reliable and steep decre- 
mental gradient. That such explicit dis- 
crimination training is not necessary to 
produce steep gradients along the wave- 
length dimension is clear from Test I in 
this study. This difference may be 
peculiar to the two modalities or stimulus 
continua involved, but evidence suggests 
the relevance of at least two other factors, 
the implicit discrimination training in- 
volved in learning to peck a localized 
visual stimulus (cf, Heinemann & 
Rudolph, 1963) and the likely presence 
of a preexperimental history of wave- 
length discrimination in birds reared in 
the normal manner (cf., Peterson, 1962). 

It is of interest to note that inter- 
dimensional discrimination training has 
now been shown to steepen generalization 
gradients along both auditory and visual 
stimulus dimensions. The flattening of a 
generalization gradient by interdimen- 
sional nondifferential training has not 
been reported before but it clearly occurs, 
as was predicted. 

These results are consistent with the 
view that interdimensional discrimination 
training increases “attention” or a set-to- 
discriminate, whereas ^ nondifferential 
training encourages its opposite. Recent 
evidence, however, raises serious question 
about the general applicability of these 
concepts. In contradiction to the find- 
ings of Reinhold and Perkins (1955), 
Mackintosh (1965) has reviewed con- 
siderable evidence which suggests that 
the effect of attention is generally quite 
specific, such that training Ss to dis- 
criminate along one stimulus dimension 
reduces stimulus control (as reflected in 
generalization slope or discrimination 
performance) along other dimensions. 

The view expressed by Restle (1955) 
and Jenkins and Harrison (1960) may 
be extended so as to fit all of the data 
currently available. Such an extension 
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requires the assumption that the control- 
ling influence of all (internal and ex- 
ternal) stimuli is of fixed extent, follow- 
ing the analogy of a pie. Thus, the 
greater the number of controlling stimuli 
and the greater their influence, the lesser 
the degree of control available to other 
stimuli, i.e., the smaller their share of the 
pie. 

In addition to studies like the present 
one which manipulate external stimuli, 
the “pie hypothesis” fits the data of 
studies which employ various reinforce- 
ment schedules which presumably develop 
a greater degree of proprioceptive control 
over behavior than does variable interval 
and also result in flatter generalization 
gradients. Such a finding was reported 
for differential reinforcement of low rate 
by Hearst, Koresko, and Poppen (1964) 
and for variable ratio by Thomas and 
Switalski (1966). 

Certain details of the pie hypothesis 
remain to be worked out. For example, 
the rule by which additional stimuli re- 
duce the control available to the training 
stimulus is clearly not a simple sub- 
tractive one, Pilot work performed in 
this laboratory with four different stim- 
uli used in interdimensional nondifferen- 
tial training resulted in a gradient only 
slightly flatter than that obtained with 
two training stimuli in the present ex- 
periment. Despite the incompleteness of 
its formulation, the pie hypothesis is 
readily subject to experimental test and 
should therefore provide the basis for a 
better integrated and more fruitful inves- 
tigation of many problems relating to the 
stimulus control of behavior. 
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Using homophones as stimuli and similar-sounding words (e.g. 
CANVAS-CAMPUS) as responses, 4 experiments were carried out to 
compare the learning and retention of lists of high intralist similarity 
(W), high interlist similarity (B), and lists of low similarity (Con- 


trol). 


Experiment I was used to select degrees of learning for 


Experiments II and III. In Experiment II, List 2 recall after 1 wk. 


indicated superior retention of W. 


In Experiment III, free recall of 


both lists after 1 wk. indicated superior recall for W, and, to a lesser 
extent, for B. Experiment IV compared recall for a single list (W vs. 
Control), with W better retained after 1 wk. In all experiments, W 
markedly retarded learning with B impeding List 2 learning slightly. 
All of the W and B conditions were run with both consistent and 
inconsistent pairings: in a consistent pairing, if Bore was paired with 
CANVAS, then BOAR would be paired with campus. The inconsistent 
pairings were generally deleterious to learning, but did not affect 


retention much. 


A. description of the extinction and 
recovery of generalized error tenden- 
cies proposed by Gibson (1940, 1942) 
led to the expectation that similar items 
would interfere with one another over 
time, and lead to greater forgetting 
than could be obtained with dissimilar 
items. A trace theory such as Koffka's 
(Koffka, 1935) might lead one to the 
same prediction. The experimental 
evidence, however, as reviewed by Un- 
derwood (1954, 1961), has consistently 
failed to show that high intralist simi- 
larity is conducive to forgetting. Un- 
derwood (1954, 1961) has professed 
to be puzzled by these findings because, 
he maintains, the effects of interlist 
similarity upon recall are well estab- 
lished. However, Underwood's faith 


1The research was supported by a grant 
from the UCLA Research Committee. The 
data collection and tabulation were largely 
in the hands of Sandra Wilcox, with the help 
of Sonia Hoffman in Exp. I, II, and III. 
Lester D’Andrea assisted with the statistical 
analyses, 


in the efficacy of interlist similarity 
(B)? seems to be based upon data 
which are not really comparable with 
the available data on intralist simi- 
larity (W). Experiments on W show 
that when an S attempts to learn items 
which are similar to one another, he 
tends to confuse the similar items and 
his learning is slower than it would be 
if the items were dissimilar. Having 
overcome this interference, the similar 
jtems are retained as well as the dis- 
similar items. However, studies in 
which B is manipulated are concerned 
with retroactive and proactive inhibi- 
tion (RI and PI), and do not provide 
clear evidence concerning the retention 
of materials which have overcome in- 
terference. The RI experiment ends 
with a demonstration of interference, 
and S is never given the added training 


2 Henceforth B will designate between-list 
similarity, while W will designate within-list 
Similarity. Usually, B or W alone will 
imply “High B” or “High W.” 
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which would allow him to overcome 
this interference before a retention in- 
terval. Hence, the RI experiment 
does not deal with the interaction of 
similar items during a retention inter- 
val, but shows instead that interference 
is generated when S attempts to master 
two similar sets of items. In the PI 
experiment we come closer to a situa- 
tion which parallels experiments on W. 
Here, in learning a second list, S may 
suffer interference from the first, but 
this interference is finally overcome. 
Subsequent tests of second-list reten- 
tion are therefore tests of retention for 
lists which have overcome interference. 
While examinations of B in PI have 
not been numerous, they suggest that 
in general, manipulations of B do not 
affect retention (Dallett, 1962, 1964). 
The one exception is the A-B, A-C 
paradigm (identical stimuli and unre- 
lated responses), in which second-list 
retention is substantially worse than it 
is in other paradigms. The A-B, A-C 
paradigm, however, is known (eg., 
Postman, 1961; McGovern, 1964) to 
produce considerable unlearning of the 
first list, and hence the similar mate- 
rials might not have been brought to 
the point at which sufficient interfer- 
ence in learning has been overcome. 
Furthermore, experiments on W have 
not included identical elements of the 
sort usually used in the A-B, A.C 
paradigm. Whatever the comparable 
cases of W and B may be (and there 
probably are no completely unobjec- 
tionable parallels), it is clear that most 
or all of the evidence suggesting that B 
affects "retention" is comparable to 
evidence suggesting that W affects 
acquisition. 

One further indirect argument sup- 
ports the conclusion that B and W are 
equally ineffective. In his 1954 review, 
Underwood (1954) cogently pointed 
out that many of the studies comparing 
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high and low W had involved re- 
peated testing of Ss, and that the 
sampling of materials had been such 
that high W entailed low B while low 
W was associated with high B. Sev- 
eral of the studies which had been done 
up to that point, then, might have 
shown no difference between high and 
low W only because of counteracting 
effects of (respectively) low and high 
B. However, since 1954, some studies 
have been done in which this confound- 
ing was avoided (Underwood, 1961), 
and intralist similarity still does not 
decrease recall The implication of 
this for the confounded studies previ- 
ously mentioned is that interlist simi- 
larity must have had very little effect 
in those studies since by implication 
there could not have been much effect 
of intralist similarity to be cancelled. 

Since the PI experiment comes clos- 
est to making W and B comparable it 
was hoped that the issues could be 
clarified through the examination of 
second-list retention following the 
learning of 12 similar pairs of paired- 
associate items, the similar items being 
arranged for some Ss in such a way as 
to produce similarity within each of his 
two lists, while for other Ss the same 
items were arranged in such a way as 
to produce similarity between his two 
lists. Similarity was manipulated by 
using words of similar sound and spell- 
ing, rather than by using words of 
similar meaning, or trigrams con- 
structed from only a few letters as 
has been done before, Obviously, the 
effects of similarity upon retention may 
differ for different materials, and the 
underlying issues will not be settled by 
only one or two investigations. 


METHOD 


_ Materials —Two sets of paired-associate 
lists. were used; they differed only in the 
pairing of stimuli and responses, and only 
one set will be described. The stimuli 
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were 12 homophone pairs (e.g., REIGN-RAIN), 
most of them taken from Johnson and Fehmi 
(1963). The responses were 12 pairs of 
words obtained from articulation-testing ma- 
terials prepared by Black and Haagen 
(Black, 1957; Black & Haagen, 1963). The 
responses were not homophones, but sounded 
alike—one of them being the word most 
frequently given by Ss mishearing the other 
word in an articulation test. The 24 stimuli 
and 24 responses appear in Table 1. In 
addition, each stimulus and each response 
was matched with control items having the 
same number of syllables and practically 
identical frequency of usage according to the 
Thorndike-Lorge (1944) word count. 

The items in Table 1 appear in a consistent 
pairing. In such a pairing, if BOAR were 
paired with canvas then Bore would be 
paired with campus. In a nonconsistent 
pairing, if BoAR were paired with CANVAS, 
BORE would never be paired with campus, 
nor would knowing the stimulus to which 
CAMPUS was attached convey any informa- 
tion about the response paired with BORE ex- 
cept that it was not campus. Control items 
were yoked to experimental items by virtue 
of their number of syllables and frequency of 
usage, and were paired correspondingly. 
Lists were then constructed by the appropri- 
ate segregation of pairs. Two consistent 
lists of high W were constructed, each 
having six S-R pairs and their six similar 
mates. Two inconsistent lists of high W 
were also constructed. Two lists of high B 
were constructed from the same items, by 
their 12 similar mates in the other list; as 


TABLE 1 


MATERIALS USED, IN ONE OF THE 
Two CONSISTENT PAIRINGS 


BOAR CANVAS 
MUSSEL OPAL 
SHOOT TASSEL 
CARAT DESCEND 
PROFIT MORNING 
KERNEL FORMAL 


BORE CAMPUS 
MUSCLE OVAL 
CHUTE TACKLE 
CARROT DEFEND 
PROPHET WARNING 
COLONEL NORMAL 


HORSE JAGGED 
KNEAD BALLOT 
SEEN PACKAGE 


HOARSE JACKET 
NEED BALANCE 
SCENE PASSAGE 


CENT REGION SENT REASON 
BEACH CERTAIN BEECH CURTAIN 
RAIN PILLAR REIGN KILLER 


Note.—A vertical bisection of the table yields two 
lists of high interlist similarity (BC), while a horizontal 
bisection of the table yields two lists of high intralist 
similarity (WC). 


669 


placing 12 different pairs in one list, and 
in the case of W both consistent and incon- 
sistent pairings were used. The control 
items yielded four lists. All of these lists 
were prepared in four different orders for 
presentation. Each list of any given pair of 
lists was used equally often as an S’s first 
or second list. For the two lists of a Control 
condition, one list was taken from a pair of 
B lists, and the other was one of the four 
control lists. 

Procedure —The experiments involved two 
sessions 1 wk. apart. Each S was assigned 
to an experimental condition on his appear- 
ance (using a prearranged schedule), and 
was given standard instructions for paired- 
associate learning. In addition, he was told 
to be sure to speak his responses distinctly. 
Although possible problems had been antic- 
ipated, Es reported no trouble in identifying 
S’s responses. The S practiced the first 
list for a fixed number of trials which dif- 
fered for different experimental conditions 
and then was immediately asked to learn an- 
other list. The second list was exposed by 
merely moving the window on the memory 
drum, and the changeover required less than 
30 sec. The lists were presented at a 2: 2 
sec. rate with 4 sec. between trials. Having 
learned both lists, S was dismissed and 
reminded to return in 1 wk. If he asked 
what would happen at the second session 
he was told to expect more of the “same 
sort of thing.” On his return 1 wk. later, 
he was given a test of retention, the nature 
of which varied from experiment to ex- 
periment. 

Design and subjects—The Ss were stu- 
dents in the introductory psychology classes 
at the University of California, Los Angeles. 
They participated as part of a course require- 
ment, and had not previously participated in 
laboratory learning experiments. The only 
reasons for replacing Ss were (a) prior 
experience, (b) S's native language was not 
English, (c) error on part of E, or (d) 
apparatus failure. No Ss were discarded 
for failure to learn. 

The first experiment (Exp. I) will be 
considered a pilot experiment, and most of 
the data will not be discussed. In this 
preliminary experiment, Ss learned two lists 
of the same sort (High W, High B, or 
Control: if W or B, the pairing could be 
consistent (C) or not (NC)—hence, five 
experimental conditions; WC, WNC, BC, 
BNC, Control). Both lists were learned 
to criterion, with 16 Ss in each of the five 
conditions, The data from Exp. I were 
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TABLE 2 
SUMMARY oF CONDITIONS, TRIALS, AND Ss 
Learning Trials 
Exp. Cond. 7-Day Retention 
List 1 List 2 
NC 15 10 4 trials, List 2 16, 16 
jr BC BNC 10 6 4 trials, List 2 16, 16 
Control 8 5 4 trials, List 2 16 
OL Control 15 5 4 trials, List 2 16 
HI WC, WNC 15 10 Free recall 8,8 
BC, BNC 10 6 Free recall 8,8 
Control 8 5 Free recall 14 
OL Control 15 5 Free recall 14 
IV WC, WNC 15 eL 4 trials, List 1 32, 32 
Control 8 — 4 trials, List 1 31 


used as a basis for selecting fixed numbers 
of learning trials to be used in the other 
experiments. The values decided upon ap- 
pear in Table 2. Since it was expected 
that this would not perfectly equate the 
amount learned in the various experimental 
conditions, two control groups were included 
in Exp. II and III, each learning two lists 
of low W and low B, one having 15 trials 
and one only 8 trials on the first list, and 
both having 5 trials on the second list. 
Using the fixed numbers of trials suggested 
by Exp. I, Exp. IL III, and IV were 
carried out consecutively. Experiments II 
and III differ only in the test of retention; 
in Exp. II Ss were asked to relearn the 
second of the two lists they had practiced 
the week before; the first relearning trial, 
according to convention, is referred to as 
"recall" In Exp. III Ss were given a sheet 
Of paper and asked to write down all the 
items they could think of, both stimuli and 
responses. When they had done this, they 
were asked to (a) indicate which had been 
the responses, and (b) indicate which had 
been on the first and which on the second 
of his two lists. These added tasks were 
usually not difficult for S, since he tended 
to write items in a Pair-and-list Organization 
to start with. Finally, Exp. IV examined 
the problem of single-list retention; the W 


drum tapes used in the control conditions had 
low W stimuli and high W responses. This 
tape had been used for two Ss apiece in 
Exp. I, II, III, and for one S in Exp. IV. 
The Ss affected were discarded. Owing toa 
scheduling mixup, surplus Ss were available 
in Exp. II, and these were used to replace 
the two who had to be discarded. In the 
other experiments, the N was appropriately 
reduced. 


RrsurTs 


List 1 learning —Learning curves 
for Exp. II and III (in which Ss 
learned both List 1 and List 2 for a 
fixed number of trials) appear in Fig. 
land 2. The curves in Fig. 1 repre- 
sent List 1 learning, and it is apparent 
that B and control curves coincide, 
while the W conditions are retarded in 
learning with WNC lagging WC. In 
Exp. I, BC had required an average of 
10.94 trials to reach criterion, with 
BNC taking 11.56 trials and the con- 
trol group 11.50: these values do not 
differ significantly according to a New- 
man-Keuls test (Winer, 1961, p. 80), 
although they do differ (p < .05) from 
WC, which required 17.44 trials, and 
WC in turn differed from WN C, with 
22.06 trials (again, $ < .05—this sig- 
nificance level will apply to all New- 
man-Keuls significance tests reported 
in this paper). Only in Exp. IV did 
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Mean Correct Anticipations 


10 15 


Trial 


Fic. 1. List 1 learning curves; combined 
data of Exp. II and III. 


the difference between WC and WNC 
fail to appear, although the W condi- 
tions in that experiment were clearly 
worse than the controls. Hence, high 
intralist similarity seems to have re- 
tarded learning, with a consistent pair- 
ing being somewhat less damaging than 
an inconsistent pairing. This is in 
agreement with the implications of 
most previous studies. 

Although the effectiveness of the W 
conditions is of interest, it is consider- 
ably more important to determine 
whether the different conditions in 
Exp. II, IIT, and IV were more or less 
equated in degree of List 1 learning. 
One ‘indication of degree of learning 
is the terminal level of performance 
reached in each condition. For the 
15-trial Overlearning Control condi- 
tion, the number of correct anticipations 
on the eighth trial was used, while for 
other conditions the number of correct 
anticipations on the last trial of learn- 
ing was used. Analyses of variance in- 
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dicated no significant differences in 
Exp. II, F (5, 90) = 1405, or Exp. 
IV, F (2, 92) <1. In Exp. III a 
significant difference appeared, F (3, 
56) = 2.85, p< .05. The Newman- 
Keuls procedure indicated that the dif- 
ference was between the two extreme 
conditions, B with 10.75 correct antici- 
pations, and W with 8.37 correct an- 
ticipations. The two control condi- 
tions fell between these extremes, with 
9.93 and 9.71 correct anticipations, not 
significantly different from either W 
or B. Since the recall data to be 
examined will concern separate com- 
parisons between W, B, and the con- 
trols, this difference should not invali- 
date the recall data. 

Although the various conditions have 
reached approximately equal terminal 
performance levels, it is generally 
agreed that this is not enough to guar- 
antee equal degrees of learning. For 
example, Ss reaching a given level of 
performance are generally considered 
to have learned more, the faster they 
attained this performance level (Un- 
derwood, 1964). Hence, in terms of 
rate of learning, the W conditions must 
be considered to be slightly behind the 
B and control conditions at the end of 
List 1 learning. If we simply count the 
overall number of correct anticipations 
however, we will find that the W Ss 
have accumulated more correct antici- 
pations over their 15 trials than the B 
and control Ss have over their 8 or 10 
trials. Depending upon the aspect of 
learning which one wishes to empha- 
size, it is possible to argue that the W 
conditions have learned less, or more 
than the B and Control conditions. 
These considerations will be important 
for the evaluation of the recall data. 

List 2 learning—Learning curves 
for List 2 appear in Fig. 2; the data 
have been plotted separately for W 
and B, with the same control data 
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appearing in each half of the figure. 
It will be seen from examination of 
Fig. 2 that both W and B are slightly 
retarded in List 2 learning, although in 
the B conditions clear retardation ap- 
pears only for the NC pairing, if the 
control group without overlearning 
(the lower dashed curve) is used. 
Examination of terminal performance 
levels using analyses of variance re- 
vealed no significant differences in 
Exp. II, F (5, 90) = 1.917, or in Exp. 
II, F (3, 56) = 1.673. However, the 
conflicting considerations regarding 
degrees of learning on List 1 apply 
just as well to List 2. 

Errors in learning.—A.. tabulation 
was made of the overt errors produced 
by each S during learning. "These ap- 
pear in Table 3, where they have been 
summed over Exp. II, III, IV, and 
expressed as a percentage of the total 
opportunities for error available in 


Mean Correct Anticipations 


Fic, 2. List 2 learning curves; same Ss 
as in Fig. 1. (The upper dashed curve is 
the Overlearning Control, the lower the Non- 
overlearning Control, The same control 
data appear in both halves of the figure.) 
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TABLE 3 


TOTAL ERRORS AS A PERCENTAGE OF TOTAL 
OPPORTUNITIES IN LEARNING, ALL OF 
Exp. II, III, IV COMBINED 


Cond. List Intra Inter Total 
Control 1 14.5 = 18.8 
2 5.8 0 8.5 

OL Control} 1 13.5 — 17.8 
2 &3 0 11.3 

BNC 1 17.2 21.4 
2 5:2 1.5 8.2 

BC 1 11.9 -= 17.0 
2 8.6 1.9 11.9 

WC 1 34.5 — 35.6 
2 30.4 0 32.0 

WNC 1 254 — 26.1 
2 21.4 a 22.6 


* Percentage less than 1%, 


each condition. It will be seen in 
Table 3 that the majority of the errors 
are intralist errors. The column 
headed “Total” includes within-list and 
between-list errors, as well as errors in 
which S gives a stimulus item as a 
response, and errors in which he im- 
ports an  extraexperimental word. 
These last two error categories are 
minor ones, and are represented by 
approximately equal numbers of errors. 
Interlist errors are extremely infre- 
quent, although they do occur in those 
conditions in which they might be ex- 
pected. 

Recall: Exp. II —The test of recall 
in Exp. II consisted of four trials of 
List 2 relearning, with all procedural 
aspects being similar to the learning 
session 1 wk. before. The mean recall 
scores (i.e., correct anticipations on the 
first relearning trial) appear in Table 
4. The Newman-Keuls procedure was 
used to analyze differences between 
conditions. The result of this analysis 
was that WC and WNC were shown 
to differ from BC, BNC, and the 15- 
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TABLE 4 


MEAN RETENTION Losses AND MEAN 
NuMBER CORRECT ON THE FIRST 
Two RELEARNING TRIALs, Exp. II 


Number Correct 
Cond, Loss 

Trial 1 Trial 2 

WC 5.27 5.69 9.06 

WNC 5.19 4.56 8.00 

Control 7.03 3.56 8.37 

OL Control] 8.89 2.31 9.06 

BC 7.93 2.13 7.06 

BNC 7.69 1.87 6.37 
(MSerror) (5.625) (5.313) | (5.603) 


trial Control group; WC also differed 
from the other control condition. 
Other differences were not significant. 
An added analysis was done using the 
difference between the obtained recall 
scores and the recall scores predicted 
by a successive probability analysis 
(Underwood, 1964). This analysis 
led to what was essentially the same 
outcome, BC, BNC, and the over- 
learned controls being significantly 
worse in recall than WC and WNC; 
the remaining control condition differ- 
ing from none of the other conditions. 
On the second relearning trial (Table 
4) the two highest conditions, WC 
and the overlearned control, differing 
from the two lowest, BC and BNC. 
By the third relearning trial, the groups 
did not differ. 

Since the raw recall scores and the 
losses from the successive probability 
analysis show essentially the same dif- 


. ferences among groups, we must con- 


clude that the rate at which. Ss reached 
the terminal performance level on their 
second list has not too markedly af- 
fected the relative degrees of learning 
attained, since this would work against 
the superiority of the W conditions. 
It is still possible, however, that the W 
conditions have, as a result of the 
fact that they were given a greater 
number of trials on their second list, 


overlearned certain items for which the 
interfering effects of intralist similarity 
were minimal. Under conditions such 
as these, the distribution of item 
strengths for the W groups would be 
broader and flatter than the distribution 
of item strengths for the other groups, 
with more items falling at the “ex- 
tremely weak” and “extremely strong” 
ends of the distribution. In a short- 
term test of recall, such as is provided 
by an added learning trial, performance 
would depend upon the number of 
weak items lost, and the W conditions 
would appear at a disadvantage. In 
a long-term test of retention, however, 
performance would depend upon the 
number of strong items retained, and 
the W conditions might do better than 
the B or Control conditions. This 
possibility can be looked into by using 
only those items which were not over- 
learned. The relevant data appear in 
Fig. 3, which will require some ex- 
planation. For each condition, data 
were obtained for items gotten right, 
once, twice... k times. For items 
gotten right k times prior to Trial n-1 
the successive probability analysis 
yielded the probability of a correct re- 
sponse on Trial n. This index of 
associative strength is designated 
Pr(n/n-1) and appears on the abscissa 
of Fig. 3. Data were also obtained 
concerning the proportion correct in 
recall for items gotten right k times 
through Trial n; these appear on the 
ordinate. The values of Pr(n/n-1) 
and the corresponding proportions re- 
called were taken for all values of k 
up to the first one giving Pr(n/n-1) 
of 1.00, and both measures were 
smoothed using 3-point moving aver- 
ages. The smoothed data points were 
plotted in Fig. 3 with the end points 
omitted: thus, the extreme left-hand 
data point on any of the curves 
represents proportion recalled. and 


Proportion Recalled 


Pr.(n/n-l), in learning 


Fic. 3. Proportion recalled as a function 
of probability of a correct response in learn- 
ing, Exp. II. (The Overlearning Control is 
the lower of the two control curves.) 


Pr(n/n-1) for items gotten right once 
in learning, the values having been ob- 
tained from the combined data of items 
gotten 0, 1, and 2 times, The next 
point represents items gotten right 
twice, and is based upon items cor- 
rectly anticipated 1, 2, 3 times .. . etc. 
Examination of the figure indicates 
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tions show superior recall. However, 
if one compares items correctly antic- 
ipated an equal number of times (k), 
the only clear superiority is for WC 
items anticipated once or twice in 
learning. So far, it seems to make 
more sense to use Pr(n/n-1) rather 
than b as the preferred index of 
strength. 

Recall: Exp. III. —In Exp. III, the 
recall test consisted of having S write 
down everything he remembered from 
the two lists he had learned the pre- 
vious week, The C and NC pairings 
were given to only eight Ss each, and 
will not be distinguished in the analy- 
Ses. The recall protocols were ex- 
amined in a number of ways, and only 
some of the scores will be presented ; 
these appear in Table 5. The groups 
did not differ in the total number of 
pairs, F (3, 56) — 1.687. These were 
pairs which were properly put together, 
with stimulus and response correctly 
identified. There were significant dif- 
ferences in List 1 words recalled (com- 
bining stimuli and responses which 
were about equally well recalled), with 
the nonoverlearned control doing worse 
than the other three conditions. There 
were also significant differences in 
number of List 2 words recalled, with 
the W condition recalling more than 


that for equal Pr(n/n-1) the W condi- the worst of the two controls. The 
TABLE 5 
MEan RECALL Scores, Exp. III 
List 1 List 2 Errors 
Cond. 

Words Pairs Words Pairs cout, t Other 
wW 10.25 3.00 8.62 2.31 1.50 2 
B 10.00 2.00 7.62 1.31 294 iD 
Control 471 114 4:07 071 0.79 3.86 
OL Control 9.64 3.14 5.14 1.43 0.29 3.21 
(Sos) (22.05) (16.97) ^ i 
" note. SUR errors" include extraexperimental intrusions, and confusion of stimulus with Tesponse regardless 
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Proportion Recalled 


à 4 5 6 78 9 10 


Pr (n/n-1), in learning 


Fic. 4. Proportion recalled as a function 
of probability of a correct response in learn- 
ing Exp. IV. 


data of this experiment seem to be 
somewhat more variable than the data 
of Exp. II, but they still appear to in- 
dicate some superiority in List 1 recall 
for W, and also for B, as compared 
with the control condition which did 
not overlearn the first list. List 2 re- 
call is less clearly facilitated, and the 
effect is not significant for B. The 
superiority of B in List 1 recall might 
be sómewhat suspect since the B con- 
ditions did somewhat better in List 1 
learning than the W conditions in this 
experiment. However, it makes sense 
to assume that, with unlimited time, the 
B Ss might be able to recover (or re- 
construct) more words than the control 
Ss. That they did not do better in 
recall in Exp. II might be explained 
by the fact that the B condition in Exp. 
III produced more interlist confusions 
than the W conditions (47 vs. 24). 
Recall: Exp. IV.—In Exp. IV only 
one list was learned, and a test of re- 
learning followed after 1 wk. The 
Newman-Keuls procedure revealed 


that WC was:superior in recall to the 
control condition (a nonoverlearning 
control), while WNC was not. Ex- 
amining the weaker items as a function 
of Pr(n/n-l) by the same procedure 
used in Exp. II reveals generally su- 
perior recall for WC (Fig. 4). Finally, 
an examination of losses from a suc- 
cessive probability analysis (using the 
Newman-Keuls procedure) indicated 
that both WC and WNC lost less than 
the controls. Hence, the superiority 
of the W conditions in recall in both 
of the previous experiments is not en- 
tirely dependent upon the presence of 
interlist interference in those experi- 
ments. 

Errors in recall.—The errors made 
on the recall trial in Exp. IT and IV 
were totaled, and appear in Table 6, as 
a percentage of total opportunities for 
error. The two control conditions of 
Exp. II have been combined, since 
they produced very few errors. Two 
things are apparent in Table 6; the B 
conditions continue to make relatively 
few interlist errors, although they do 
make a small number of them, and the 
W conditions, as in learning, manifest 
relatively high intralist error rates. 


TABLE 6 


TOTAL ERRORS IN RECALL AS A PERCENTAGE 
Or TOTAL OPPORTUNITIES FOR ERROR 


Error Rate 
Cond. 
Inter Intra Other 
Exp. II 
Control a a 2.41 
BC 2.53 a 1.90 
BNC 6.17 4.93 3.10 
WC 0 21.78 2.97 
WNC a 15.13 0 
Exp. IV 
Control = 7.03 7.03 
WC = 38.25 3.35 
WNC cx 24.32 70 


* Error rates less than 1%. 
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Error data for the other relearning 
trials were consistent with these. 


Discussion 


An examination of studies of intralist 
similarity indicates that it is not so un- 
common as one might suppose for W to 
bring about superior recall. For ex- 
ample, Underwood and Archer (1955) 
found better recall for W than for control 
lists, and argued convincingly against the 
assertion that this might be due to con- 
founded effects of interlist similarity. 
Similarly, Underwood (1953) obtained 
superior recall for high-similarity adjec- 
tives, in which the confounding of W 
and B cannot have been a serious prob- 
lem. Furthermore, several of the studies 
discussed by Underwood (1954) showed 
equal retention of control and W lists 
when recall was compared for items re- 
ceiving equal numbers of correct anticipa- 
tions. If the W-B confounding can be 
ignored in these studies, as was argued 
earlier, and if we consider the fact that 
equal numbers of correct anticipations 
involve lower short-term probabilities of 
correct responding for W than for Con- 
trol Ss, these studies too might be 
interpreted as favoring W in recall. To 
be sure, the differences are not always 
large ones, but they are more consistent 
in direction than a conclusion of "no 
difference" would suggest. Two addi- 
tional studies Suggest the superiority of 
W in recall. One, briefly described by 
Postman (1963) showed that high inter- 
item associative strength facilitated serial 
as well as free recall after a 7-day reten- 
tion interval. The other, less convincing 
because it employed an RI design, was a 
dissertation by Bohm (1961) using serial 
trigram lists; frst-list retention was 
me facilitated with High W of both 
ists. 

The most obvious explanation of such 
facilitative and interfering effects grows 
directly from concepts such as Gibson’s 
(1940, 1942). Given Similar items, stim- 
ulus generalization and response gener- 
alization (which Gibson did not spe- 
cifically discuss) might operate in such 


a way that stimuli would tend to elicit 
the responses paired with similar stimuli, 
and responses might tend to be replaced 
by similar responses. Such tendencies 
should interfere with learning, and Gib- 
son supposed that during learning these 
interfering generalized response tenden- 
cies would have to be suppressed. But 
we might imagine that instead of being 
entirely extinguished, they are overcome 
in such a way that S, while still tending 
to produce erroneous responses, can 
nevertheless perform the correct re- 
sponses in the time allowed. Thus, while 
interference is overcome, generalization 
persists, and, especially after a long re- 
tention interval, each item in a retention 
test might serve as a cue for other items. 

The mnemonic or “reminder” role of 
similarity should aid WC but might be 
expected to generate added interference 
in WNC by suggesting incorrect pairings. 
However, it is possible to account for the 
WNC findings if we assume that nonspe- 
cific as well as specific reminders are 
often helpful to S. If he is reminded by 
the word PILLAR that KILLER is on the 
list, he may be better able to produce 
KILLER when its proper stimulus ap- 
pears. The fact that each list contained 
only pairs of similar items may be im- 
portant in this respect, 

The facilitative effects of similarity 
should apply to B as well as to W, so it 
is necessary to explain why B is seldom 
facilitated in recall, Two obvious possi- 
bilities suggest themselves. First, the S 
with similarity between lists is reminded 
of items on the other list, and while this 
apparently does not prevent him from 
recalling the list he is asked for, it 
certainly does not help him. Indeed, the 
results of Exp, IT suggested that the B 
conditions Were slightly retarded in re- 
learning, and the results of Exp. III indi- 
cated weakly that p facilitates recall 
when Ss are asked to give both lists, and 
are not asked to discriminate too finely 
between them. The second considera- 
tion, at least for the interpretation of the 
present series of experiments, is that dur- 
ing learning Ss can apparently keep the 
B lists Separate, and are not forced, as 
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are the Ss in W conditions, to suppress 
interference. This is implied by a com- 
parison of intralist and interlist error 
rates. Hence, if Ss in B conditions at- 
tempt to generate reminders in recall by 
emphasizing the similarity among items, 
they might unintentionally generate 
enough interference to cancel any 
facilitative effects. 
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CHANGES IN FEAR GENERALIZATION GRADIENTS AS 
A FUNCTION OF DELAYED TESTING? 


OTELLO DESIDERATO, BARRIE BUTLER; Ax» CORNELIUS MEYER 


Connecticut College 


To determine how strength of acquired fear varies with time elapsing 


between fear conditioning and testing, 


5 groups of 16 rats were 


tested for acquisition of a hurdle-jump response 0, 2, 6, 12, or 24 hr. 
after 35 conditioning trials of light CS and shock, Prior to the jump 
tests, apparatus cues were altered for 3 the Ss in each delay group. 
Results suggested that such stimulus change produces a generaliza- 
tion decrement, the slope of the gradient varying inversely with time 


elapsing between 


conditioning and testing. 


Hypotheses that fear 


"incubates" with time or that time weakens responses interfering with 


the performance of fear-motivated behavior 


In a typical acquired fear experi- 
ment, a neutral CS is paired with un- 
avoidable shock. Later, Ss learn to 
jump a hurdle to escape from the shock 
compartment when CS is presented ; 
Strength of acquired’ fear is inferred 
from decreasing jump latencies across 
a series of test trials. Evidence of ac- 
quired fear has been found when con- 
ditioning and test trials are separated 
by a 24-hr. delay (Goldstein, 1960; 
Kalish, 1954; McAllister & McAllis- 
ter, 1962b). Learning of the hurdle 
response is poor when postcondition- 
ing delay is reduced to 1j hr. (Desi- 
derato, 1964), and no learning has 
been reported when test trials immedi- 
ately follow conditioning (McAllister 
& McAllister, 1962a). 

To explain the finding that fear is 
strongest after a 1-day delay, McAI- 
lister and McAllister (1963) consider 
several hypotheses : (a) fear may sim- 
ply increase with time ("incubation 


1 This research 
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Service. 
Hays and Joan Gock 


were not supported. 


hypothesis), and (b) interfering re- 
sponses (e.g., crouching) may weaken 
with time (incompatible response hy- 
pothesis). However, since their Ss 
are conditioned and tested in slightly 
different conditioning boxes, the Mc- 
Allisters argue that testing occurs in 
a generalized stimulus situation. This 
view leads to a third hypothesis: (c) 
Ss tested after a delay may perform 
better than Ss tested immediately be- 
cause the stimuluse generalization 
gradient of fear flattens with time 
(generalization hypothesis). 

In support of the generalization hy- 
pothesis, the McAllisters report that 
hurdle jumping was learned equally | 
Well by groups delayed 3 min. or 24 
hr, if conditioning and testing were 
Carried out in the same apparatus. 
For two other groups, the conditioning 
box was constructed as a replica of the 
testing box, but some differences in 
visual appearance and extramaze cues 
were present. Under these conditions, 
a sharp performance decrement was 
shown only by the group trained and 
tested with no intervening delay. 
These results are compatible with a 
generalization hypothesis which pre- 
dicts: (a) no effect of variations in 
delay if conditioning and testing con- 
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ditions remain unchanged, and (b) a 
direct relationship between perform- 
ance and delay if testing is carried out 
under altered conditions. The “incu- 
bation” and incompatible response hy- 
potheses, on the other hand, would 
both’ predict better performance with 
greater delay, regardless of changes in 
stimulus conditions. 

Ina recent study, Butler (1964) in- 
vestigated the effect of a more radical 
change in visual and tactual stimula- 
tion on the hurdle-jump response of Ss 
tested after delays of 0, 2, 6, 12, or 24 
hr. In her experiment, stimulus 
change consisted of altering the color 
of the inside walls of the conditioning 
compartment from white to black and 
covering the grid floor with a black 
plywood panel. Butler found no sig- 
nificant interaction between postcondi- 
tioning delay and stimulus change, 
contrary to the generalization hypothe- 
sis. However, the interpretation of 
her results remains unclear, since the 
stimulus change effected was drastic 
enough to eliminate all evidence of 
learning for Ss in the changed-cues 
condition. One could thus argue that 
the extreme change in the situation 
prior to testing reduced generalized re- 
sponse strength to a level too weak to 
reflect the effect of variation in delay, 
and thus precluded a fair test of the 
generalization hypothesis. It appeared 
reasonable to assume that, if stimu- 
lus conditions were only moderately 
changed prior to testing, learning of 
the indicator response would still be 
strong enough to reveal the effect of 
differences in delay in changed-cues 
groups. 

The present experiment reports dif- 
ferences in the strength of the hurdle- 
jump response between groups trained 
and tested under conditions of either 
no stimulus change or of moderate 


change, with postconditioning delay 
again varying from 0 to 24 hr. 


METHOD 


Subjects—The Ss were 80 experimentally 
naive male albino rats of the Charles River 
CD strain. At the start of the experiment, 
their ages varied from 80 to 120 days. 
Rockland laboratory animal diet and water 
were always available in home cages. The 
Ss were housed one to a cage. 

Apparatus—The apparatus, previously de- 
scribed in detail (Desiderato, 1964), con- 
sisted of a conditioning compartment with a 
grid floor, separated from a black-walled, 
wooden-floor escape compartment by a guil- 
lotine door whose bottom edge rested on a 
li-in. hurdle. The CS was light provided 
by a 60-w. frosted bulb in an 84-in. alumi- 
num reflector mounted face down on the 
clear Plexiglas cover of the conditioning 
compartment. On each conditioning trial, 
electronic timers presented a 6-sec. CS, with 
shock timed to overlap the last 2 sec. of 
CS. The UCS, generated by an Applegate 
Model 230 AG stimulator, was measured 
across the grids at .6 ma., at 230 v. 

Design and procedure.—The general pro- 
cedure was to expose S to a number of CS- 
UCS presentations in the conditioning com- 
partment, and then to administer a series of 
test trials on each of which the CS was 
presented and the hurdle-jump latency was 
recorded. The 80 Ss were assigned to one 
of the following five postconditioning delay 
groups: 0, 2, 6, 12, and 24 hr.* 

For all Ss on every conditioning trial, the 
inside walls of the shock compartment were 
covered with panels of white cardboard; 
shock was administered through the grid 
floor. On test trials, for half the Ss in 
each delay condition shock-compartment cues 
were changed by removing the white panels 
to expose black compartment walls. Con- 
trary to Butler's (1964) procedure in which 
a black plywood panel covered the shock 
grid throughout the test trials, the grid 
floor was kept exposed to afford unchanged 
tactual stimulation. For the remaining Ss, 
shock-compartment cues remained unchanged 
for both conditioning and test trials. 

On each of the first 2 days, each S was 
handled for two 10-min. periods, and was 
allowed to explore separately both shock and 


8 Data collection for the 2-, 6-, and 12-hr. 
groups began 6 wk. following the collection 
of data for the 0- and 24-hr. groups. 
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escape compartments for 10 min. each, with 
nO access permitted from one to the other. 
During exploration, internal features of the 
shock compartment were the same as those 
each S later encountered on the test trials. 

On the third day, each S received 35 
CS-UCS presentations in the white-walled 
shock box, administered with a 2-min. in- 
terval between trials. The first trial began 
10 sec. after S was placed inside the com- 
Partment. Ten seconds after the termina- 
tion of the thirty-fifth trial, S was removed. 

The first set of 25 test trials (Day 1) 
was begun 0, 2, 6, 12, or 24 hr. later, de- 
pending on the delay group to which S had 
been assigned. On each test trial, S was 
placed in the conditioning compartment fac- 
ing the guillotine door which was raised 
after 10 sec, activating both CS and a 
Standard Electric timer. As soon as S 
had cleared the hurdle in jumping into the 
escape compartment, the door was lowered, 
terminating both CS and timer. If no re- 
Sponse occurred within 60 sec, of CS onset, 
the door was lowered and S was removed 
from the shock compartment for the normal 
intertrial interval of 30-120 sec. On such 
trials, a 60-sec, latency was recorded. Shock 
Was never present on test trials, 

For Ss in the changed-cues condition, on 
all test trials the shock compartment was 
equipped with black walls; for Ss in the 
unchanged-cues condition, the shock com- 
partment retained the white walls present 
during the conditioning trials. 

A second set of 25 test trials (Day 2) 
was administered 24 hr, after the beginning 
of the first trial of Day 1, following an 
identical procedure, 

"Throughout the experiment, the only room 
illumination was provided by a NE-30 7-w. 
bulb located 25 in. above the escape box. 
A 12-in. electric fan maintained a constant 
low-level masking hum. 


RESULTS AND Discussion 


Latency scores were converted into 
reciprocals and, for each S, the mean 
reciprocal score for five-trial blocks 
was the unit used in all analyses re- 
ported below. 

Table 1 shows mean reciproca] 
Scores for Day 1 for six Stimulus- 
change and six No-change groups of 
eight Ss each. In the right-hand col- 
umn of the table differences between 


TABLE 1 


MEANS AND SDs or Five DELAY Groups 


TESTED UNDER CONDITIONS OF STIMULUS 
CHANGE vs. No STIMULUS CHANGE 


(Dav 1) 

Ui Sümulus | No Change | Difference 
O0 Hr. |.124 (.151) | .339 (.210) | —.215 
2Hr. |.164 (.149) |.335 (.250) | —.171 
6 Hr. |.281 (.185)|.403 (245) | —.122 

12 Hr. |.238 (.248) | .311 (.162)| —.073 

24 Hr. |.432 (212) |.343 (.317) | 4.089 


the two means for each level of delay 
reflect gradient slopes for each delay 
condition, the more negative, the 
steeper the slope. As shown in Fig. 
1, the steepest slope, corresponding to 
the greatest generalization decrement, 
occurs in the O-hr. delay condition. 
As delay is increased to 2, 6, and 12 
hr., the gradient becomes progressively 
flatter, until a positive slope is attained 
in the 24-hr, delay condition. The five 
slopes thus fall precisely in the order 
Predicted by a hypothesis which as- 
sumes the generalization decrement to 
be inversely related to the time elaps- 
ing since fear conditioning. The 
probability of obtaining this particular 
order of difference scores is 1/120 or 
-008. 


While significant differences were 


Le -224-HR 


MEAN RECIPROCAL LATENCY (. SEC) 


NO STIMULUS 
CHANGE 
TESTING CONDITIONS 


Fic. 1. Mean reciprocal latencies for five 
delay groups tested on Day 1 under original 
or changed stimulus conditions. 


FEAR GENERALIZATION GRADIENTS WITH DELAYED TESTS 


found among the means of the five 
Stimulus-change groups, F (4, 35) 
= 2.72, p < .05, means for the five 
No-change groups did not differ reli- 
ably, F < 1.0. The mean difference 
between the two 24-hr. delay groups 
also failed to prove reliable, t(14) 
= 0.62. 

Data collected on Day 2 were much 
more variable and failed to reveal a 
systematic tendency for gradients to 
flatten with increasing delay. Differ- 
ences between mean reciprocal scores 
of Stimulus-change and No-change 
groups were — .132, — .205, — .256, 
— .110, and 4- .290, for groups delayed 
0, 2, 6, 12, and 24 hr., respectively. 

The results of this experiment sup- 
port the conclusion that the sheer pas- 
sage of time is not sufficient to in- 
crease the performance of a learned 
response motivated by fear. The per- 
formance of the five No-change groups 
did not differ, contrary to hypotheses 
involving either an increase in fear 
("incubation" hypothesis) or a de- 
crease in the strength of interfering 
responses (incompatible response hy- 
pothesis). What seems to happen, in- 
stead, is an impairment of perform- 
ance if testing takes place immediately 
after conditioning and in somewhat 
different surroundings. However, the 
passage of time may overcome the 
decremental effect of stimulus change, 
for the Stimulus-change subgroup de- 
layed 24 hr. did perform as well as 
any of the five No-change subgroups. 
Further support for the conclusion that 
only the generalized fear response in- 
creases with time comes from the ob- 
servation that, of the two subgroups 
tested immediately, the  Stimulus- 
change group showed a significant im- 
provement between the first and sec- 
ond test days, #(7) = 274, p < .05, 
but the No-change group did not, t(7) 
= 148, p > .10. 
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It would seem appropriate, then, to 
view the Stimulus-change variable as 
tapping two points on a stimulus gen- 
eralization dimension, and to conclude 
that the passage of time flattens the gen- 
eralization gradient of fear, as suggested 
by McAllister and McAllister (1963). 

The flattening of generalization gradi- 
ents with time may not be restricted to 
emotional conditioning, for this finding 
has also been reported in reward- 
conditioning studies using straight-alley 
(Perkins & Weyant, 1958) and operant 
conditioning (Thomas & Lopez, 1962) 
procedures, with rat and pigeon Ss, re- 
spectively. Studies now in progress in 
our laboratory will determine whether 
this phenomenon also holds for humans. 

The precise mechanism which accounts 
for the observed flattening of generaliza- 
tion gradients with time is not known. 
However, we tend to agree with the Mc- 
Allister and McAllister's (1963) sug- 
gestion that elapsed time somehow de- 
creases the number of jnd's separating 
original and generalized stimuli. With 
a long delay and increased difficulty of 
discrimination, testing under generalized 
(changed) conditions should elicit as 
much fear as testing under the original 
conditions. Conversely, in the case of 
immediate testing, the number of jnd's 
separating the same stimulus range 
would be larger; with discrimination thus 
enhanced, changed testing conditions 
would elicit less fear and performance 
would be correspondingly poorer. This 
assumed effect of time on the number of 
jnd’s between two stimulus situations has 
received support in a recent psychophysi- 
cal study by McAllister, McAllister, and 
Franchina (1965). 
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SENSITIVITY OF THE OBSERVER TO TRANSFORMATIONS 


OF THE VISUAL FIELD* 


MYRON L. BRAUNSTEIN 
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Motion-picture sequences representing 2- or 3-dimensional textures 
rotating or translating with respect to X, Y, or Z and displayed 
through polar or parallel projections were generated by a computer 
technique. 32 Ss selected physical models of the textures and manipu- 
lated these according to the perceived motions. Responses to the 
various transformations corresponded to findings in studies of motion 
parallax, stereokinetic depth, and the kinetic depth effect, while method 
of projection accounted for differences between the findings of Es 
using the same transformations. The results supported the meaning- 
fulness of a psychophysics of depth perception based on a mathematical 
analysis of transformations of the visual field. 


A rigid geometrical transformation 
of a visual pattern may yield informa- 
tion concerning the depth of the figure 
represented by that pattern which is 
not available in a stationary view 
(Braunstein, 1962b). Many studies 
involving specific transformations have 
obtained this result, and have applied 
special terms and explanations to this 
finding. The depth cue of motion 
parallax, the fan illusion (Johnson, 
1927), the stereokinetic effect (Mu- 
satti, 1924), and the kinetic depth ef- 
fect (Wallach & O’Connell, 1953) each 
involve a specific rigid transformation 
of a visual pattern. Only recently has 
the relationship of apparent depth and 
motion become the subject of a com- 
prehensive theoretical treatment (Gib- 
son, 1954, 1957). 

Gibson (1954) has postulated that 
“Any regular transformation of a bi- 


1This research was supported by Na- 
tional Science Foundation Grant GB 2335 
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neering and Research, a division of the 
Flight Safety Foundation, Incorporated. 
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who ran the Ss, and to the Computing Cen- 
ter and Department of Psychology at Ari- 
zona State University for the facilities which 
they provided. 


dimensional image tends to yield a tri- 
dimensional motion in perception, and 
the kind of motion perceived depends 
on the kind of transformation” and has 
argued that “A psychophysics of ki- 
netic impressions would require a 
mathematical analysis and classifica- 
tion of the motions or transformations 
of a retinal image [pp. 311-312].” 
While a mathematical analysis of the 
rigid motions is straightforward, the 
generation of visual displays in con- 
formity with such an analysis was not 
practical until Green (1961) intro- 
duced the concept of preparing motion 
pictures for stimulus display by plot- 
ting each individual frame from mathe- 
matically generated coordinates. A 
digital computer, digital-to-analog con- 
verter, cathode-ray-tube plotting de- 
vice, and motion-picture camera are 
used in this technique. 

'The present study explores some of 
the implications of Gibson's theory of 
the differential sensitivity of O to 
transformations of the visual field. In 
particular, it is an examination of the 
usefulness of this concept in empiri- 
cally tying together some of the re- 
sults of earlier studies which involved 
specific types of figures and transfor- 
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mations. Stimuli will therefore be 
included which vary over the stimulus 
dimensions which appear to be most 
relevant in the two-dimensional pres- 
entation of patterns in motion: the 
type of transformation, the axis of 
transformation, the dimensionality of 
the transformed figure, and the 
method of projecting the figure onto 
a plane for display. 

The complete set of orthogonal rigid 
transformations, as described by Gib- 
son, is included. These are the trans- 
lations along and rotations about three 
perpendicular axes, X, Y, and Z. 
Both two- and three-dimensional tex- 
tures are included for each transfor- 
mation, and each transformation of 
each texture is displayed by projecting 
its elements onto a plane through a 
near point (polar projection) or 
through an infinitely distant point 
(parallel projection). 


METHOD 
Subjects 


The Ss were 16 male and 16 female Ari- 
zona State University undergraduate volun- 
teers from freshman and sophomore-level 
psychology courses. 


Stimuli 


The stimuli were 120-frame, 16-mm. 
motion-picture sequences. Each frame of 
the sequence contained a pattern of small 
white dots against a black background. The 
coordinates of each dot were specified on 
digital tapes generated on a General Electric 
Model 225 computer. The films were pro- 
duced from these tapes using a Stromberg 
Carlson Model 4020 microfilm plotter. 

The computer Program used to generate 
the textures (dot patterns) operated in a 
theoretical three-dimensional space where 
the Z axis represented the line of sight, 
the X axis was the horizontal line through 
the origin perpendicular to the line of sight, 
and the Y axis was the vertical line through 
the origin perpendicular to the line of sight, 
Mathematical manipulations in this theoreti- 
cal space were made in terms of an arbi- 
trary "unit" of distance. Figure 1 illus- 
trates the geometry of the theoretical space, 
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Transformations—The texture was either 
rotated 360° about one of the three axes, 
or translated at an equivalent rate (64 
units in 120 frames) along one of the three 
axes. The three rotations and three trans- 
lations constituted the six basic motions. 

Textures—The textures were composed 
of points randomly selected within the con- 
fines of geometric forms. A two-dimensional 
and a three-dimensional form were selected 
for each transformation, with the criterion 
that no edges be visible within the field of 
view of O when the transformation is dis- 
played. (This criterion could not be met 
for rotations of two-dimensional textures 
about the X or Y axis, In this case the 
number of frames on which an edge was 
visible was minimized.) For the rotations 
about X and Y, the forms were a sphere of 
1 unit radius and a plane 3 units x 30 units. 
For the rotations about Z, a circle of 1 unit 
radius replaced the plane, as there was no 
problem of edges coming into view. A cir- 
cle and sphere were also used for the trans- 
lations along the Z axis, but they were 
made 4 units in radius in order to fill the 
display area when they were most distant 
from the projection point. A plane 2 units 
X 10 units and a Cylinder 1 unit in radius 
and 10 units long were used in the X and 
Y translations. 

Projections —After a texture was gen- 
erated in three-dimensional Space, and each 
time it was transformed by 1/120 the total 
transformation to be displayed, the points in 
the texture were projected onto the plane 
Z —0. Polar projections were produced by 
an algorithm equivalent to drawing lines 
between the points in the texture and a near 
Projection point (0, 0, 3), and displaying 
the points at which these lines intersect the 
plane Z=0. Parallel projections were pro- 
duced simply by retaining the original X 
and Y coordinates of each point in the tex- 
ture and setting the Z coordinate to 0. In 
both cases, the stimulus film was composed 
of small uniform dots representing the 
Points on the plane Z=0. As the trans- 
lation along X and Y and the rotation about 
Z of two-dimensional textures do not re- 
sult in any variation of the points along the 
Z axis, method of projection is irrelevant 
for these displays. The difference between 
the two methods of projection is illustrated 
in Fig. 1. The geometry of projected visual 
displays is further discussed elsewhere 
(Braunstein, 19622). 

The projected textures were represented 
on film by white dots approximately .0075 
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Fıc. 1. Geometry of the theoretical space 
used in generating the stimuli, (Points in 
three-dimensional space are represented by 
#’s; projected points are shown as dots.) 


units in diameter. The dots were uniform 
in size, shape, and brightness and did not 
vary in these characteristics in accordance 
with the positions in three-dimensional space 
which they represented. Approximately 1,000 
dots were visible on a frame of a stimulus 
sequence. For transformations in which the 
number of dots varied from frame to frame, 
the median number was approximately 1,000. 

The two types of transformation with re- 
spect to three axes for two levels of texture 
dimensionality and two methods of projec- 
tion yield 24 stimulus conditions. As method 
of projection does not affect the algorithm 
for generating the displays in three cases, 
21 sequences were produced. These 120- 
frame sequences, separated by equal amounts 
of black leader, were organized according 
to type and axis of motion, but randomized 
with respect to dimensionality and method 
of projection. 


md 
PROJECTOR ~ 


Fic. 2. Schematic representation of the 
arrangement for displaying the stimulus 
films. 
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Apparatus 


An L-W Model 224-A 16-mm. motion- 
picture projector, operated at 16 frames per 
sec., was used to display the stimulus film. 
The projector was housed in a sound- 
absorbent box equipped with a blower which 
provided a masking noise. A 4-ft. square 
Polacoat translucent projection screen was 
located between the projector and S, 6 ft. 
from S’s eye. The S viewed the screen mo- 
nocularly through a tube which restricted 
his field of view to a circular area inscribed 
within the 4-ft. screen. Partitions were ar- 
ranged in the room so that S did not see 
the projector or the screen at any time, ex- 
cept for the portion of the screen visible 
through the tube. Background and dot 
luminances were approximately .004 and 1.3 
ftL., respectively. 

A distance unit in the stimulus display 
was made equal to 2 ft. when projected onto 
the screen. The projection screen thus dis- 
played the plane Z=0 for -1<X<+1 
and —1<Y<+1 with S’s eye at (0, 0, 
3), the polar projection point. The geom- 
etry of the viewing arrangement is illus- 
trated in Fig. 2. This arrangement was 
designed to eliminate factors unrelated to 
the geometric properties of the transforma- 
tions which might serve as cues to two- 
dimensional perception, such as border, bi- 
nocular vision without disparity, and lack of 
differential convergence and accommodation. 

A translation device and two rotation de- 
vices were constructed. The translation 
device consisted of a platform to which two 
shafts could be attached. The platform 
was motor driven and operation of a three- 
position toggle switch could cause it to move 
in a straight line in either direction, at a 
rate of 1.1 in. per sec., for a total of 16 in. 
The rotation devices consisted of reversible 
8-rpm motors mounted in separate boxes. 
Two three-position toggle switches in a 
separate control box could cause the shaft 
on either motor to rotate in either direction. 
Transparent plastic and glass models, which 
could be mounted on these shafts, consisted 
of a 2-in. diameter circle, a 2X 8 in. rec- 
tangle, a 2-in. diameter sphere, and a 2X 8 
in. diameter cylinder. Large numbers of 
small, irregularly spaced dots were painted 
on these models. The translation or rota- 
tion could be caused to occur with respect 
to either of the three axes by orienting the 
device in one of three directions. 
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Procedure 


The stimulus film was viewed by half of 
the Ss in a forward direction and by half 
in a reverse direction. For each direction, 
half of the Ss first viewed all translations 
and half first viewed all rotations. The in- 
structions were the same for both types of 
transformation, with the exception of the 
phrases enclosed in parentheses below, which 
were specific to the translation or rotation 
Portions, respectively. The Ss were in- 
structed to look through the tube monocu- 
larly, and told that 


The displays will represent the shapes and 
motions possible with the objects in (this 
box) (these boxes). After each display, 
I would like you to produce the motion 
which appears the more similar to the one 
displayed (and to select) (with) the ob- 
ject which appears the more similar. If 
there appeared to be no motion, you may 
leave the switch in the center position 
and just select an object. 


At the beginning of the experiment, and 
each time the type of transformation or 
axis was changed, S' was asked to try the 
various switch Positions, and observe the 
resulting motions of the models. 

The translation. or rotation device was 
positioned in advance by E for each axis. 
The film was Stopped after each 120-frame 
stimulus sequence and a light was turned on 
while S' responded. 


RESULTS 


Table 1 shows the percentage of Ss 
who selected a three-dimensional model 
in response to each stimulus, and the 
percentage of Ss who correctly repro- 
duced the direction of the transforma- 
tion used in generating the stimulus 
film sequence, The statistical signifi- 
cance of each percentage was tested 
under a null hypothesis of 50%. If 
this hypothesis could not be rejected 
with p < 05, a further test was made 
against an alternative hypothesis of 
75% (or 25%). Unless otherwise 
noted in the table, for cases in which 
the null hypothesis was not rejected, 
the alternative hypothesis of 75% was 
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rejected with a p< .05. (In all cases 
in which the null hypothesis could not 
be rejected an alternative hypothesis 
of 80%, or 20%, could be rejected 
with p < 05.) These probabilities are 
based on binomial expansions. Tests 
of alternative hypotheses are included 
to indicate the confidence with which 
failure to reject the null hypothesis may 
be used as evidence that depth or direc- 
tion was not reliably discriminated for 
a given stimulus, A factorial analysis 
was not considered appropriate due to 
the uniqueness of the individual stimuli 
resulting from combinations of the 
stimulus dimensions. 

Translations with respect to the X 
and Y axes resulted in the selection of 
two-dimensional models when the stim- 
ulus was generated as two-dimensional, 
and three-dimensional models when the 
stimulus was generated as three-dimen- 
Sional and displayed using a polar pro- 
jection, Dimensionality was not reli- 
ably discriminated for the parallel 
projection of a three-dimensional tex- 
ture. Direction was correctly judged 
for all X and Y translations. 

Textures translated with respect to 
the Z axis were judged as two-dimen- 
sional for parallel projections of two- 
dimensional textures and three-dimen- 
sional for polar projections of three- 
dimensional textures, Dimensionality 
was not reliably discriminated for the 
other stimulus combinations involving 
Z translations. Direction was judged 
correctly for the polar projections, but 
parallel projections were judged to 
Tepresent stationary textures, 

Dimensionality was correctly judged 
for all rotations about the X and Y 
axes but direction was correctly judged 
only in the case of polar projections. 
For rotations about the Z axis, dimen- 
sionality was not reliably discriminated 
for any of the three stimuli but direc- 
tion was judged correctly for all three. 
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TABLE 1 
RESPONSES TO THE Basic Ricip MOTIONS OF THE VISUAL FIELD 
Stimulus % 3-D % Correct 
Transformation 
Axis Form Projection Responses Direction 
Translation x 2-D Parallel 318 97^ 
Polar 
3-D Parallel 44 97^ 
Polar 91* 948 
Y 2-D Parallel 9s 94^ 
Polar 
3-D Parallel 38> 94^ 
Polar 94a 94^ 
Z 2-D Parallel 228 6ra 
Polar 38> 94^ 
3-D Parallel 59 12%" 
Polar 88* 91* 
Rotation X 2-D Parallel 6* 44 
Polar 6» 91* 
3-D Parallel 100* 66° 
Polar 100^ 81^ 
14 2-D Parallel 3r 34° 
Polar 3* 94^ 
3-D Parallel 100" 62 
Polar 94» 94a 
Z 2-D Parallel 44 97s 
Polar 
3-D Parallel 53 100* 
Polar 50 100* 


* Probability of Type I error «.05 with null hypothesis of 50%. 
b Probability of Type II error with alternative hypothesis of Sh 
* Probability of Type II error with alternative hypothesis of 757% 


49% incorrect direction, 84% still. 
* 9% incorrect direction, 78% still. 


Discussion 


The description of the stimuli entirely 
in terms of geometric transformations 
departs somewhat from the terminology 
usually used to describe specific combi- 
nations of transformations, dimensional- 
ity, and method of projection. The fol- 
lowing paragraphs will therefore relate 
the transformations explored in the pres- 
ent study to the various specific stimulus 
conditions to which special terms have 
been applied, and will compare the results 
obtained with these transformations to 
results obtained through working with 
those individual stimulus combinations. 

The X translation of a three-dimen- 
sional form. with polar projection is 
traditionally. discussed as motion parallax, 
and the present results confirm the ef- 
fectiveness of this "cue" even in the 


absence of relative size and other depth 
cues. The X translation of a two-dimen- 
sional form would be comparable to mo- 
tion parallax with respect to a flat object, 
and the results show that the texture un- 
dergoing this transformation was cor- 
rectly perceived as flat. The X transla- 
tion of a three-dimensional form with a 
parallel projection is equivalent to a 
motion-parallax situation involving very 
distant objects, perhaps seen through an 
optical enlargement system. In this case, 
it was equally likely for a two-dimen- 
sional or three-dimensional model to be 
selected as representing the perceived 
form. While motion-parallax studies 
generally involve X translations, the re- 
sults are essentially the same with Y 
translations. 

Translations along the Z axis, or line 
of sight, are normally studied in the con- 
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text of size constancy. The present 
results show the sufficiency of increasing 
or decreasing the spaces between texture 
elements in eliciting Perceptions of ap- 
proaching or receding textures. At the 
same time, depth can be perceived within 
the texture, apparently on the basis of 
differential rates of increase or decrease 
of this spacing. 

Most of the recent research work on 
motion perspective has employed projec- 
tions of rotations about the Y axis using 
à shadow arrangement. The perception 
of depth in such transformations has 
been referred to by Wallach and O'Con- 
nell (1953) as the "kinetic depth effect.” 
Wallach's findings of ambiguity in the 
direction of rotation for certain forms 
and Gibson and Gibson's (1957) finding 
of accurate perception of direction may 
be accounted for by the difference in type 
Of projection studied. When a polar 
projection, corresponding to Gibson’s 
near light source, is used almost all Ss 
judge the direction correctly, When a 
parallel projection, corresponding to 
Wallach’s more distant light source is 
used, the accuracy of these judgments 
drops to a chance level, The parallel 
Projections of two-dimensional textures 
rotating about the Y axis consequently 
give rise to what has been termed the 
“windmill” or “fan” illusion (Boring, 
1942, p. 270). The Present results also 
demonstrate the ability of Os to distin- 
guish between two- and three-dimensional 
textures rotating about the y (or X) 
axis, regardless of the type of projec- 
tion used. 

Rotations about the Z axis correspond 
to the “stereokinetic” phenomena de- 
scribed by Musatti (1924) and more re- 
cently studied by Wallach, Weiss, and 
Adams (1956). In these studies, depth 
may be perceived at first only by some 
Ss and by others after Suggestion. This 
transformation apparently provides in. 
Sufficient information for either a two. 
Or three-dimensional perception to be 
reliably elicited, and the present results 


Show an equal number of selections of 
each model. 


The present study appears to demon- 
strate the validity of Gibson's arguments 
concerning the sensitivity of the human 
O to transformations in the visual field 
and the consequent meaningfulness of a 
Psychophysics of depth perception, A 
systematic evaluation of the responses 
elicited by transformations in the visual 
field can account for the wide variety of 
depth phenomena and “illusions” result- 
ing from motion of stimulus patterns, 
Manipulations of texture, depth, and 
method of projection further account for 
the variations in results found with spe- 
cific types of motion. 

A general finding of interest was that 
no texture generated as three-dimensional 
was judged to be two-dimensional by 
significantly more than 50% of the Ss. 
The textures which were judged as two- 
dimensional by most Ss had been gen- 
erated as two-dimensional and under- 
went transformations yielding a clear 
distinction between two- and three-dimen- 
sional textures, These findings do not 
support the traditional concept that two- 
dimensional Perception is primary and 
three-dimensional perception requires the 
addition of special cues (as found, e.g, 
in Osgood, 1953, P. 249; Woodworth & 
Schlosberg, 1954, p. 455). The assump- 
tion on which this concept is based, that 
Derception is tied in some isomorphic 
manner to the structure of the retina, 
appears unnecessary, Instead, support 
is provided for the postulate that the 
human O is sensitive to transformations 
of patterns of light stimulation to the 
retina, and can perceive motion and depth 
On the basis of such transformations. 
These transformations may contain either 
information leading to the perception of 
two dimensions or information leading 
to the perception of three dimensions. In 
the absence of either type of information, 
depth is ambiguous, 
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ALL-OR-NONE ASSUMPTIONS IN CONCEPT IDENTIFI- 
CATION: ANALYSIS OF LATENCY DATA? i 


JAMES R. ERICKSON, MYRON M. ZAJKOWSKI, AN» EVAN D. EHMANN 
Ohio State University 


responses; (b) latencies on error and on correct trials were not 
different; (c) latencies declined during the presolution period, 
especially on trials following errors; and (d) latencies showed a 
"learning curve" on trials following the last error. Results were dis- 
cussed with respect to sampling-with-replacement assumptions of 
current all-or-none theories of concept identification. It seems probable 
that these assumptions are not correct and that Ss sample with partial 
replacement on trials which follow errors. 


Two recent theories of concept- 
identification behavior involve quite 
similar logical and mathematical struc- 
tures. Restle's (1962) strategy-selec- 
tion model and Bower and Trabasso's 
(1963a) — dimension-selection model 
both involve the basic notion that S. 
in this type of problem acts as a hy- 
pothesis tester (or dimension selec- 
tor), testing various hunches about the 
solution of the problem. If, on any 
particular trial, the hypothesis leads 
to a correct response, it is retained; 
but if it leads to an error, it is put back 
into a hypothesis pool from which a 
new hypothesis is drawn, Both mod- 
els involve sampling with replacement, 
which makes the probability of a solu- 


tion of the problem on any particular 
trial independent of the trial number, 
and the entire process can be described 
as a two- or three-state Markov proc- 
ess with a terminal absorbing state, 

Both of these models are all-or-none 
with regard to response probabilities, 
They have been tested experimentally 
and seem to provide a good description 
of response frequency data (Bower & 
Trabasso, 1963a, 1963b; Trabasso & 
Bower, 1964) in some fairly complex 
situations. 

By making one “reasonable” as- 
sumption, several predictions about re- 
sponse latency can be derived from 
these models. The assumption is: re- 
sponse latency (L) on Trial n is 
monotonically related to H, the size of 
the hypothesis pool on that trial, The 
following Predictions follow from the 
above assumption and from the axioms 


urther réproduction is authorized to Satisfy 
the needs of the United States Government, 


of the above models: 

1. L will be longer on trials follow- 
ing errors than on trials following cor- 
rect responses, Following correct re- 
sponses, H = 1, while following an 
error H = 2? for D binary dimensions 


(assuming S is well informed of the 
nature of the problem), 
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2. L, on trials preceding the last er- 
ror, should be a constant across trials. 
H does not depend on the trial num- 
ber, since sampling with replacement 
is assumed. 

3. L, on trials following the last er- 
ror, should be constant and equal to L 
on trials following correct responses 
for trials preceding the last error. H 
should equal 1 in both cases. 

4. For binary dimensions, L on er- 
ror trials and on correct response trials 
should be equal. During the presolu- 
tion period the probability of a correct 
response by chance is approximately 
.50, so correct responses follow on 
the average 50% errors and 50% cor- 
rects (and so do errors) because the 
model assumes independence from 
trial to trial during the presolution pe- 
riod (see Suppes & Ginsberg, 1963). 

5. Following uninformed errors, L 
should be the same as following cor- 
rect responses. If S is not told about 
an error, there is no reason for re- 
sampling. 

There are relatively few studies 
analyzing response latency with respect 
to all-or-none assumptions. Millward 
(1964) presents paired-associate learn- 
ing data which seem to support Predic- 
tion 2 above, but contradict 3; Wil- 
liams’ (1962) data also seem to con- 
tradict Prediction 3, though she did not 
analyze her data in terms of trials be- 
fore and after the last error. Siegel 
(1964), in a  concept-identification 
study, reports data supporting Predic- 
tion 2, but contradicting 1 and 3. The 
present study presents data relevant to 
all five predictions in somewhat more 
complex concept-identification tasks. 


METHOD 


Subjects—The Ss were 54 students who 
were recruited from the introductory psy- 
chology S pool. Participation in experiments 
was a course requirement. 


Procedure.—Each S received fairly de- 
tailed instructions concerning the nature of 
the task—emphasizing that the solution to 
each problem involved only one dimension, 
that E would assign Responses A and B to 
the two values of the correct dimension and 
that S's task was to discover E's rule for 
response assignment. Two practice prob- 
lems, each involving two binary dimensions, 
were then solved by S. Before solving each 
of the main problems, the several dimensions 
were pointed out, and Ss were allowed to 
examine sample stimulus materials. It was 
assumed that, following this training, H 
would be fairly constant across Ss, since the 
type of solution and the particular dimensions 
involved were well-known. 

Each S solved three main problems. One 
was a control problem, where the assignment 
of responses was constant throughout the 
learning period. One involved reversal 
shifts, where response assignments were 
reversed within a dimension following every 
other error. The third involved nonreversal 
shifts, where response assignments were 
shifted back and forth between two dimen- 
sions after every other error. Following the 
Bower and Trabasso (1963a, 1963b; Tra- 
basso & Bower, 1964) procedure, the shifts 
were instantaneous, so that on the shift 
trial S was given feedback according to the 
new response assignment. Thus, for the two 
shift problems, approximately half of the 
errors were uninformed errors, since S was 
actually told that he was correct. 

"The order of the three reversal conditions 
was partially counterbalanced. One third of 
the Ss learned the problems in each of the 
following three orders: C-R-NR, NR-C-R, 
R-NR-C. The assignment of reversal con- 
ditions to stimulus materials was also par- 
tially balanced. Of the six possible assign- 
ments three were used, with each problem 
appearing once with each reversal condition, 
one third of the Ss receiving each assign- 
ment. The correct solution for each problem 
was randomly chosen for each S. The learn- 
ing criterion was 20 successive correct re- 
sponses (S was informed of the criterion). 

Materials.—The 3  concept-identification 
problems each contained four binary dimen- 
sions. The problems and examples of the 
four dimensions are shown in Fig. 1. 

The various stimuli (24 for each problem) 
were placed on 35-mm. slides, which were 
arranged in several random orders for each 
problem. The Ss, run individually, were 
seated in a booth containing a response box 
with two buttons and two feedback lights, 


[Dimensions] 


Thick vs Thin Border 

Solid vs Doshed Border —— 

1 Horizontal vs Vertical Internat Une 
Upper vs Lower internal Line Position 


SZL 
, 

Doisy vs Tulip 
toe Mund emet 
Handie vs No Handie 

B d4 Bs 
Capital vs Small Letters 
One vs Two Letters. 

Fic. 1. Examples of concept-identification 


stimulus materials used, 


Stimuli were rear 
projected onto a Screen at the front of S’s 


Off. Ten seconds later the next slide came 
on. The Ss wore earphones into which low- 
level white noise Was projected to eliminate 
distracting sounds from the projection equip- 
ment, timers, relays, etc, 


REsULTS 


Frequency data.—The mean number 
of errors (informed) and the mean 
trial of last error for the three reversal 
conditions are shown in Table 1, 
Analysis of variance of the number of 


F (2, 106) = 


the control problem (T, = total errors 
for the control problem), 47, 2 — 
11.72 for the Ieversal problem, and 
4T. —2= 11.64 for the nonreversa] 
Problem. None of these predictions is 
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significantly different from the ob. 
tained values, ¢ (53) — 0.33, 1.00, and 
0.22, respectively. 

Further analyses of the frequency 
‘data are shown in Table 2 for the con- 
trol problems only. In this table, e is 
the learning-rate parameter and 5 is 
the proportion of correct responses on 
trials preceding the last error. The 
precriterion tests revealed that there 
was no evidence for learning prior to 
the final error (stationarity) and that 
trial-to-trial responses were independ- 
ent (independence), but that for one 
of the three problems, grouping the 
trials into blocks of four did not yield 
a binomial distribution for number of 
errors (binomial). These tests were 
suggested by Suppes and Ginsberg 
(1963). Other Statistics computed 
from the Bower and Trabasso (19632) 
model show fairly close agreement with 
the observed data for expected values, 
but not for SDs. The model con- 
sistently underestimated variability for 
almost all Statistics, the worst case 
being trial of last error, 

Latency data.—Preliminary analyses 
showed no latency differences among 
the three types of stimulus materials, 
nor were there any significant differ- 
ences between latency on trials follow- 
ing uninformed errors and on trials 
following Correct responses, There 
were 17 Ss who made at least one 
error and one correct response preced- 
ing the trial of the last error on all 


TABLE 1 


Summary Error DATA FOR THREE 
REVERSAL CoNDITIONS 


Mean 

Mean : 

Cond. ” | No. | sp | rial. | sp 
Errors mae 

Control $4] 343 |a23| 7.28 | sao 

NoremelShit | $4 | 203 [423] 1125 19.09 

Nonreversal Shift | 54] 341 |$00| 14-31 12.95 


ns 
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TAB 


LE 2 


COMPARISON OF OBSERVED DATA FOR CONTROL PROBLEMS WITH PREDICTIONS FROM 
THE BOWER AND TRaBAssOo (1963a) CowcEPT-IpENTIFICATION MODEL 


Square Flower Letter Combined 
€ .240 .310 .346 .292 
$ 468 .520 .583 .522 
n 18 18 18 54 
Precriterion Tests 
df xh b df x b df | x? b df xA ? 
Stationarity 4 |142]|2».80| 4 | 3.51 | 2.5 3 | 144] >.70] 6 2.74 | >.80 
Independence | 1 .06 | >.80| 1 | 1.66 | >.18| 1 30] 2.60| 1 .91 | 2.40 
Binomial 4 | 2.95 | >.60| 4 | 5.74 | >.20] 4 |11.13| «.05| 4 |14.89| <.01 
Other Statistics 
j Obs. Pred. Obs. Pred. Obs. Pred. Obs. Pred. 
Mean No. 
Errors* 4.167 d 3.222 = 2.889 — 3.426 — 
D 6.17 3.63 3.46 2.67 2.19 2.33 4.23 2.88 
Trial of 
Last Error 7.833 | 7.833 6.944 6.713 7.056 6.922 7.218 7.162 
12.11 7.32 8.60 6.19 7.85 7.07 9.40 6.64 
No. Correct 
Prior to 
Last Error 3.667 | 3.667 3.611 3.491 4.111 4.034 3.796 3.736 
D 6.34 4.14 5.42 3.96 5.36 4.51 5.62 4.21 
No. Correct be- 
tween kth and 
k + 1st Error .880 .880 1.155 1.083 1.442 1.396 1.124 1.090 
SD 1.29 1.28 1.34 1.50 1.62 1.83 1.43 1.51 
Error-Correct 
Alternations 4.433 | 4.500 4.222 3.831 3.944 3.784 4.222 4.052 
No. Error Runs | 2.482 | 1.944 1.889 2.156 1.778 2.101 1.870 2.266 


^ Used to estimate e. 


three of the problems they solved.* 
For these Ss, median latencies follow- 
ing errors and following corrects were 
computed for each problem. Analysis 
of variance of these data revealed that 
latency on trials following errors was 
significantly higher than on trials fol- 


2Separate analyses for each reversal con- 
dition using all Ss who made at least one 
error and one correct response on the pre- 
solution period were also performed. The 
latency differences reported were found in 
each of these analyses which, of course, had 
more Ss per analysis. 


lowing corrects, F (1, 16) = 18.927, 
p< 001, that there were no signifi- 
cant differences among the three re- 
versal conditions, F (2, 32) — 1.447, 
p> .25, and that the interaction be- 
tween trial type and reversal condition 
was not significant, F (2, 32) — 0.373, 
p> 75. Mean latencies for the vari- 
ous conditions were shown in Table 3. 

Since so few Ss could be included in 
the above analysis, a second analysis 
was performed which included all Ss. 
By definition, the first trial of the cri- 


694. 


terion run is a sampling trial, whether 
it follows the last error or whether it 
occurs on the first trial; and the sec- 
ond postcriterion follows a correct re- 
sponse for all Ss, so no resampling 
should occur. A f test on latencies for 
these two trials was performed. For 
each S the difference between the 
mean of the latencies on his three first- 
postcriterion trials (one from each 
problem) and the mean of his three 
second-postcriterion trials was com- 
puted. The mean difference score was 
significantly different from 0, t (53) 
= 2.61, p < 02. 

Similar analyses to the above were 
performed for latencies on error trials 
and on correct-choice trials during the 
presolution period. No significant dif- 
ferences were found; in all cases F 
ratios were approximately 1.0, 

Figure 2 shows posterror and post- 
correct latencies as a function of trials 
preceding the last error and also shows 
latencies on trials following the last 
error, (Since no differences for vari- 
ables other than trial type were found 
in the preceding analyses, the three 
problems each S solved were com- 
bined, and mean trial-by-trial latencies 
on all three problems were computed.) 
Figure 2 clearly shows the latency dif- 


TABLE 3 


MEANS or MEDHUN CHOICE LATENCIES (iN 
SEC.) ON TRIALS AFTER ERRORS AND AFTER 
Correcr RESPONSES DURING THE PRE- 
SOLUTION PERIOD FOR 17 Ss Wuo 
MADE AT Least ONE ERROR AND 
ONE ConREcT RESPONSE DUR- 

ING Tuis PERIOD ON Arr, 

THREE CONDITIONS 


Control 


—— ———À 
After Errors | 520 
After Corrects} 3.64 


M 4.42 
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ferences between trial types during the 
precriterion period. It also shows that 
latencies following errors were not 
constant across trials, but decreased as 
Ss got closer to the criterion trial, ie., 
decreased as the number of errors 
made increased. It is also clear that 
latencies following the criterion trial 
decreased, forming a normal “learning 
curve.” 

It might be argued that the decrease 
in latency following errors in the pre- 
solution period was artifactual since 
different Ss appear at different points 
on the curve—some Ss made only one 
or two errors while some made many, 
so the latencies 20 trials before the 
last error are those of “slow learners” 
(though that term has little meaning 
ina hypothesis-sampling model) who 
might take longer to make choices than 
“fast learners,” The following two 
analyses are relevant to this point, 
First, the data of only those Ss who 
had 21 or more precriterion trials were 
analyzed. Their latency data did not 
differ in any obvious way from the 
data shown in Fig. 2; if anything, the 
trends were even clearer. Second, for 
all cases where there were at least two 
errors and two correct responses dur- 
ing the presolution period (a total of 
45), an analysis of variance was per- 
formed comparing latencies during the 
first half of their precriterion trials 
with the last half? The analysis of 
variance showed again that latency 
after errors was longer than latency 
after corrects, F (1, 44) — 27.50, 
b < .001, that latencies during the first 
half of the precriterion period were 


3 This analysis assumes that the S problem 
is the functional sampling unit and that these 
units are randomly sampled. This assump- 
tion is not uncommon in analyses of all-or- 
none models and, if the model is correct, is 
reasonable, even though some Ss contribute 
more than one sampling unit to the analysis 
of variance. 
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T0 


o After Errors 
e After Correct 


60 


Meon Latency in Seconds 
r3 o 
o o 


g 
o 


20 12 j 
Precriterion Trials 
Two-Trial Blocks 


Fic, 2. Mean latency on trials preceding the last error and on tri 
rion trials show latencies on Trial » following errors and following 


r all problems and all Ss combined.) 


error. (The precrite: 
correct responses on Trial »-1 fo 


longer than latencies during the second 
half, F (1, 44) = 672, p < 02, and 
that the interaction between the two 
variables was significant, F (1,44) 
—16129, p «001. Means for the 
various conditions are shown in Table 
4. This analysis shows that the latency 
decline holds up for the Vincentized 
data, and is almost certainly not 
artifactual. 
DISCUSSION 


With respect to the predictions made 
above, the data are quite clear. Laten- 
cies were longer on trials following er- 
rors than on trials following correct re- 


4 Tio 4 
Last Error 
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[3 20 
Postcriterion Trials 
Two-Triol Blocks 


ials following the last 


sponses, supporting the axioms on re- 
sampling of hypotheses following errors. 
Predictions 4 and 5 were also supported ; 
latencies were not different on error and 
on correct-choice trials during the pre- 
solution. period, and latencies following 
uninformed-error trials were similar to 
those following correct choices, demon- 
strating again that the learning process 
assumed by the Bower and Trabasso and 
the Restle models is reasonable; namely, 
that Ss have an opportunity to learn a 
concept-identification problem following 
trials on which they are told they have 
made an error; on these trials Ss have 
an opportunity to sample the correct hy- 
pothesis (or dimension) from the pool 
of hypotheses, 
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Predictions 2 and 3, on constancy’ of 
latencies across trials, were definitely not 
supported by the data. During the pre- 
solution period, Table 4 shows that mean 
latency following correct responses de- 
clined half as much across trials as did 
latencies following errors. While the 
decline following correct responses could 
be explained by factors such as getting 
used to the general situation, the much 
greater decline following errors needs 
additional explanation. 

While there are always many possible 
causes of a particular experimental re- 
sult, one seems quite plausible here. 
That is that H, the size of the hypothe- 
sis pool, does not remain constant across 
trials, but declines as hypotheses are 
tried and rejected. It seems quite likely 
that Ss can remember at least the hy- 
pothesis which they most recently re- 
jected, and probably more—being lim- 
ited mainly by short-term memory 
capacity. 

A model which posits sampling of hy- 
potheses with partial replacement (either 
random replacement or replacement with 
lag K) would appear to warrant future 
investigation. Unfortunately, the above 
data give little information about the 
value of K. The data in Fig. 2 are not 

. useful for this purpose since Ss who are 
represented at a particular point are at 
different “stages” in the learning process. 
Latency data for trials following errors 
No. 1, No. 2, No. 3, etc., were examined, 
but were too highly variable to allow 
meaningful interpretation. This type of 
data, collected from more highly trained 
Ss, whose latencies would be more stable, 
could prove quite valuable in testing a 
lag-K model. It should also be noted 


TABLE 4 


MEANS oF LATENCIES (IN SEC.) AFTER 
ERRORS AND AFTER CORRECT RESPONSES 
FOR THE Two HALVES OF THE 
PRECRITERION TRIALS 


1st Half | 2nd Half M 
After Errors 4.54 3.93 4.23 
After Corrects | 3.38 3.02 3.20 
M 3.96 3.47 


ee Él 
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that while a lag-K model could account 
for the latency data observed above, it 
would not improve on the relatively poor 
fit of the variance predictions seen in 
Table 2. 

The "learning curve" for latency dur- 
ing the criterion run can probably be 
explained most parsimoniously in terms 
of S's increased confidence in the cor- 
rectness of his hypotheses as correct re- 
sponses follow each other. Increased 
confidence could lead to focusing of at- 
tention more and more on only one as- 
pect of the stimulus, leading to more 
rapid responses. To check the "focus of 
attention" hypothesis, latency data from 
correct response runs during the pre- 
criterion trials were examined to see 
whether the same decrease in latency ap- 
peared. There were 36 runs of four or 
more correct responses during the pre- 
criterion trials for all Ss and all prob- 
lems combined. Means for these laten- 
cies in two-trial blocks were computed 
(analogous to the data presented in Fig. 
2). The mean for the first block was 
4.02 sec. and for the second block was 
3.55 sec. Corresponding means from 
Fig. 1 were 3.71 and 2.99 sec. The 
trends are similar, though the absolute 
values are not, A sign test was run on 
the difference between the first and sec- 
ond two-trial blocks; of the 36 differ- 
ences, 25 were positive. The probability 
of obtaining this or a more extreme re- 
sult is less than .05 (two-tailed). 

The focus-of-attention hypothesis is 
not incompatible with all-or-none theory; 
even though S responds according to 
only one hypothesis, he may be trying 
concurrently to test more than one. This 
hypothesis is compatible with incremental 
theory also. It should be pointed out 
that incremental models could also ac- 
count for the decline in latency during 
the precriterion trials and for the appar- 
ently greater-than-chance response per- 
formance during the precriterion trials 
(see Table 2). 
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EFFECT OF ESCAPE DURATION AND SHOCK INTENSITY 
ON THE ACQUISITION AND EXTINCTION OF 
AN ESCAPE RESPONSE* 


HOMER E. STAVELY, Jr. 
Princeton University 


30 groups of rats were compared in the acquisition and extinction of a 
lever-press escape response. The aversive stimulus was electric shock. 
Each group received a single shock level (either 0.25, 0.40, 0.64, 1.05, 
or 1.65 ma.) and a single duration of escape (either 0, 0.5, 2, 8, 32, 
or 130 sec.). 2 lever presses in the presence of shock were required 
for an escape to occur. Groups run at high shock intensities or long 
escape durations performed better in acquisition than groups at low 
intensities or short durations. Response speed at the end of acquisition 
was an approximately logarithmic function of each independent 
variable. In extinction high shock and long duration groups responded 
more than did low shock and short duration groups. 


In drive-reduction theory (Hull, 
1951; Miller, 1959) drive level and 
amount of reinforcement are presumed 
to influence the performance of a re- 
sponse without affecting the rate of 
learning. Experiments supporting this 
position have typically utilized depri- 
vation-produced drives like hunger, 
and specified the amount of reinforce- 
ment in terms of pellet size or amount 
of food consumed (Crespi, 1942; Hill- 
man, Hunter, & Kimble, 1953; Kend- 
ler, 1945; Metzger, Cotton, & Lewis, 
1957; Zeaman, 1949). 

The effects of drive and amount of 
reinforcement, however, have not been 
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studied in aversive situations in the 
context of the learning-performance 
distinction. An important reason for 
the lack of such studies is that aver- 
sive conditioning involves different 
procedures than appetitive condition- 
ing. In the simplest aversive condi- 
tioning paradigm, escape conditioning, 
drive level is the intensity of a noxious 
stimulus. Amount of reinforcement 
is ordinarily considered to be only the 
amount of noxious stimulus reduction. 
An escape usually consists of complete 
stimulus removal and thus, when drive 
is varied, amount of drive reduction is 
varied simultaneously. This is in op- 
position to the appetitive case where 
individual reinforcements produce only 
fractional drive reduction and allow 
independent variation of drive and 
amount of reinforcement. 

In escape conditioning experiments 
where aversive stimulus intensity is 
varied, higher intensities typically pro- 
duce higher levels of performance 
(Campbell & Kraeling, 1953; Dins- 
moor & Winograd, 1958; Keller, 1941; 
Trapold & Fowler, 1960). 

The use of the escape situation al- 
lows the investigation of another aspect 


698 


ESCAPE DURATION AND SHOCK INTENSITY 


of amount of reinforcement that cur- 
rent theorists have not considered, 
namely duration of escape or, in a 
more general sense, duration of drive 
reduction. With appetitive drives the 
duration of drive reduction is difficult 
to assess because of the relatively slow 
build-up of such drives. However, 
with aversive drives even very short 
durations of drive reduction can be 
made easily discriminable. 

The relationship between duration 
of escape and performance is less clear 
than that between aversive stimulus 
intensity and performance. Harrison 
and Abelson (1959) reported no sys- 
tematic changes in performance when 
escape duration was varied over a 
range from 2 to 20 sec. On the other 
hand, Dinsmoor and Hughes (1956) 
and Dinsmoor, Hughes, and Matsuoka 
(1958) ran different groups of animals 
at escape durations from 5 to 40 sec. 
and found shorter response latencies 
at longer durations. The above stud- 
ies are difficult to interpret on two 
counts, They used relatively low in- 
tensities of aversive stimuli and did 
not present acquisition data. The aim 
of the present research was to extend 
these studies by investigation of the 
effects of escape duration over a wide 
range of drive levels on both the ac- 
quisition and extinction of an escape 
response. 

METHOD 

Subjects —The Ss were 267 male rats of 
the Wistar strain obtained from Dierolf 
Farms in Boyertown, Pennsylvania. Their 
weights ranged from 250 to 350 gm. Of 
these, 27 were discarded, along with their 
data, as follows: 11 for apparatus failure, 
13 for E failure, and 3 for infection or death. 
The animals were received from the sup- 
plier in groups of from 25 to 30 5 days 
before they were to be run in the experi- 
ment, They were housed five to a cage 
and allowed Purina lab chow and water ad 
libitum except when in the experimental 
chamber. . 
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Apparatus.—The Ss were run in two nearly 
identical Skinner boxes 114 in. long, 84 in. 
wide, and 73 in. deep. The floor of each box 
consisted of 3-in. stainless steel rods spaced 
3 in. apart and insulated from each other 
and the rest of the box by Teflon sleeves 
inserted into a Plexiglas strip. The sides 
and top were wired into the shock circuit. 
A constant current ac stimulator (Campbell 
& Teghtsoonian, 1958) provided the aversive 
stimulus. The shock was led to the Skinner 
box via a 12-pole, 10-position stepping switch 
that changed the polarity pattern of the 
grids five times per second. This scrambler 
was enclosed in a heavy brass box that re- 
duced its operating noise to a faint ticking. 
Both Skinner box and scrambler were housed 
in a Scientific prototype No. OP300 en- 
closure lit by a 6-w. houselight. Electric 
timing and switching circuitry controlled the 
shock and recorded each latency on a print- 
out counter accurate to +0.2 sec. 

Procedure.—Thirty groups of eight Ss 
each were run in the experiment. Each 
group received a single combination of escape 
duration and shock level in a factorial 
arrangement of six durations and five shock 
levels spaced in approximately equal loga- 
rithmic intervals. The durations were 0, 0.5, 
2, 8, 32, and 130 sec. and the shock levels 
were 0.25, 0.40, 0.64, 1.05, and 1.65 ma. 

Each S was given 50 acquisition trials a 
day for 5 consecutive days followed by a 
15-min. extinction session on the sixth day. 
A complete replication of 30 Ss was run the 
first week. Thereafter 25-27 animals were 
run in a given week with one representative 
from each group at four of the five shock 
levels plus one or more replacements for 
previously discarded Ss. This arrangement 
minimized the probability that two animals 
from the same shipment from the supplier 
would appear in the same group. Animals 
were run in the same "random" order each 
day and the order was changed each week. 

Pilot work indicated that when animals 
were allowed to escape after every response 
at the highest shock level there were only 
small differences due to escape duration. In 
the hope that increasing the number of 
responses required for an escape would pro- 
duce larger differences, several fixed ratios 
were tried. A fixed ratio of two was 
found to be sensitive to variation in escape 
duration without unduly prolonging acquisi- 
tion. Hence, in the present experiment all 
Ss were run on a schedule of two lever-press 
responses per escape. 

Because one of the purposes of the ex- 
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periment was to compare acquisition of the 
escape response between groups it would 
have been desirable to equate the number of 
escapes for all Ss. However, the pilot data 
indicated that this would be impossible. At 
the lower levels of both escape duration and 
shock intensity, some Ss responded at a 
very low rate or not at all. For this reason 
the number of trials was held constant and 
an upper limit of 31 sec. was placed on the 
duration of a single trial. At the end of 31 
sec. a new trial was initiated, the recording 
timers reset to zero and latency of response 
from this new zero point recorded. 

This procedure, it will be noted, is not 
unlike that used in runways where typically 
if an animal does not reach the goal within 
a specified period of time, S is removed from 
the alley, given the appropriate intertrial 
interval and then returned to the start box. 
The difference in the present procedure is 
that S is not removed from the apparatus and 
there is no intertrial interval. This pro- 
cedure, which may at first seem arbitrary 
and capricious, has several advantages over 
other methods that might be used to study 
the same problem. The straight runway 
cannot be used to study duration of escape 
because the minimum duration of escape is 
limited by the time required by E to remove 
S from the goal box and return it to the 
Start box. A two-way straight alley in 
which S would have to be forced from one 
end to the other at low voltages to receive 
a reinforcement would require arbitrary 
trial lengths similar to those of the present 
study. “All other techniques known to this 
investigator incur similar disadvantages. 

The performance measures recorded in a 
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session were a set of 50 latencies, one for 
each trial. In the usual case where a trial 
began with shock onset and ended in an 
escape, the latency was the time it took the 
animal to emit two responses and produce 
the escape. When a trial did not end in an 
escape, a latency score of 31 was recorded. 


. There was one exception to the above defini- 


tion of a latency. A trial ending in an 
escape but preceded by one or more trials 
not ending in an escape was automatically 
assigned a latency score of 31 since the 
actual latency of response from the last 
reinforcement period was in excess of 31 sec. 
The only differences between the extinction 
session and an acquisition session were that 
bar pressing never produced shock reduction, 
and that the session was terminated after 15 
min. of shock. Extinction data were re- 
corded as number of responses per minute. 


RESULTS 


Acquisition curves for all groups ap- 
pear in Fig. 1. Each panel contains all 
groups at the same shock level. The 
abscissas are days of training and the 
ordinates are mean median response 
speeds for each training day. The mean 
median response speed was determined 
by taking the reciprocal of each trial 
latency (1000/latency in sec.) and then 
finding the median reciprocal latency 
for each animal within a given day's 
session. The mean of the eight me- 
dians in each group was then found 


DAYS OF TRAINING 


Fic. 1. Mean median response speed (1000/latency in sec.) of each group 
as a function of days of acquisition training. 
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and plotted in the figure. It is clear 
from the curves in Fig. 1 that acquisi- 
tion performance is an increasing func- 
tion of both independent variables. 
Because of the 31-sec. limit on trial 


latencies all of the slowest animals ' 


showed the same speed score. There 
were enough of these slow Ss to cause 
a skewed distribution of response 
speeds. This result produced hetero- 
geneous variances and precluded the 
use of a parametric statistical analysis. 
However, a series of two-way chi- 
square analyses of variance (Wilson, 
1956) applied to the data for each day 
of acquisition produced significant x 
values for both main effects in every 
case (p «.01). None of the Dura- 
tion X Shock Level interactions ap- 
proached significance (p> 4). 

Most of the animals receiving either 
low shock levels or short escape dura- 
tions did not learn to escape the shock. 
Learning was measured as whether or 
not an S responded faster on Day 5 
than he did on Day 1. With higher 
values of both variables nearly all ani- 
mals did learn. The fanlike appearance 
of the curves in Fig. 1 cannot, however, 
be entirely accounted for by differences 
in the number of animals learning the 
response. In the groups represented 
by the top curves in each of the three 
right-hand panels all animals learned, 
yet their performance differed mark- 
edly as a function of duration and 
shock level. The differences seen in 
these top curves are statistically reli- 
able. The nine groups at the three 
longest escape durations and the three 
highest shock levels were compared in 
an analysis of variance on the basis of 
their Day 5 median response speeds. 
Variances were homogeneous across 
these groups and the main effects were 
significant; escape duration F (2, 63) 
= 4.22; p < .05; shock level F (2, 63) 
= 5.35; p < 01. The Duration X 
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Shock Level interaction F ratio was 
not significant. 

To show the effects of escape dura- 
tion and shock level simultaneously, a 
three-dimensional graph is presented 
in Fig. 2. The horizontal axes are 
log escape duration and log shock level 
and the vertical axis is mean median 
response speed on Day 5. The plotted 
sürface has the general appearance of 
a plane rising with high values of both 
independent variables. When the 
linear multiple-regression equation was 
calculated for these data it was found 
that the best-fitting plane would ac- 
count for 85% of their variance. The 
relatively close fit of the regression 
plane strongly suggests that response 
speed is directly proportional to the 
log of escape duration and shock level. 

The extinction data were not so 
regular, although the effects of dura- 
tion of escape and of the intensity of 
shock were clear. Figure 3 shows the 
course of extinction. The ordinates 
are median number of bar presses and 
the abscissas are 3-min. segments of 
the extinction session. Each curve 
represents one group and each panel 
contains all groups at the same shock 
level. Although there were some ex- 


Fic. 2. Mean median response speed 
(1000/latency in sec.) on Day 5 as a function 
of log escape duration and log shock intensity. 
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BLOCKS OF THREE MINUTES 


Fic, 3. Median number of bar presses in extinction as a function of 
blocks of 3 min. of the extinction session. 


ceptions it appears that the higher the 
shock level or the longer the escape 
interval the greater the resistance to 
extinction. This generalization is sup- 
ported by the results of a two-way chi- 
Square analysis of variance of the total 
number of responses emitted by each 
S during the entire session. Both main 
effects were significant (p < .01) and 
the Duration x Shock Level inter- 
action was not significant ($ > 9). 


Discussion 


The results clearly show that longer 
durations of escape and high intensities 
of shock both produce faster responding 
in acquisition and more responding in 
extinction. Response speed at the end 
of acquisition was approximately directly 
proportional to the log of escape duration 
and the log of shock intensity. Neither 
the acquisition measure nor the extinc- 
tion measure showed any interaction be- 
tween escape duration and Shock level. 

The acquisition data Support the view 
that drive level and amount of reinforce- 


rather than the rate of learning (cf. 
Metzger, Cotton, & Lewis, 1957. Hill. 
man, Hunter, & Kimble, 1953; Kendler, 
- lfhigher drive or larger amounts 


curves for groups at high shock levels 
or long escape durations should rise more 
steeply to a higher asymptote than curves 
for groups at low shock levels and short — 
durations. This did not happen. In fact 
the groups with high shock or long dura- _ 
tions were possibly slower to reach an 
asymptotic response speed, 

The extinction data are less clear in — 
their implications for a learning-perform- 
ance distinction. The present experiment 
did not test the effects of drive in extinc- 
tion because each group received the 
same shock level in the extinction ses- 
sion that it had in acquisition. How- 
ever, under the assumption that amount 
of reinforcement is a performance vari- 
able it would be expected that when 
amount of reinforcement is equated 
across groups, performance differences 
should disappear, In the extinction ses- 
sion this was the Procedure. Groups that 
had differed in duration of escape were 
all shifted to no escape. Under these 
conditions extinction performance should 
have differed only as a function of shock 
level, However, at a given shock level 
the groups did differ depending on 
whether they had been trained with a 
short or a long escape duration, indi- 
cating that amount of reinforcement does 
influence learning as well as perform- 
ance. This conclusion must be tentative, 
however, since the number of reinforce- 
ments was not equal across all groups, 
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WITHIN-SUBJECTS PARTIAL-REINFORCEMENT EFFECTS IN 
ACQUISITION AND IN LATER DISCRIMINATION LEARNING* 


KENT HENDERSON ? 


University of Toronto 


50 male albino rats, in 5 groups, were run 4 trials a day, 2 in a black 
and 2 in a white runway. Ss were continuously rewarded on 48 trials 
in 1 runway (Sif) and rewarded on a percentage of the 48 trials in 
the other (S:+) at 100, 50, 25, 12.5, or 0%. In a 2nd phase, conditions 
were changed so that none of the trials to S; were rewarded while all Ss 
trials were. In Phase 1, speeds to S+ increased as a negatively accel- 
erated function of percentage of reward. Speeds to Sit increased 
slightly as well. In Phase 2, resistance to extinction to Si— was 
related to prior percentage of reward to Ss but speeds to Ss-- were 
not The findings were interpreted within the framework of Amsel's 


theory of frustrative nonreward. 


The present experiment examines 
the effects of different percentages of 
reward to the negative stimulus in 
successive discrimination training upon 
acquisition performance and upon re- 
sistance to extinction to the non- 
rewarded stimulus in later reversal of 
the discrimination. The experiment is 
a replication and extension of recent 
work on the effect of partial reinforce- 
ment on later discrimination learning 
(Amsel & Ward, 1965). 

Partial reward acquisition effects.— 
In simple learning, where one group is 
run to continuous reward and another 
to partial, recent studies (reviewed by 
Amsel, 1962) have shown that asymp- 
totic running speeds are faster to par- 
tial than to continuous reward. Am- 
sel, MacKinnon, Rasbotte, and Sur- 


1This article is based on a dissertation 
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fillment of the requirements for the PhD 
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a grant, G-13895, from the National Science 
Foundation to Abram Amsel. The author is 
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encouragement, and to Glenn Macdonald 
and John Arrowood who served as mem- 
bers of his thesis committee. 

? Now at the University of Manitoba. 


ridge (1964) reported similar effects 
using a within-Ss design in which the 
same S's were run to continuous (W+) 
and partial reward (BÆ) in white and 
black runways, respectively. In addi- 
tion, goal speeds were slower to the 
partial stimulus as they are in between- 
Ss designs. On the basis of the similar 
effects of continuous and partial re- 
ward in the two designs, Amsel et al. 
(1964) concluded that the effects of 
different schedules of reward on ac- 
quisition performance are highly stim- 
ulus specific. 

If the effects of different schedules 
of reward to two stimuli are independ- 
ent in successive discrimination, then 
asymptotic response speeds to a con- 
tinuous stimulus (St) should be the 
Same regardless of the percentage of 
reward to a second stimulus (S+). 
Extrapolating from the effects of per- 
centage of reward in simple condition- 
ing (Spence, 1960; Weinstock, 1958), 
asymptotic running speeds to S+ 
should exceed those to St at inter- 
mediate levels of percentage of reward 
and fall below this speed only as per- 
centage of reward to S- approaches 
zero. Near the goal, asymptotic speeds 
to SÆ should decrease as percentage of 
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reward varies from 100 to 0. These 
effects were predicted for Phase 1 of 
this experiment. 

PRE in discrimination learning.— 
Amsel and Ward. (1965) reported that 
prediscrimination training to S— under 
partial reward retarded discrimination 
learning more than similar pretraining 
to S+. However, they found that pro- 
longed partial reward prediscrimina- 
tion experience to either S— or S+ 
substantially retarded discrimination so 
that the partial-reinforcement effect 
(PRE) appeared to generalize across 
stimuli contrary to the independence 
hypothesis suggested by Amsel et al. 
(1964) on the basis of acquisition 
performance. 

The present experiment Was de- 
signed to investigate generalization of 
the effects of partial reward in a 
within-Ss design on acquisition per- 
formance and on later discrimination 
learning. In Phase 1, five groups of 
Ss were trained to run a white runway 
to continuous reward (S4) and a 
black runway to one of five percentages 
of reward, 100, 50, 25, 12.5, or 0 
(SE). In Phase 2, all groups were 
run under a schedule in which the 
former continuous stimulus was never 
e e ues) while the 
former partial stimulus was continu- 
ously rewarded (S, S,4-). In this 
discrimination phase, the PRE pro- 
duced in Phase 1 by partial reward to 
S,-- could generalize to S,— in Phase 
2 and delay discrimination by increas- 
ing resistance to extinction of the re- 
sponse to S,—. Groups receiving 50, 
25, and 12.596 reward in Phase 1 were 
expected to discriminate slowly in 
Phase 2 while the other two groups, 
with no partial reward training, were 
expected to discriminate quickly. 


METHOD 


Subjects —The Ss were 50 male albino 
rats obtained from Woodlyn Farms, Guelph, 
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Ontario, when 85 days of age. They were 
housed individually and provided with water 
ad libitum and 10 gm. of food per day that 
included the 500 mg. Noyes pellets used as 
the reward. 

Apparatus —The apparatus was a straight 
runway having an entry box (EB) and a 
start box (SB) fixed in tandem to a table 
top, and two movable, parallel alleys, either 
of which could be aligned with the start door 
to form a single runway. Guillotinetype 
retrace doors divided the alleys into integral 
runway and goal box (GB) sections. One 
of the alleys was painted flat black and the 
other flat white, and the EB and SB were 
medium gray. 

The lengths of the EB, SB, runway, and 
GB were 93, 12, 253, and 134 in, respec- 
tively. The width was 3 in. with the excep- 
tion of the SB which was 2 in. wide for the 
8 in. adjacent to the start door, and the 
runways which were 2 in. wide at the start 
door widening to 3 in. gradually over the 
first 7 in. The entire apparatus had 31-in. 
high sides and was covered with Plexiglas 
hinged over the EB and GB. The end of 
the GB held a metal food trough painted the 
same color as the alley and made with a 
high lip so that S could not see into it until 
the last photobeam was broken and the goal 
measure determined. Photocells and source 
lights, equipped with red filters, were 2 in 
above the alley floor and situated at 12, 24, 
and 36 in. beyond the start door to provide 
measures of latency, running time, and goal 
entry time. The last photocell was 3 in. 
from the end of the GB. The experimental 
room was illuminated by two 25-w. lights 
in translucent fixtures located in the ceiling 
about 80 in. above the runways and 18 in. 
to the left of the positions of the EB and GB. 
The ventilation system provided controlled 
temperature and humidity as well as a con- 
tinuous, mild masking noise. 


Procedure 


Habituation—The experimental period be- 
gan 35 days after Ss arrived. During the 
interval, Ss were fed 10 gm. of Purina lab 
chow daily in their home cages at the ap- 
proximate time the squads were to be run. 
They were handled daily for the first 14 
days and then periodically to maintain tam- 
ing. Two days before the experimental 

riod Ss were placed in a carrying cage in 
squads of 10 for 5 min, The carrying cage 
had 10 compartments and was used to trans- 
port Ss to and from the experimental room 
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and to detain them between trials. On the 
next day, they were detained in the experi- 
mental room for 5 min. Phase 1 began on 
the following day. 

Phase 1l——Four trials were run each day, 
two to S and two to Ss, so that over a 4-day 
block of trials, each S ran eight times to Sı 
and eight to S» Trials to Sı were re- 
warded continuously while percentage of 
reward to Ss varied across groups as fol- 
lows. Of each eight trials to Ss Group 
100-100 was rewarded on all eight, Group 
100-50 on four (one per day), Group 100-25 
on two, Group 100-12 on one, and Group 
100-0 on none. To counterbalance possible 
effects of color of the alleys, S: was black 
for one half of the Ss and white for the 
other half. 

Trial orders to Si and S» were arranged 
from combinations 1212, 1221, 2121, and 2112 
which allow two black and two white trials 
per day but permit no more than two suc- 
cessive trials to the same color. Five dif- 
ferent trial orders were developed from 
these combinations with the restriction that 
a combination could be used only once in 
each 4-day (16-trial) block. Two Ss from 
each group were assigned to one of the five 
trial orders so that S: was black for one of 
the Ss and white for the other. For par- 
tially rewarded Ss, reward schedules were 
developed, one for each pair of Ss assigned 
to each trial order within a group. In this 
way, each color-control pair of Ss was run 
under the same trial order to S; and Ss and 
was rewarded on the same trials, the only 
difference was the color associated with par- 
tial and continuous reward. 

Five squads of 10 Ss each were formed, 
each squad containing a color-control pair 
from each group but restricted so that no 
two pair could have the same trial order. 
The Ss within each squad were assigned 
randomly to positions in the carrying cage 
and run in a different order each day. These 
procedures were adopted to minimize re- 
peated patterns of reward and nonreward 
that could serve as differential cues. One 
500 mg. Noyes pellet was used as a reward. 
The minimum intertrial interval was 15 min. 
The squads were fed 30-40 min. following 
their daily trials. 

Phase 2.—Following 96 acquisition trials, 
Phase 2 was begun. The procedure re- 
mained the same except that trials to Si 
were not rewarded and all trials to Sa were 
rewarded. . After 120 discrimination train- 
ing trials, Phase 2 was terminated. 


KENT HENDERSON 


ReEsuLts 

Starting, running, and goal entry 
times were transformed to speeds 
(ft/sec) for each S and summed sepa- 
rately in four-trial blocks to S, and S,. 
In the following, the term “differential 
reinforcement” refers to the within-Ss 
effect of the two reinforcement sched- 
ules on speeds to S, and S,, “percent- 
age of reward” refers to the between- 
groups effect of percentage of reward 
to S,+ in Phase 1, and "color" refers 
to the effect of stimulus color. 


Phase 1 


Mean running and goal speeds to 
S,f and S,= are presented in Fig. 1 
for each of the five groups over the 12 
blocks of trials in Phase 1. Terminal 
response speeds appeared close to 
asymptote. Mean response speeds to 
S;t and S,-, in Terminal Blocks 11 


RUNNING 


MEAN SPEED ( FEET PER SECOND) 


1234567890012 1 234567891112 
BLOCKS OF EIGHT TRIALS 


Fic. 1. Mean running and goal speeds to 
Sı and S.+ for each group in Phase 1. 


WITHIN-Ss PARTIAL-REINFORCEMENT EFFECTS 


and 12 combined, are presented in the 
left-hand panels of Fig. 2 for starting, 
running, and goal speeds, separately, as 
functions of percentage of reward. 
The data conformed well to pre- 
dicted relations between asymptotic re- 
to Sj and S; In 
blocks of acquisition, 
increased as à 


to S, showed a slight, nearly linear 
increase, the two functions meeting at 
100% where the two reward schedules 
were identical 
periority of speeds to 
tained in all groups except 100-100, so 
that the difference in speed to the two 
stimuli decreased as percentage of re- 
ward to S, increased. 

The relations were not identical for 
running speed. Terminal running 
speed to S,+ increased as a negatively 
accelerated function of percentage of 
reward to a maximum speed by 5096 
which was maintained at 100%. 
Mean running speed to S, X increased 
slightly over the range from 0 to 
100% as well. The difference in speed 
to the two stimuli favored S,f at 0 
and 12.5% but actually favored So 
at 50%. Group 100-50 was similar to 
the one of Amsel et al. (1964) in 
which faster running and starting 
speeds were found to the partial stimu- 
lus, except their Ss ran twice as many 
trials to the partial as to the continuous 
stimulus. An analysis of the running- 
of Group 100-50 included 
all 12 blocks of trials and differential 
reinforcement as within-Ss compari- 
sons, as a between-group 
effect. The Trials X Reinforcement 
interaction was significant, F (11, 88) 
=241, p< 05. indicating that the 
trend toward faster running to S; 
late in training was reliable. The main 
effect of differential reinforcement was 
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significant as well, F (1, 8) — 22.15, 
the relation of con- 


ning speed, previously found between 
groups, Was evident in Group 100-50 
as a within-Ss effect as it was in the 
study by Amsel et al. 

Analyses of running- and goal-speed 
data in Terminal Blocks 
combined, for the effects of differential 
reinforcement, percentage of reward, 
and color, yielded highly significant 
mean-squares for the Reinforcement X 
Percentage of Reward interaction, EF 
(4, 40) = 3401 and 37.79, p < 001 
for running and goal speed, respec- 
$ substantiating the impression 
that percentage of reward produced 
systematically greater effects on speeds 
to S.* than to Sit. 

The variation in response speed to 
S} with percentage of reward to 
S,-, apparent in the graphed data 
was subjected 
to analysis over the last four trial 
blocks. goal 
speed 
linear 


to S, 


was depressed as à function of the per- 
centage of 
Therefore, the effects of continuous 
and partial reward were not independ- 
ent in this differential reinforcement 
situation; the effects of nonreward 
trials generalized to the continuous 


This “depressive” effect on S, speed 
raises the possibility that the faster 
running speed of Group 100-50 to S; 
reflects a decrease in speed to S; rather 
than an increase to S, the partial 
stimulus, as suggested in Amsel et al. 
(1964). Mean terminal running 
speeds in Group 100-50 to S, and S, 
were 2.72 and 2.85 ft/sec while speeds 
to comparable stimuli in Group 100-100 
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were both 2.86 ft/sec. However, if 
the decrement in speed to S, were the 
only factor contributing to this effect, 
the trend should have been greater in 
Groups 100-25 and 100-12 in which 
the decrement to S, was greater. 


Phase 2 


The data were analyzed in two ways. 
First, response speeds were analyzed 
directly at two stages of discrimination 
training, after response differentiation 
had stabilized initially (Trial Blocks 5 
and 6) and after a large number of 
trials (Blocks 14 and 15). Second, 
each S’s performance was rated against 
four criteria of discrimination ranging 
from easy to hard, 

Response speed.—Starting, running, 
and goal speeds to S,— and S,+ are 
presented in Fig. 2 as functions of per- 
centage of reward to S.= in Phase 1, 
the data in Blocks 5 and 6 combined 
in the middle panels and that in Blocks 
14 and 15 combined in the right-hand 
panels. Response speeds to S.+ ap- 
proached the same level for all groups 
in Phase 2 in Spite of differences at 
the termination of Phase 1. Early in 
Phase 2, response speed to S,— ap- 
proximated an inverted U-shaped func- 
tion of Phase 1 percentage of reward, 
By Blocks 14 and 15, response differ- 
entiation had increased in all groups 
but not uniformly so that response 
speed to S,— appeared to be inversely 
related to Phase 1 percentage of re- 
ward at this late Stage of training, 
Group 100-0 was more persistent to 
$,— than expected, its performance re- 
sembling that of partially rewarded Ss 
rather than those continuously re. 
warded (Group 100-100) as was 
predicted. 

Analyses of running- and goal-speed 
data in Blocks 5 and 6 supported the 
conclusion that response speed to S, 
was related to Phase 1 Percentage of 


reward while speed to S,--, the former 
Partial stimulus, was relatively ` un- 
affected, F (4, 40) = 14.91 and 23.34, 
p < 001, for running and goal speed, 
respectively. This trend continued 
until the end of Phase 2 where the ef- 
fects of Phase 1 percentage of reward 
on S, were evident in the interaction 
of percentage of reward and reinforce- 
ment, F (4, 40) = 24.65 and 9.50, 
$ < .001. 

Criterion analysis —Trials-to-crite- 
rion scores were generated for each k 
in the following way. For the running 
and goal measures separately, speed 
data for each S, for each day of Phase 
2, were examined for evidence of dis- 
crimination. An S received a “plus” 
for any day on which all S.+ speeds 
were faster than the best S, speed, 
otherwise, the day scored "minus." 
These data, indicating days on which 
each S$ discriminated, were assessed 
against a set of criteria varying from 
easy to hard; first of three consecutive 
"plus" days (3), first of four consecu- 
tive "plus" days (4), first of five con- 
secutive "plus" days (5), and the last 
day to be scored "minus" (L). Mean 
days-to-criterion, for each color-control 
subgroup, are presented separately for 
each criterion in Fig. 3 (running meas- 
ure) and Fig. 4 (goal measure). Sub- 
groups are indicated by the color of 
the S,4- stimulus, 

The predicted relations were approxi- 
mated closely by the "easy" criterion 
(3) data in which days-to-criterion 
Was an inverted U.shaped function 
of percentage of reward, the par- 
tially rewarded Ss discriminating later 
than the other two groups (Groups 
100-0 and 100-100). For successively 
harder criteria, mean days-to-criterion 
appeared to be an inverse function of 
Phase 1 Percentage of reward. The 
change in the shape of the function 
was due primarily to Group 100-0 
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PHASE 2 


5-6 BLOCKS 14-15 


10001225 50 


IN PHASE 1 


Fic. 2. Mean starting, running, 


termination of Phase 1, the early stage of Phase 


and goal speeds to S, and Ss for all groups at the 


2, and the termination of Phase 2, as 


functions of percentage of reward to S:+ in Phase 1. 


which began to discriminate early in 
Phase 2 but response differentiation 
was weak and unreliable. Analyses 
for the effects of percentage of reward 


and color on the criterion data indi- 
cated that percentage of reward pro- 
duced significant variation in rate of 
discrimination for both running, F (4, 
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Fic. 3. Days-to-criterion based on differ- 
ences in running speed to S,— and Sst for 
criteria varying from easy (3) to hard (L). 
(Performance of color-control groups are 


shown separately for conditions where black 
—Bd-—or white—W+—was positive.) 


40) = 676(3),7.74 (4), 17.33(5), and 
935(L), p<.001, and goal meas- 
ures, F (4, 40) = 606(3), 5.06(4), 
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groups are 
shown separately for conditions where black 
—B4—or white—W-+—was Positive.) 
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874(5), and 6.76(L), p < .01. Color 
reached significance once but then only 
as a weak interaction effect, 


Discussion 


Acquisition. Differences in response 
speed to S, and S,, which had stabilized 
over the last half of Phase 1, decreased 
as percentage of reward to the partial 
stimulus approached 100. Terminal re- 
sponse speed to the two discriminanda, 
and the relations between them, con- 
firmed predictions, 

There was no abrupt transition in 
speed to S— when some of the trials were 
rewarded. Rather, there was a sys- 
tematic increase in speed to S— as level 
of percentage of reward increased. The 
effect of the rewarded trials was greater 
on speed further from the goal as would 
be expected under the postulate (Amsel, 
1962) that a gradient of approach based 
on anticipatory reward, Tr, is not as 
steep as a gradient of avoidance based on 
anticipatory frustration, rp. In terms of 
the present findings, the simple discrimi- 
nation situation (S+, S—) does not ap- 
pear to have unique properties in deter- 
mining response speeds but may be 
viewed as a limiting case of partial re- 
ward where reward and nonreward are 
correlated perfectly with distinctive stim- 
ulus elements, 

The hypothesis that performance in 
Successive discrimination would be stim- 
ulus specific was not Supported by two 
results. First, response speed to S+ was 
related to the percentage of nonreward 
trials to S— indicating that generaliza- 
tion had occurred. Second, running 
speeds to S— should have exceeded those 
to S+ even at low percentages of reward 
as in simple conditioning (Weinstock, 
1958). 


The inhibiting effects of nonreward 
upon response speed to S+ requires ex- 
planation. A similar finding was re- 
ported by Davenport (1963). He found 
a “depression effect” in speed to the posi- 
tive stimulus on forced trials in a spatial 
discrimination based upon differential 
Percentages of reward, but the trend did 
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not reach significance when analyzed. 
Spear (1964), using a T maze, found 
no differences in goal-stem speed on 
forced trials to the positive stimulus as à 
function of magnitude or percentage of 
reward to the S-goal. 

In successive discrimination learning, 
Goldstein and Spence (1963) found no 
depression effect in speed to S-- as mag- 
nitude of reward was varied to S—. In 
that study, an effort was made to reduce 
the frustrative effects of differences in 
magnitude of reward by removing S from 
the goal immediately after the reward 
was consumed or the final photobeam was 
broken on nonreward trials. The use of 
a longer detention in the goal may have 
allowed frustration (Rp) to increase, 
rp to generalize, and its aversive proper- 
ties to produce a depression of speed to 
S+. MacKinnon and Amsel (1964) 
have shown that the drive-inducing ef- 
fects of nonreward, as measured by in- 
creased vigor of a second runway re- 
sponse, reach a maximum only after de- 
tention of about 15 sec. 

Amsel and Surridge (1964) have 
shown that the inhibitory effect of non- 
reward (and presumably the strength of 
rg) increases with magnitude of reward. 
This variable, along with time of deten- 
tion and characteristics of the apparatus 
that facilitate stimulus generalization, 
may in combination provide an explana- 
tion consistent with the trends reviewed 
above. 

The explanation favored by Amsel and 
Ward (1965, Exp. 5) for faster running 
speeds to the partial goal was basically 
the same one employed to account for 
this effect in the between-Ss design (Am- 
sel, 1958, 1962). In partial reward train- 
ing, rg — Sp occurs in the presence of a 
prepotent approach response and Sp, 
while it initially evokes avoidance, even- 
tually becomes part of the stimulus com- 
plex associated with approach. Early in 
the response chain, weak rp does not 
provide strong cues for avoidance but 
contributes to “excitement” and drive. 
Near the goal rp — Sr is strong enough 
to elicit avoidance and reduce the strength 
of approach. In the running segment of 
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the response chain, rg contributes to re- 
sponse strength through its drive-induc- 
ing properties so that running speeds are 
higher to a partial stimulus. In the goal 
segment, rp — sp is a source of compet- 
ing response tendencies as well as drive 
and the net effect is to reduce goal speed 
to the partial stimulus. 

While the effects of simple and dif- 
ferential reinforcement schedules cannot 
be compared directly because comparable 
schedules were not run between-Ss in 
the present experiment, faster running 
to the partial stimulus might have been 
expected in groups other than 100-50. 
Weinstock (1958) found faster running 
speed to percentages of reward as low 
as 17 and 33 than to continuous reward. 
The factor that could account for the 
absence of this trend in Groups 100-12 
and 100-25 is the generalization of Tr 
from S+ to S+. The addition of gen- 
eralized rp to S+ would tend to produce 
a stronger frustrative reaction (Ry) to 
nonreward than would be expected under 
similar percentages of reward (12 and 
25) in simple learning. As a result, the 
strength of fg — Sry based on strong and 
relatively frequent Rp, would maintain 
active avoidance further from the goal. 

PRE in discrimination —Performance 
in Phase 2 supported the hypothesis that 
a PRE may generalize to a stimulus not 
directly associated with partial reward. 
These results are consistent with previ- 
ous findings (Amsel & Ward, 1965; Ross, 
1964; Spear, 1964) which implicate some 
mechanism within S in the transfer of 
the PRE. Brown and Logan (1965) 
recently demonstrated a generalized par- 
tial-reinforcement effect (GPRE) based 
upon placement trials in a distinctive, 
“negative” goal. In all of these studies, 
the effect occurred to the former “Dosi- 
tive’ goal following the omission of re- 
ward. In Amsel’s theory generalization 
of the PRE can occur through the 
ry — Sp mechanism which becomes a part 
of any stimulus complex immediately 
antedating frustrative nonreward (Ross, 
1964). Other theorists have suggested 
that the important factor in the develop- 
ment of a PRE is what S learns to do 
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after nonreward, to not make excessive 
vgs (Logan, 1960) or to locate "extra 
attractions” (Lawrence & Festinger, 
1962). All could account for the gen- 
eralization of a PRE. 

The effects of specific levels of per- 
centage of reward upon resistance to ex- 
tinction to S,— in Phase 2 of the present 
experiment confirmed expectations for all 
groups but one, 100-0. These Ss, trained 
in a simple discrimination and then re- 
versed, showed a PRE nearly as strong 
as partially rewarded Ss although they 
had not been trained to approach a non- 
reward goal vigorously. That is, in 
Phase 1 rg — sp occurred in the presence 
of a weak, unreliable approach response, 
a situation that should result in poor 
conditioning of sp to approach and little 
resistance to extinction. 

Evidence from several earlier experi- 
ments (Birch, Allison, & House, 1963; 
Birch, Ison, & Sperling, 1960; Theios & 
Blosser, 1965; Theios & Brelsford, 1964) 
supported the position that continuous re- 
ward and discrimination training have 
similar effects on resistance to extinction 
of the response to S+. However, the 
present findings and those of Jenkins 
(1961) indicate that discrimination train- 
ing may increase resistance to extinction 
toS+. To account for the difference be- 
tween their findings and Jenkins’, Birch 
et al. (1963) suggested that the difficulty 
of the discrimination was the relevant 
factor and they noted, as a number of 
authors had earlier (Humphreys, 1939; 
Skinner, 1938), the parallel between 
trials prior to discrimination and par- 
tial reinforcement where the response 
is intermittently rewarded. They argued 
that the discrimination between hori- 
zontal and vertical lines in Jenkins’ study 
required a large number of trials, 
enough to produce a substantial PRE, 
whereas a much smaller number was re- 
quired for the brightness discrimination 
used in their experiment in which no 
PRE was found. Group 100-0 here 
learned a brightness discrimination 
quickly but resistance to extinction was 
still high. Consequently, factors other 
than difficulty must be involved. 


Brown and Logan (1965) suggested 
that a GPRE occurs following a small 
amount of simple discrimination training 
but not following a large number of trials 
during which learned responses to non- 
reward (which presumably support the 
PRE) could extinguish. The results of 
the present experiment and Jenkins’ 
would be consistent with this suggested 
relation, but those of Birch would not, 
for in Brown and Logan's schema all of 
these studies used a small number of 
training trials. 

Clearly, factors in addition to schedule 
of reward and number of training trials 
are required to account for the data. A 
set of factors, suggested by Amsel’s the- 
oretical framework (1958, 1962), are 
those related to the strength of goal re- 
actions (Rp, Ry) and generalization of 
their conditioned components (rp, rp) 
across discriminanda which could occur 
even after response differentiation. 
When generalization is precluded, as in 
between-Ss designs, the strength of the 
PRE is related to the schedule of reward 
and number of training trials (Amsel, 
1962). 
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SIMULTANEOUS PRACTICE, NUMBER, AND LOCUS OF 
IDENTICAL ITEMS IN ACQUISITION OF 
TWO SERIAL LISTS + 


DOUGLAS L. NELSON, WILLIAM E. SIMPSON,? 
AND W. J. BROGDEN 


University of Wisconsin 


Experimental groups acquired 2 serial lists of 8 2-digit numbers by 


practice on alternate trials, 


Identity between lists was 0, 2, 4, or 6 


and locus of identity occurred early, middle, or late. Control groups 
acquired 1 of the several lists. A supplementary control acquired a 


single list of 16 numbers. 


Difficulty of acquisition increased as degree 
of identity decreased. Simultaneous Practice at 0 


identity required 


for either list 3 times the trials and 4 times the errors required for 
single practice, Acquisition of both lists at 0 identity did not differ 
significantly from acquisition of the list of 16, but the form of the 


serial-position curves did differ, 
differed as a function of degree a 


The acquisition of two serial lists 
simultaneously by practicing each on 
alternate trials has been given little 
study. Wishner, Shipley, and Hurvich 
(1957) and Fischer (1964) have used 
this procedure in studying isolation 
and organization effects upon the form 
of the serial-position error curve. 
They compared acquisition of non- 
sense syllables under two conditions: 
(Cond. 1) instructions to learn a single 
list with letters homogeneous in ink 
color and type case; and (Cond, 2) 
instructions to learn two lists (first and 
second halves of the Cond. 1 list) that 
differed in ink color and type case. 
Condition 2 gave more efficient acqui- 
sition than Cond. 1 and the serial- 
position error curves were representa- 
tive of single lists, Fischer (1964) in- 
cluded a condition (Cond. 3), learn 
two lists, no isolation by color or type, 
and condition (Cond. 4), learn one 
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National Science Foundation and the Re- 
search Committee of the Graduate School 
from funds provided by the Wisconsin 
Alumni Research Foundation, 

? Now at Albion College. 


Form of the serial-position curves 
nd locus of identity. 


list, isolation by color and type. Con- 
dition 3 did not differ from the Control 
Cond. 1, but Cond. 4 showed the same 
differences from the control condition 
that was produced by Cond. 2. 

The acquisition of two serial lists by .- 
alternate practice may be related to 
several anchor or reference conditions, 
The sum of acquisition measures of 
the two may be compared with those 
of a single list composed from the two 
as has been done by Wisher et al. 
(1957) and Fischer (1964). It may 
be compared with sequential acquisi- 
tion as in the transfer paradigm AB. 
It may be related to the part condition 
of part-whole learning in which the 
first phase is the AB transfer paradigm 
and the second phase is whole learning 
of the parts. And the acquisition of 
each of the two lists in alternate prac- 
tice may be compared to the acquisition 
of each alone, for either position of 
the AB transfer paradigm. Measure- 
ment of single-list acquisition in alter- 
nate practice suggests the possibility 
of either facilitation or inhibition as a 
function of the interrelations of the 
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SERIAL POSITION AND ITEM LOCUS IN DIFFERENT LISTS 


items in the two lists and the kinds of 
materials used for items. 

The present study relates the acqui- 
sition of two serial lists simultaneously 
by practicing each on alternate trials 
to the control condition of learning 
each alone as a single list. It tests the 
hypothesis that difficulty of acquisition 
will increase as the degree of identity 
between the two lists is decreased. 
The serial-position locus of identical 
items in the two lists should reduce 
errors relative to conditions where the 
items are different in the two lists. 
Therefore, locus of identity in the two 
lists was varied systematically to test 
the hypothesis that the form of the 
serial-position error curves will vary 
within each identity level as a function 
of locus of identity. It was also hy- 
pothesized that the form of the serial- 
position error curves with locus of 
identity counterbalanced for each level 
will vary as a function of the degree of 
identity. 


METHOD 


Each S in the experimental conditions 
practiced two eight-item serial lists of two- 
digit numbers on alternate trials. Identity 
between the A and B lists was varied and 
consisted either of zero, two, four, or six 
identical items. These conditions are rep- 
resented by 12 Ss each and the groups are 
designated by the symbols 0, 2, 4, and 6. 
The A list was held constant and items of 
the B list were replaced with items from 
the A list by position to provide the appro- 
priate degree of identity. The locus of the 
identical items in the B list was varied for 
Groups 2, 4, and 6 and occurred in the 
early (E), middle (M), or late (L) posi- 
tion. For Locus E, identity began with 
Position 1 and ended with Positions 2, 4, 
or 6. For Locus M, identity involved Posi- 
tions 4 and 5, 3 through 6, or 2 through 7. 
For Locus L, identity ended always with 
Position 8 and began with either Positions 7, 
5, or 3. These subgroups and their condi- 
tions are designated by the symbols 2E, 2M, 
2L, 4E, 4M, 4L, 6E, 6M, and 6L. 

"There were two forms each for the A and 
B lists consisting of separate sets of numbers, 
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Half of the Ss in each condition learned 
Form 1 of the A and B lists and the other 
half learned Form 2. Furthermore, half 
of the Ss in each of the above conditions 
practiced the A and B lists in an ABAB 
... sequence and the other half practiced 
the lists in a BABA . . . sequence. 

There were four control groups of 12 Ss 
each representing the complete identity con- 
dition (8). One group learned List A only 
(8A) and a second group learned List B 
only (8B). For each of these groups, 
half of the Ss learned Form 1 and half 
learned Form 2. The two remaining groups 
learned B lists identical to those learned by 
the experimental groups for Identity Cond. 
2, 4, and 6 and Identity Locus Cond. E, M, 
and L. Nine Ss of Group 8B1 each learned 
List B Form 1 for each of the nine com- 
binations of identity and locus and three 
additional Ss were assigned at random to 
three of the nine combinations to give the 
total of 12. Group 8B2 was organized in an 
identical manner except that Form 2 of the 
B lists was used. 

The 96 Ss were students in classes of ele- 
mentary psychology who received points 
toward their grades for participation in the 
experiment. They were assigned randomly, 
in order of their appearance, to the various 
treatments of the experiment and to the two 
Es who were intentionally and inextricably 
confounded with all other treatment 
conditions. 

Construction of all lists of two-digit num- 
bers involved the following restrictions. 
Zero was not used. No digit was repeated 
in any two-digit number (e.g. 22). A given 
digit occurred once and only once in each 
digital position for each list. Series of 
ordered numbers were eliminated and the 
attempt was made to maintain a difference of 
two or greater between successive digits 
within each digital position. 

Slides of the numbers were prepared and 
were projected black on a white background 
by a Kodak Carousel automatic projector on 
a milk-glass screen in an aperture between 
the control and experimental rooms. The S 
was seated at a table such that the projection 
screen was approximately 30 in. from his 
face. The vertical height of the digits at the 
screen was 1 in. An auxiliary timer was 
used to provide a 2-sec. exposure of each 
item in the list, thus giving a 2-sec. rate of 
presentation. Slides of the leters A and B 
were used to identify the beginning of the 
A and B lists for the alternate practice 
conditions and A was used to identify the 


716 D. L. NELSON, W. E. SIMPSON, AND W. J. BROGDEN 


beginning of the single lists of the control 
conditions. The interlist or intertrial inter- 
val was 10 sec. An intercom system between 
the experimental and control rooms was used 
by E to give instructions to S and to record 
the vocal anticipatory responses made by S 
in serial learning. Failure to respond and 
responses of incorrect numbers were recorded 
as errors. 

Separate instructions were given to Ss 
assigned to the experimental and control con- 
ditions, For the experimental groups, the 
instructions described the serial anticipation 
method, identified the A and B lists and the 
alternate practice Procedure, and stated that 
the practice session would terminate when 
each list had been learned to a criterion of 
one errorless trial. For the control groups, 
the instructions described the serial anticipa- 
tion method for a single list and stated that 
practice would terminate when a single error- 
less trial was reached. A preliminary prac- 
tice session with a list of three adjectives 
was completed by all Ss before starting the 
experiment proper. 


RESULTS 


The experimental design provides 
for a series of statistical analyses of 
trials and errors to the acquisition cri- 
terion. Analysis of variance was used 
in each case and when repeated meas- 
ures were involved, the Procedure fol- 
lowed was that of Winer (1962). 
Where appropriate, analysis of vari- 
ance was followed by Tange tests 
(Duncan, 1955) of differences be. 
tween means, All analyses of error 
data Provide an evaluation of serial 
Position and its interactions with other 
Sources of variation. The 5% level 
is the criterion of significance for all 
Statistical tests, : 


The four control groups, 8A, 8B, 


8B1, and 8B2, differ from each other 
only in the two-digit numbers compris- 
ing the single list learned. Analysis 
of trials and errors shows no signifi- 
cant differences between group means, 
In addition, the analysis of errors 
shows no significant differences in the 
form of the serial-position error curves, 
It, therefore, is appropriate to use the 
data from any one of the control 
groups to represent the complete iden- 
tity condition, However, only the 
data from Groups 8A and 8B are used 
in the analyses to be reported. 

The experimental design permits 
examination of two kinds of relation- 
ships in the data of the experiment. 
One is acquisition of a single list as a 
function of the degree of identity be- 
tween two lists acquired simultane- 
ously. The other relates acquisition 
measures of learning two lists simul- 
taneously as a total task to the degree 
of identity between these. Table 1 
presents the group means for both of 
these relations. Separate analyses of 
variance of the List A and List B data 
show identity to be a significant source 
for both trials and errors. Difficulty 
of acquisition increases as the degree 
of identity between Lists A and B de- 


TABLE 1 


Group MEANS or TRIALS AND ERRORS AS A 
Function or IDENTITY Conpit1ons 


Identity 

SE ia ete 
0 | 2 | 4 | 6 | 8 

Trials 
List A | 23.08 18.67 19.58 9.67 742 
List B | 22.08 18.58 17.33 9.92 8.50 
A+B 45.16 3725 36.91 19.59 7.96 
——— rH 39.91 19.59 7.96 

Errors 
List A |105.67 76.92 70.08 30.92 26.92 
List B | 103.42 69.50 68.00 30.91 25.83 
A+B | 209.09 146.42 138.08 61.83 26.38 
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creases. Maximum identity is repre- 
sented by learning a single list as was 
done by Control Groups 8A and 8B. 
The results of range tests are repre- 
sented in the table by lines drawn be- 
neath the rows of means. The means 
above a solid line do not differ signifi- 
cantly, The breaks in the line indicate 
significant differences between sets of 
means. For trials there are two sets 
of means and for errors there are three 
sets of means significantly different 
from each other. There are no signifi- 
cant differences between comparable 
conditions of the A and B lists. Se- 
quence of learning, ABAB... vs. 
BABA ... is not significant. 

Analyses of variance of the A+ B 
data were restricted to Identity Cond. 
0, 2, 4, and 6, Identity is a significant 
source of variation for both trials and 
errors. For trials there are two sets 
of means and for errors there are three 
sets distinguished by range tests. The 
means for Identity Cond. 8 in the 
A + B rows of Table 1 are the means 
for trials and errors of the Ss of 
Groups 8A and 8B combined to make 
a single group. The significance of the 
differences of these means from the ap- 
propriate means for Identity Cond. 6 
was determined by £ tests. Thus the 
functions represented by the A+B 
rows of Table 1 show an increase in 
difficulty of acquisition as degree of 
identity decreases. 

The error analyses permit examina- 
tion of the effects of identity and locus 
of identity upon serial-position errors. 
The interaction of Identity X Serial 
Position is significant for the separate 
analyses of List A and List B, and for 
the analysis of the A + B data. The 
results of the separate List A and B 
analyses are presented in Fig. 1 in 
order to include the serial-position er- 
ror curves for Groups 8A and 8B. 
The relation of degree of identity to 
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Fic. 1. Form of serial-position error 
curves as a function of degree of identity of 
two serial lists acquired simultaneously. 


form of the serial-position error is the 
same for the List A, List B, and A+B 
data. Form of curves can be consid- 
ered independently of absolute magni- 
tude of error when the curves are 
plotted in terms of percentage of error. 
Figure 1 includes serial-position curves 
in terms of both raw error and per- 
centage of total error for each condi- 
tion. Change in curve form as a func- 
tion of degree of identity appears to 
be of two kinds. One is change in the 
location of maximum error. The peak 
has a tendency to be displaced toward 
later serial positions as degree of iden- 
tity increases. The other change is 
perhaps more prominent and is an in- 
crease in error at the terminal serial 
positions as identity increases. It can 
also be stated as an increase in the 
effect of primary in reducing error as 
degree of identity increases. 

Since locus of the identical items in 
the A and B lists could be varied only 
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within Identity Cond. 2, 4, and 6, sepa- 
rate analyses of variance were con- 
ducted on the subgroups of these three 
groups for the List A, List B, and 
A+B data. The Identity x Locus 
of Identity interaction is not significant 
in any of these analyses, thus indicat- 
ing no difference in difficulty of acqui- 
sition as a function of locus within any 
given identity level. Variation in form 
of the serial-position error curves 
within identity conditions is indicated 
by a significant triple interaction of 
Identity X Locus X Serial Position. 
This is the case for List B and A +B 
analyses, but is not so for the A list 
analysis. The quadruple interaction of 
Identity Locus x Serial Position x List 
from the A + B analysis is not signifi- 
cant, thus indicating no reliable differ- 
ence for the triple interactions of Iden- 
tity X Locus x Serial Position between 
Lists A and B. Examination of the 
serial-position error curves for the List 
A and List B data shows substantial 
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agreement of locus effects within Iden- 
tity Cond. 2 and 4, but considerable 
disagreement for Identity Cond. 6. 
Each set of these curves is complex 
and shows no clearcut effect of locus 
of identity. Therefore, the data of the 
A+B analysis are presented in Fig, 
2 which gives the serial-position curves 
in terms of percentage of error. The 
curve of A+B data for Group 0 is 
plotted in each graph of locus and 
identity for comparative purposes, 
The effect of locus of identity for Iden- 
tity Cond. 2 and 4 is to reduce error 
at the serial positions involved at the 
expense of increased error at other po- 
sitions. For Loci E and L of Identity 
Cond. 6, the effect appears to be a re- 
duction of error for the two serial 
Positions that are different, with com- 
pensatory increase of error at some of 
the identical positions, 

When Group C is used as a refer- 
ent, its means for trials (29.83; cy 
— 2.03) and for errors (243.75; om 
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Fic. 2. Effect of locus of identical items upon the form of serial-position 
error curves for Identity Cond. 2, 4, and 6. 
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= 15.95) are not comparable to the 
A + B acquisition measures for Groups 
0, 2, 4, and 6 (see Table 1). The 
latter measures are the sum of the A 
list and B list trials and errors to sepa- 
rate criterial trials for each list of eight 
two-digit numbers. The criterion trial 
for A and B list acquisition is different 
for many Ss. If the alternate practice 
condition is treated as learning a list 
of 16 numbers, then half the Ss learned 
an A+B list and the other half 
learned a B+A list. When the 
Group C criterion of one errorless trial 
for 16 two-digit numbers is applied to 
the data for Groups 0 and 2, the crite- 
rion is met by eight and four of the 
Ss of these groups, respectively. The 
four Ss in Group 0 that did not meet 
the criterion, met a criterion of one 
error on the last trial. Of the eight 
Ss in Group 2 failing to meet the crite- 
rion of one errorless trial, four met a 
criterion of one error, two met a crite- 
rion of two errors, and two met a 
criterion of three errors on the last trial 
of practice. The means calculated on 
these data underestimate the difficulty 
of learning to the criterion of one er- 
rorless trial fully met by all Ss of 
Group C. They are 24.41 trials (om 
= 2.37) and 210.00 errors (ew = 
25.62) for Group 0 and 19.33 trials 
(oy — 221) and 148.92 errors (ex = 
2241) for Group 2. The trial and 
error means for Group 2 are signifi- 
cantly different by f test from the re- 
spective means of Group C, but those 
for Group 0 do not differ significantly 
from those of Group C. 

Analysis of variance of the error 
data for Groups 0 and C shows a sig- 
nificant Group X Serial Position inter- 
action, thus demonstrating a difference 
in the form of the serial-position error 
curves, These curves are presented in 
Fig. 3 and are plotted both in terms 
of raw error and percentage of total 
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Fic. 3. Comparison of serial-position 
curves for acquisition of two lists of 8 two- 
digit numbers by alternate practice and of a 
single list of 16 two-digit numbers. 


error. The curve for Group 0 appears 
as two serial-position error curves in 
sequence, one for the first eight num- 
bers and the second for the last eight 
numbers. Relative to the curve for 
Group C, the reductions in error for 
Positions 7 and 8 appear to be compen- 
sated by increases in error at earlier 
positions. The reductions in error at 
Positions 9 and 10 appear to be com- 
pensated by increases at later serial 
positions. 

The form of the serial-position 
curves for the two halves of the list 
for Group 0 appears to be the same, 
and is not significantly different sta- 
tistically. These curves also appear to 
have a form similar to that of the 
Group C curve, insofar as form of 
curves for lists of different length can 
be judged. A test of this relation is 
presented in the lower half of Fig. 3, 
which presents the data both in raw 
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error and percentage of error. The 
curve for Group C is reduced from 
16 to 8 serial positions by summing 
errors for successive pairs of positions 
(ie, 1+2, 3-- 4, etc.). The curve 
for Group 0 sums the errors for the 
first and second halves of the list (ie, 
1+9, 2--10, etc). In an analysis 
of these data, the interaction of Groups 
X Serial Position is not significant. Tt 
is clear from reexamination of Fig. 1 
that the serial-position curves for the 
A and B lists of Groups 2, 4, and 6 
would show a relation to the curve of 
Group C similar to that for curves of 
Group 0 in Fig. 3, if they were con- 
structed in a similar manner. 


Discussion 


Difficulty in simultaneous acquisition 
of two serial lists by practice on alter- 
nate trials depends in part upon relations 
between the items of the two lists, their 
organization, and the standard of refer- 
ence, If the referent is learning a single 
list, ie. complete identity on alternate 
trials of practice, then difficulty in acqui- 
sition increases as the degree of identity 
decreases, and the relation is similar for 
the two alternate lists, Acquiring two 
lists Simultaneously with zero identity 
between them requires for either list 
three times as many trials and produces 
four times as many errors as acquiring 
one of the lists alone. If the referent is 
learning two lists simultaneously, ie. 
the sum of acquisition measures for two 
lists, each acquired to its Separate crite- 
rion, then the relation of increase in dif- 
ficulty with decrease in identity of the 
two holds over all two list conditions. 
The complete identity condition of ac- 
quiting a single list is not precisely rele- 
vant to the A +B measures of the pres- 
ent experiment, since identity for the 
Single-list condition is confounded with 
a lesser amount of material. If for the 
single-list conditions, Ss had been re- 
quired to meet a second criterial trial, 
then measures of trials and errors to this 
latter criterion would represent complete 
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identity and be compatible to the A+B 
measures of the experimental groups, 
However, the difference in errors and 
trials between a criterion of one trial 
correct and two trials correct probably 
would be numerically small and unreli- 
able statistically. The use of the single- 
list measures in Table 1 as the end point 
of complete identity for the A +B meas- 
ures involves some degree of underesti- 
mation. As such, however, the trials 
and errors for A+B, Group 0 are ap- 
proximately six times and eight times 
larger, respectively. 

The alternate practice condition for 
two lists has another referent in acquisi- 
tion measures for all items organized as 
a single list. The appropriate control 
here would involve acquisition of lists 
of two-digit numbers made up of A+B 
and B+ A from Groups 0, 2, 4, and 6, 
and of A+A and B +B from the sev- 
eral Group 8 conditions. Such a control 
was not included in the present experi- 
mental design. The use of Group C pro- 
vided an estimate of such a referent for 
the data of Groups 0 and 2 when treated 
insofar as possible in terms of a criterion 
of zero error for a 16-item list. The 
failure to find significant differences in 
difficulty of acquisition between Groups 
C and 0 for overall measures is consonant 
With the results obtained by Fischer 
(1964) for her Cond. 3, learn two lists, 
no isolation by type color and case. The 
serial-position curve for this condition, 
however, did show a dip in the middle 
(representing the end of the first list and 
the beginning of the second), but not to” 
the degree of representing two separate 
serial-position curves as reported by 
Wishner et al. (1957) for their Cond. 2 
(instructions to learn two lists that dif- 
fered in ink color and type face). Fischer 
replicated the Wishner et al. results for 
Cond. 2 on curve form and of more effi- 
cient acquisition than for Control Cond. 
1 (learning a single list of the same items 


! 


homogeneous in ink color and type face). - 


Compared to the differences between 
Cond. 1 and 2 of the above authors, our 
Group 0 differs from Group C in instruc- 
tions to learn two lists rather than one, 
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in there being an interlist interval of 10 
sec. rather than none, and in designating 
“the beginning of each list by stimulus 
presentations of A and B. These condi- 
tions of organization produce a form of 
serial-position curve comparable to the 
form obtained by Wishner et al. and by 
Fischer for their Cond. 2. They are not, 
however, as effective as organization of 
the two lists in terms of type color and 
form in providing more efficient acquisi- 
tion than that produced by the control 
condition. 

The change in form of serial-position 
curves for simultaneous practice of two 
lists as a function of degree of identity 
is similar whether the curves are con- 
structed from the A and B list data alone, 
or from the sum of the A and B lists. 
These changes in form are independent 
of the absolute number of errors (which 
increase as degree of identity decreases) 
as is shown by the curves plotted in 
terms of percentage of error. These re- 
sults suggest that the hypothesis of Mc- 
Crary and Hunter (1953), that serial- 
position curves are of the same form over 
a wide range of difficulty of verbal mate- 
rial when plotted in terms of percentage 
of error, does not hold for variation in 
difficulty of simultaneously learning two 
lists. 

The effect of locus of identical items 
within identity condition on the form of 
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serial-position curves confirms the hy- 
pothesized effect of this variable for 
Identity Cond. 2 and 4. The reduction 
of error at positions of identical items, 
compensated by increases of error at 
other positions, is similar to the effect 
upon serial-position curves of pleonasms 
within a single serial list (Ernst, Thomp- 
son, & Brogden, 1962). The effect of 
locus within Identity Cond. 6 is unclear, 
and may reflect an effect of the two items 
that are different in the A and B lists 
rather than of the six identical items. 
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EFFECTS OF NUMBER AND 


SIMILARITY OF PRETRAINING 


NCE 
ALTERNATIVES ON PAIRED-ASSOCIATE PERFORMA 
ON PRETRAINED AND NEW ITEMS UNDER COR- 
RECTION AND NONCORRECTION PROCEDURES? 
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University 


or by 


Recent efforts to demonstrate that 
paired-associate (PA) learning is fa- 
cilitated by various types of familiari- 
zation or discrimination pretraining 
(PT) on individual stimulus (S) 
and/or response (R) components of 
the PA pairs have created some incon- 
sistent and often inexplicable results. 
Pretraining tasks requiring either ob- 
servation or repetition of individual 
items have produced either slightly 
positive, null, or negative effects on 
subsequent PA learning of pairs con- 
stituted from PT items (eg., Goss & 
Nodine, 1965; Noble, 1963; Under- 
wood & Schulz, 1960). Likewise, 
verbal-discrimination (VD) pretrain- 
ing with the same pairs as used in the 
PA list has led to either facilitation 
(Spear, Ekstrand, & Underwood, 
1964) or no effect (Battig & Brackett, 
1963; Battig, Williams, & Williams, 

1 This research was supported by Public 
Health Service Research Grant HD-01062 
from the National Institute of Child Health 
and Human Development. Robert M. Voytas 


also assisted in the preparation of materials, 
and in running Ss from one replication. 


of Maryland 


or 8-pair list under 
ly devised correction 
-item pretraining (PT) 
a set of 2 or 8 alterna- 
intraset similarity between correct 
no PT. The 16-item PT lists in- 


1962), whereas the latter studies both 
have shown consistently negative ef- 
fects if the items are paired differently 
in the VD and PA lists, 

Prior free-recall learning of indi- 
vidual PA items has produced the most 
consistent and sizable PA facilitation 
(e.g., Battig, Merikle, & Schild, 1965; 
Epstein, Rock, & Zuckerman, 1960; 
Underwood, Runquist, & Schulz, 
1959), although comparable facilitative 
effects recently have been shown from 
multiple-choice recognition (MCR) 
pretraining procedures requiring that 
each item be selected from a large set 
of alternatives which are similar to 
one or more of the correct pretrained 
items (Battig et al., 1965; Thompson, 
1964, 1965). Battig (1966) has inter- 
Preted these MCR effects in terms of 
a general principle stating that intra- 
task interference represents an impor- 
tant source of intertask facilitation. 
Thus the positive MCR effects on PA 
performance may result from the siza- 
ble intratask interference produced by 
the large number and/or high simi- 
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larity of incorrect alternatives within 
the MCR task. The present experi- 
ment was thus undertaken principally 
to provide a direct test of the effect of 
these sources of MCR intratask inter- 
ference upon subsequent PA perform- 
ance, by independently varying both 
number and similarity of incorrect PT 
alternatives. Other manipulated vari- 
ables included length of the PA list 
and proportion of pairs therein consist- 
ing of PT items, both of which have 
differed substantially among previous 
studies in a manner suggesting their 
possible relevance to the nature and 
magnitude of PT effects on PA per- 
formance, To assess the permanence 
of any differential effects produced by 
the various PT and/or PA conditions, 
several 24-hr. retention measures also 
were obtained. 

The present design also provided for 
a systematic evaluation of performance 
under the usual noncorrection (NC) 
PA procedure, as compared with a 
newly developed correction (C) proce- 
dure (Battig, 1965) which equates 
each individual PA pair in number of 
correct responses rather than number of 
presentations. Interest in this (C) pro- 
cedure derives from its potential 
advantages as to manipulation and 
control of degree of learning for each 
individual PA pair, thereby greatly 
reducing the difficulties of estimating 
terminal learning level that have so 
severely complicated retention and 
transfer research (Underwood, 1964). 
While no specific interactions were 
expected between the C-NC and other 
variables in the present experiment, the 
new C procedure can be evaluated sat- 
isfactorily only through its comparison 
with the NC procedure under sys- 
tematic variations in other significant 
PA variables. Consequently, the pres- 
ent is one of a series of recent experi- 
ments including this type of C-NC 
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comparison (Battig, in press a; Meri- 
kle, Manatis, & Battig, 1965).? 

C=correction PA procedure (also used 
for PT), whereby all items equated in num- 
ber of correct responses. 

Hi=incorrect PT alternative(s) highly 
similar to correct one. 

Irr = irrelevant conditions, with no PT 
items appearing in PA list, also designated 
by (0). 

Lo = incorrect PT alternative(s) mini- 
mally similar to correct one. 

Mix = mixed conditions, with half (8) of 
the PT items constituting 4 PA pairs, also 
designated by (4). 

MCR = multiple-choice recognition PT 
procedure (correct item presented with sev- 
eral similar incorrect alternatives). 

NC=standard noncorrection PA  pro- 
cedure, whereby all items presented equal 
numbers of times. 

P pairs— PA pairs consisting of correct 
items from PT task. 

PT = individual item pretraining condi- 
tions given prior to paired-associate (PA) 
learning. 

Rel = relevant conditions, with all 16 PT 
items constituting 8 PA pairs, also designated 
by (8). 

U pairs— PA pairs consisting of new 
items not appearing in PT list. 

VD = verbal-discrimination PT procedure 
(correct item presented with one usually dis- 
similar incorrect alternative). 

2 or 8 PT conditions — number of alterna- 
tives (2 or 8) appearing in each PT set. 

12 or 8 pair = number of pairs (12 or 8) 
constituting PA list. 
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Subjects and design.—Each of 208 Ss was 
assigned unsystematically to 1 of 52 groups 
of four Ss each, representing the 48 combina- 
tions of 4 PT with 12 PA conditions plus 4 
control (no PT) groups. 

The four PT conditions all required that 
a single correct CVC trigram be selected 
from a set that included also either one or 


2 The complexity of the present results and 
interpretations regrettably will produce a 
heavy strain on the reader's information- 
receiving capacity. In the hope that the 
contribution thereto of the necessary large 
number of abbreviations may thus be mini- 
mized, a complete list of all nonstandard 
abbreviations employed throughout this paper 
is given below in alphabetical order. 
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seven incorrect CVC alternatives (arranged 
in two columns of four each) that were 
either of high (Hi) or low (Lo) similarity 
to the correct CVC. The 8-Hi PT condi- 
tion thus corresponded closely to previous 
MCR PT procedures, in that each correct 
CVC (eg, FEB) was presented with the 
seven incorrect items involving substitution 
of a single letter for either the first (ves), 
second (FIB), or third letter (rep), for any 
two of these letters (vis, vED, FD), and for 
all three (vm). The latter incorrect CVC 
was always presented along with the correct 
one in the 2-Lo PT condition (e.g., FEB-VID), 
as has characterized most previous VD pro- 
cedures. The 2-Hi PT condition involved a 
single incorrect CVC differing from the 
correct one by a single letter (e.g., FEB-vEB), 
each of the three positions of this changed 
letter being used equally often across the 
various PT items. Finally, the 8-Lo PT 
condition included the correct CVC along 
with seven mutually dissimilar incorrect ones 
(e.g, JEB, KOQ, KUH, SUR, VAF, YIJ, yup) all 
of which were incorrect alternatives taken 
from the 8-Hi sets for seven other correct 
CVCs, 

The 12 PA conditions represented all pos- 
sible combinations of (a) either 12- or 8-pair 
lists; (b) all 16, 8, or 0 of the PT items 
being used to constitute, respectively, 8, 4, or 
0 of the PA pairs, designated as relevant 
(Rel), mixed (Mix), and irrelevant (Irr) 
conditions; and (c) correction (C) or non- 
correction (NC) PA procedures, as de- 
scribed more fully below. The six combina- 
tions of list length and number of pairs of 
PT items are designated as 12(8), 12(4), 
12(0), 8(8), 8(4), and 8(0), respectively. 
The four control groups learned 12- or 8-pair 
PA lists under C or NC procedures without 
any prior PT practice of any kind. 

A predetermined unsystematic order for 
assignment of Ss to groups, on the basis of 
order of appearance in the laboratory, pro- 
vided four independent replications of 40 Ss 
each (each replication including one $ from 
each group except for 12 of the 16 Irr PT 
Eroups). Three additional Ss subsequently 
were run in each of the latter groups, to 
check on preliminary indications of differen- 
tial nonspecific transfer effects from the 
four PT conditions. Two different Es ran 
the first two replications concurrently in 
Md iens roe of these Es subse- 
quently ran the last two replicati 
the additional 48 Irr Soe ““?ications and 
prasad 


" 3 Both PT and PA performance were con- 
Sistently and sometimes significantly better 
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Materials—The same pairs of Archer 
(1960) 75-85% CVC items, selected from 
Battig and Brackett’s (1963) low-similarity 
list, constituted the PA lists for all Ss. 
The 12 pairs were divided into three subsets 
of four each, previously demonstrated to be 
equivalent in difficulty (Battig & Brackett, 
1963). Two of these subsets comprised the 
8-pair list, but the CVCs from one subset 
appeared as correct PT alternatives both for 
Rel and Mix PT conditions, whereas the 
other subset was used only for the Rel PT 
list. The third PA subset appeared only in 
the 12-pair list, and never in the PT task. 
The irrelevant correct PT items used in Mix 
and Irr PT conditions were 16 CVCs com- 
parable in association value but maximally 
dissimilar to all PA items. None of the PA 
or correct PT items ever appeared as an 
incorrect alternative in any of the PT sets, 
and no incorrect alternative was used in 
more than one PT set within the same list. 

Procedure.—During the first 5 min, all 
Ss served in a short-term memory experi- 
ment involving successive presentation and 
testing of four single high- or low-frequency 
English words, as reported in detail else- 
where (Turnage, in press). Performance 
on this task correlated only .19 with appro- 
priately corrected PA performance measures, 
and showed no indication of any effects on 
either PT or PA performance. 

All PT conditions began with the suc- 
cessive individual presentation of each of the 
16 correct CVCs by Kodak Carousel auto- 
matic slide projector at a 4-sec. rate, fol- 
lowed immediately in an unsystematically 
different order by the appropriate 16 sets of 
alternatives. For each set, S was given a 
maximum of 10 sec. to select the one CVC 
that had been presented previously, and to 
indicate its position on the slide, based upon 


for Ss run by the latter E. These Ss were 
Primarily regular-session introductory psy- 
chology students fulfilling a course require- 
ment, but included also 38 paid volunteers 
obtained through the student placement serv- 
ice (each of the latter being assigned to a 
different group). Differences between these 
Ss and the 40 summer-session students run 
by the first E probably account for the E 
discrepancies. Fortunately, the effects of the 
various manipulated variables were compar- 
able across the four replications (as indicated 
by the absence of significant interactions of 
these variables with replications), so that 
the results and analyses to be reported are 
free from contamination by differences be- 
tween Es and/or Ss run by the two Es. 
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a diagram posted next to the projection 
screen that depicted the verbal label for each 
slide position, All correctly selected sets and 
individual CVCs were then temporarily re- 
moved from the list, so that only those CVCs 
not identified correctly were presented again. 
This second presentation again was followed 
by the appropriate sets of alternatives, dif- 
fering from the previous sets only in the 
positions of correct and incorrect alternatives 
on the slide. Once again all correctly se- 
lected sets and individual CVCs were re- 
moved from the list, and the same procedure 
was repeated with the reduced list until S 
had selected each of the 16 correct CVCs. 
Then all 16 items and sets were presented 
again, and the entire procedure was repeated 
through four complete cycles until each 
correct CVC had been selected exactly four 
times. To prevent identification of correct 
CVCs by position in the set, the two slides 
differing in position of alternatives for each 
set were interchanged unsystematically across 
all presentations after the first two, and each 
of the eight slide positions was used equally 
often for the correct CVC within 8-Hi and 
8-Lo PT conditions. 

Immediately after completion of the PT 
task, appropriate PA instructions were given 
which stated that some of the PA items 
might be the same as those from the PT 
task. Under the NC procedure, each trial 
began with the successive presentation of 
all pairs by automatic slide projector at a 
4-sec. rate. After a 4-sec. interval, all S 
terms were presented alone in a different 
order also at a 4-sec. rate while S attempted 
to pronounce the correct paired R term. 
The C procedure differed only in that each 
correctly recalled pair was temporarily re- 
moved from the list, and only the incorrect 
pairs were presented and tested again for 
recall, this procedure being repeated with 
the reduced list until each pair had been re- 
called correctly once. Three cycles of this 
C procedure were repeated, so that all PA 
items were recalled correctly exactly three 
times. To equate C and NC procedures in 
total number of pairs recalled correctly, all 
NC Ss learned to a criterion whereby total 
number of correct recalls summed over pairs 
and trials was approximately three times the 
number of pairs in the PA list. 

Except for the last 48 Ss run under Irr 
PT conditions, all Ss were given a series of 
retention tests on the following day, consist- 
ing in order of (a) attempted written recall 
of both the S and R terms constituting the 
PA pairs, (b) presentation on a dittoed 
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sheet of all PA S terms in alphabetical order 
for attempted R-term written recall, and 
(c) a recognition test. The latter censisted 
of a dittoed list including all PA and cor- 
rect PT items, along with one incorrect 
alternative from the 8-Hi PT sets for each 
PT item, and 8 CVCs not appearing in any 
PT or PA list. Each of these 80 CVCs was 
to be identified as (a) a correct PT item, 
(b) a PA item, (c) an incorrect PT alter- 
native, or (d) not appearing in either list. 


RESULTS AND DISCUSSION 


PT performance.—Significantly fewer 
PT errors were made under 2-Hi 
(2.9) and 2-Lo (1.3) conditions than 
under 8-Hi (84) and 8-Lo (10.6) 
conditions, F (1, 144) = 87.69, p< 
.001. Furthermore, the inferiority of 
2-Hi to 2-Lo as contrasted with the 
superiority of 8-Hi over 8-Lo resulted 
in a significant 2-8 x Hi-Lo inter- 
action, F (1, 144) = 600, p< 02. 
Performance on the Irr PT list (4.6) 
was also significantly superior to the 
combined Rel and Mix lists (6.4), 
F (1, 144) = 4.55, p < .05, probably 
reflecting the lesser difficulty of items 
constituting the Irr list. However, 
nonsignificant effects were obtained 
both for the overall Hi-Lo comparison 
(F <1), and for subgroups represent- 
ing different PA conditions but with 
the same PT condition (F = 1.24). 

Overall PA performance.—Total er- 
rors on the PA list were subjected to 
analyses of variance, as well as an 
analysis of covariance based on the 
deviation of each S’s PT errors from 
his PT condition mean. Besides the 
expected superiority of 8-pair (19.4) 
over 12-pair lists (27.8), F (1, 141) 
= 14.51, p < .001, PA errors also in- 
creased significantly from Rel (19.8) 
to Mix (23.9) and Irr (26.8) condi- 
tions, F (1, 141) =631, p< 02. 
Control-group performance (25.2) was 
insignificantly (F < 1) superior to Irr 
conditions. In sharp contrast with the 
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previous results summarized in the 
introduction, insignificantly (F « 1) 
fewer PA errors were made following 
2-Hi (21.5) and 2-Lo (22.6) than 
8-Hi (22.9) or 8-Lo (26.9) PT con- 
ditions. The only other effect in this 
analysis attaining even marginal sig- 
nificance was a third-order 2-8 x 
Hi-Lo x Rel-Mix-Irr interaction, F 
(2, 141) =3.15, p < .05, reflecting a 
marked Rel (15.3) superiority over 
Mix (26.1) and Irr (26.9) conditions 
for 2-Lo and 8-Hi PT groups, as con- 
trasted with the near equivalence of 
Rel (24.3) to Mix (217) and Irr 
(26.7) conditions for 2-Hi and 8-Lo 
groups. 

PA subset comparisons.—Compari- 
sons of PA performance on pairs which 
did and did not consist of pretrained 
items, designated, respectively, as pre- 
trained (P) and unfamiliar (U) pairs, 
indicated the overall PA effects to re- 
flect differences in relative perform- 
ance on subsets constituted of P and 
U pairs. Fewer errors were made by 
both Rel (7.8) and Mix (8.8) than by 
Irr (113) groups on the PA subset 
representing P pairs for the former 
two conditions and U pairs for the 
latter, F (1, 144) = 747, p< 01. 
Likewise, the PA subset Consisting of 
P pairs only for Rel conditions elicited 
fewer Rel (8.8) than Mix (10.4) or 
Irr (10.8) errors, although this dif- 
ference was marginally significant 
(p = 05) only by analysis of co- 
variance, 

However, the third PA subset in the 
12-pair list (which always represented 
U pairs) also yielded significantly 
fewer errors by Rel (6.3) than Mix 
(9.3) or Irr (9.4) conditions, F (1, 
72) = 593, p < 025. This latter indi- 
cation that differential PT effects were 
localized in the U pairs which had not 
been Pretrained, rather than in the ac- 
tually pretrained P pairs, was corrobo- 
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TABLE 1 


Mean WirHIN-S ERRORS PER PA Pair on 
PRETRAINED (P) AND New (U) Pairs 


PT cond. 
2-Hi | 2-Lo | 8-Hi | 8-Lo | Total 
12(8) 
P 2.67 | 1.92 | 1.88 | 2.39 | 221 
U 1.50 | 1.22 | 0.88 | 2.63 | 1.56 
12(4) 
P 1.84 | 2.78 | 2.25 | 2.19 | 227 
U 1.81 | 3.03 | 2.44 | 2.70 | 2.50 
8(4) 
E 2.16 | 1.88 | 2.69 | 1.84 | 2.14 
U 2.13 | 2.50 | 3.03 | 2.53 | 2.55 
Total 
A 2.22 | 2.19 | 2.27 | 2.14 | 221 
U 181 | 2.25 | 2.11 | 2.62 | 220 


rated by the results of within-list P-U 
comparisons for the 12(8), 12(4), and 
8(4) conditions that included pairs 
of both types. As shown in Table 1, 
differences between P'T conditions over 
the three mixed lists in U-pair per- 
formance were consistently larger and 
led to a significantly greater between- 
conditions variance for U than for P 
pairs, F (11, 11) = 3.94, p < .05. 
The most pronounced difference in 
Table 1 represents the consistent 
U-pair superiority for 12(8) condi- 
tions, which alone involved fewer U 
than P pairs in the list. Not only was 
this superior to 12(8) P-pair perform- 
ance, but also to that by 12(4) and 
8(4) groups on P as well as U pairs, 
leading to a significant P-U x Lists 
interaction, F (2, 72) = 9.67, p < .001. 
Moreover, the lesser P-pair superiority 
both for 12(4) and 8(4) lists was in- 
sufficient to counteract the U supe- 
riority for the 12(8) list, so that over- 
all mixed-list P performance was actu- 
ally slightly but not significantly 
(F <1) below that on U pairs. 
This finding that performance on U 


| 
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pairs is significantly enhanced, when 
these constitute the minority (1/3) of 
a mixed list, is suggestive of a kind of 
"isolation" effect, whereby S's atten- 
tion is attracted to pairs consisting of 
new previously unseen items when 
there are relatively few of these. 
Moreover, since mixed-list transfer 
comparisons typically have included a 
minority of new (C-D control) pairs 
(eg. Kausler & Kanoti, 1963; Twedt 
& Underwood, 1959), what has been 
identified as negative transfer under 
these conditions may instead be an 
artifact of a facilitation on the small 
number of new C-D control pairs and 
the spuriously high zero-transfer base 
line provided thereby. Such a mi- 
nority of control pairs as a mixed-list 
base line was used in both previous 
demonstrations of significant negative 
transfer from VD pretraining corre- 
sponding to the present 2-Lo condi- 
tions (Battig & Brackett, 1963; Battig 
et al., 1962). Consequently, the pres- 
ent absence of any such negative 2-Lo 
PT effects—except within the 12(8) 
list—appears readily explicable in 
terms of this mixed-list isolation effect. 
Clearly the effects of differing propor- 
tions of various types of pairs in a 
mixed list will require extensive sys- 
tematic investigation before transfer ef- 
fects can be evaluated unambiguously 
within mixed-list paradigms. 

Table 1 also suggests differences 
localized in U-pair performance fol- 
lowing the four PT conditions, and the 
superiority of Hi over Lo conditions 
in this respect led to a significant 
P-U X Hi-Lo interaction, F (1, 72) 
= 646, p< 02. However, P-U dif- 
ferences as a function of number of 
PT alternatives (2 or 8) were non- 
significant, F (1, 72) = 2.54, p > .10. 
Moreover, consistently worse perform- 
ance on P relative to U pairs was ob- 
tained under the present 8-Hi PT con- 
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dition corresponding most closely to 
previous MCR procedures that had 
shown sizable PA facilitation on P 
pairs (Battig et al., 1965; Thompson, 
1964, 1965), than under the 2-Lo PT 
condition that previously had shown no 
PA facilitation whatever (Battig & 
Brackett, 1963; Battig et al., 1962). 

Thus the present results provide no 
evidence for differential facilitation of 
PA performance on P pairs by the 
present PT conditions, and indicate in- 
stead that Lo-similarity PT conditions 
(particularly 8-Lo) had produced sig- 
nificant interference with PA learning 
of U pairs. A similar interference 
with U pairs may also account for both 
the overall and subset differences fa- 
voring Rel and/or Mix over Irr 
groups. 

The prime sources of interference 
with PA learning of U pairs appear 
to have been incorrect and irrelevant 
correct PT alternatives that were for- 
mally similar to items constituting 
these U pairs. The limited population 
of CVC trigrams renders it inevitable 
that one or more of those used as PT 
alternatives must have been highly 
similar to U PA items. Moreover, the 
present recognition results revealed a 
highly significant above-chance (.23) 
accuracy for PT incorrect alternatives, 
F (1, 112) = 83.82, p < .001, as well 
as a recognition for irrelevant correct 
PT alternatives (.68) that significantly 
exceeded that for either P (.56) or U 
items (.63) appearing in the PA list, 
F (2, 112) = 3.98, p < .025.4 Thus 
the amount of PT learning of CVCs 
not appearing in the PA list was 
clearly sufficient to interfere substan- 


4 Recognition accuracy was measured and 
corrected for guessing by calculating the pro- 
portion of correct identifications given for 
each type of item, and subtracting from it 
the proportion of times this identification was 
given incorrectly for all other types of items 
combined. 
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tially with PA learning of items similar 
to these PT alternatives. 

That these PT interference effects 
appear to be specific to U pairs, and 
not to apply to P pairs, may reflect a 
counteracting facilitation effect whereby 
conditions of high intratask interfer- 
ence during the PT task may represent 
also a source of intertask facilitation 
for these PT items when they are in- 
cluded in the PA list (Battig, 1966). 
In other words, the P pairs, having 
already appeared on the PT task along 
with those items that interfere with 
PA learning of U pairs, may be less 
susceptible to this interference because 
they have already been learned under 
PT conditions where the interference 
is present. Thus rather than being 
unaffected by PT conditions, the PT 
effects on P pairs may represent a 
cancellation of the differing amounts of 
PT interference operative within the 
PA task by concomitant differences in 
PA facilitation resulting from the pres- 
ence of these sources of interference 
also within the PT task itself. Since 
the latter type of facilitation cannot 
affect new PA pairs not appearing in 
the PT list, only the differential PE 
interference effect would be operative 
therein, as indicated by the present 
pattern of PA performance differences 
for U pairs. 

The sizable PA interference pro- 
duced by incorrect and Irr correct PT 
alternatives under the present 8-Hi 
conditions may also be responsible for 
the marked discrepancy of the present 
8-Hi results from the large facilitative 
effects produced previously by similar 
types of multiple-choice recognition 
(MCR) item pretraining procedures, 
since these latter had excluded most of 
the present PT sources of PA interfer- 
ence. The incorrect alternatives used 
by Battig et al. (1965) were all re- 
arranged letter orders of the correct 
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trigram items, whereas none of Thomp- 
son’s (1964, 1965) incorrect alterna- 
tives differed by more than two letters 
from her correct items (seven-letter 
Turkish words or six-letter CVCCVC 
disyllables). Thus neither type of in- 
correct alternative would be as likely 
to overlap and interfere with other 
items constituting the PA list as would 
the present CVC incorrect alternatives. 

The sizable decrement for U. pairs . 
under the present 8-Lo conditions fur- 
ther suggests the Battig et al. (1965) 
results to reflect primarily interference 
with their U pairs resulting from the 
simultaneous presentation of all cor- 
rect and incorrect alternatives under 
their MCR procedure, rather than ac- 
tual facilitation on PA pairs consisting 
of pretrained items. Moreover, a 
study completed after submission of 
the present report (Battig, in press b) 
has replicated both the present 8-Hi 
and Thompson’s MCR results, in a 
manner suggesting the latter PA fa- 
cilitation to reflect a strategy whereby 
Ss select and concentrate upon the 
learning of familiar PA pairs com- 
posed of items from the PT task, due 
to the extreme difficulty and practical 
impossibility of attempting to learn all 
of the 12 pairs of CVCCVCs constitut- 
ing the PA list. 

C-NC comparisons —Despite the 
fact that the present C procedure en- 
tailed decreased presentations and thus 
fewer opportunities for correct re- 
sponses to easy items, it nonetheless 
resulted in insignificantly (F< 1) 
fewer total PA errors (22.9) than did 
the NC procedure (24.1). Of these 
errors, significantly fewer preceded the 
first correct response to each pair 
under the C (17.8) than NC (21.3) 
procedure, F (1, 141) = 4.09, p « .05, 
Whereas errors after the first correct 
response were significantly greater for 
C (5.1) than NC (2.8) groups, F = 
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11.27, p< .001. These results are 
consistent with previous findings of no 
differences between C and NC proce- 
dures in attainment of a criterion de- 
fined by total correct responses summed 
over trials (Merikle et al., 1965), al- 
though Battig (in press a) has found 
the C procedure to be significantly 
more efficient than the NC procedure 
for attainment of a more stringent cri- 
terion of one errorless trial. 

To evaluate the extent to which 
learning of individual pairs was more 
closely equated under the C than NC 
procedure, all pairs for each S were 
ranked as to number of PA errors, and 
mean ranks were determined for all 
pairs recalled correctly after 24 hr. 
Within the 12-pair list, these ranks 
were significantly larger (ie, more 
difficult pairs were recalled) for C 
(6.0) than NC (5.4) procedures, 
F (1, 60) = 7.82, p < .01. This dif- 
ference, however, was nonsignificant 
for the 8-pair list (F <1), and both 
C and NC procedures showed signifi- 
cantly (p < .001) better overall recall 
of easier pairs. Further analysis re- 
vealed that both recall of more difficult 
pairs and total number of recalled 
pairs were maximal for C groups fol- 
lowing 8-alternative PT conditions, 
which led also to a significant C-NC 
X 8-12 Pair X 2-8 Alternative inter- 
action for total recalled pairs, F (1, 
96) 2413, p<.05. So at least 
under conditions of maximal intratask 
interference produced by 8-alternative 
PT conditions and a 12-pair list, the 
C procedure does lead to facilitated 
recall of more difficult PA pairs. 

Other results—Several additional 
significant PA learning and retention 
main effects and interactions also 
emerged from the various analyses of 
the data. Although overall PA per- 
formance measures were largely free 
of interactions between PT and/or PA 
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variables, there was substantial vari- 
ation between the two PA subsets that 
were common to 8- and 12-pair lists, 
and consequent significant interactions 
of subsets with (a) Rel-Mix-Irr and 
Hi-Lo PT conditions (p < .005), 
(b) Rel-Mix-Irr, Hi-Lo, and C-NC 
conditions (p < .005), (c) 2-8 alter- 
native, 8-12 pair, and C-NC conditions 
(p «.05), and (d) 2-8 alternative, 
8-12 pair, C-NC, and Rel-Mix-Irr 
conditions (p< .01). The first two 
interactions reflected not only the 
greater subset differences for Mix 
groups (where one subset consisted of 
P and the other of U pairs), but also 
subset differences of equal magnitude 
for Rel groups learning under the C 
procedure, which were opposite in di- 
rection for Hi- and Lo-PT conditions. 
By contrast, the remaining two inter- 
actions both resulted from larger sub- 
set differences under NC procedures 
and with Irr PT conditions, These 
subset interactions show the previously 
established equivalence of the two PA 
subsets (Battig & Brackett, 1963) not 
to hold under the present variations in 
PT conditions and C-NC procedures, 
probably because of PT differences in 
interference with individual items con- 
stituting the two subsets and/or 
changes in relative difficulty of individ- 
ual pairs under the C procedure. 
Further evidence that Irr PT items 
were a major source of PA interfer- 
ence was provided by the significantly 
superior recall of PA pairs by Rel 
(81.4%) than Mix (69.9%) groups, 
F (1, 96) = 10.46, p < .005. Recog- 
nition accuracy * for items constituting 
PA pairs was also significantly higher 
for Rel (.71) than Mix (.56) groups, 
F (1, 112) =9.43, p < .005. 
Despite their greater PA learning 
difficulty, percentage of correct 24-hr. 
R-term recall was insignificantly (p > 
.10) higher for 12-pair (72.196) than 
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8-pair lists (67.1%). Moreover, rec- 
ognition accuracy for incorrect PT al- 
ternatives was significantly higher for 
12-pair (.30) than 8-pair (.16) groups, 
F (1, 112) = 7.05, p < .01. Thus the 
greater difficulty of the 12-pair lists led 
if anything to better recall and recog- 
nition performance, in support of Bat- 
tig’s (1966) principle that learning 
under conditions of high intratask in- 
terference may lead to significant fa- 
cilitation on these items when they are 
included in a subsequent new task (e.g., 
written recall or recognition). Two 
additional findings also supportive of 
this principle were the greater recogni- 
tion accuracy (a) for CVCs common 
to the PT and PA lists following 
8-alternative (.69) than 2-alternative 
(.59) PT conditions, F (1, 112) — 
4.07, p< 05, and (b) for incorrect 
PT alternatives following Hi- (.32) 
than Lo- (.15) PT conditions, F ( 
112) —11.36, p < .001. 
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ASSOCIATIVE RULES GOVERNING RECALL 
AND MISRECALL* 


PAUL W. FOX AND 


University of Minnesota Tulane University 


An experiment was performed to evaluate a new conception of the 
relationship between retention and forgetting. Ss were trained with 
the 2nd most frequent free-associative responses (Rəs) to Kent- 
Rosanoff stimuli, and were tested for written recall of the Re words in 
the presence of the corresponding stimulus terms. Forgetting was 
fractionated into components of misrecall of which 2, intrusions of free- 
association primaries (Rs) and intrusions of free-association responses 
RR. from the associative hierarchy (Rex's), were predominant. 
Correct recall of Re words was tested under 2 different associative 
rules: (a) p(S — Re); and (b) p(Rı—> Ra). Misrecalls in each case 
were predicted by associative connections p(S — R:) and p(R — R;). 
The results support the hypothesis that retention and forgetting are 
2 phenomena with multiple components subject to different associative 
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rules. 


The present investigation has its 
origin in the data of a retention experi- 
ment performed by Blick (1965), who 
trained Ss with free-association pri- 
mary words (R,'s) on one page of a 
booklet, the corresponding free-associ- 
ation secondaries (R,'s) on the next 
page, and tested for written recall of 
the R, words a short time later in the 
presence of the corresponding stimulus 
terms. These are the same stimuli 
used in free-association tests (Russell 
& Jenkins, 1954) and for which we 
have an associative hierarchy in terms 
of p. As expected, many R, words 
intruded in Recall and increasingly so 
(up to 56%) the higher the associative 
p(R,) calculated from the free-associ- 
ation norms. Unexpectedly, however, 
a few R, words were exceptionally 
well recalled, regardless of a large as- 
sociative value of p(R,), and weak- 
ened the Blick effect. Examination of 
these too well-remembered words often 


1This work was supported by Contract 
Nonr-475(10) between Tulane University 
and the Office of Naval Research, the second 
author serving as Principal Investigator. 
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suggested that an associative bond ex- 
isted between primary and secondary; 
that is, the R, word was evocable by 
R, as a free associate. There was no 
way of checking because the norms did 
not contain this information, though 
Palermo and Jenkins (1964) provide 
complementary data from a multitrial 
task in extending Jarrett and Scheibe’s 
(1963) proposal that multiplicative re- 
lations between word-association prob- 
abilities serve to determine rate of 
paired-associate learning. 

Discovery of this probable phenome- 
non in Blick’s data led to the hypothe- 
sis that the associative -strength of 
~(R,) helps to govern the incidence 
of R, recalls as well as R, and Rj» 
misrecalls. For example, the stimulus 
word TABLE evokes the cultural pri- 
mary CHAIR with a probability of 44 
in free-association norms (Bilodeau & 
Howell, 1965). The second most fre- 
quently occurring response to TABLE 
is TOP, with p(R,) = .16; carr does 
not evoke Top at all. According to 
the associative rule (Rule 1) put for- 
ward by Blick, cHAIR should intrude 
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often in recall (because of the relative 
strength of the S — R, bond) when S 
is tested for retrieval of the R, term in 
the presence of the stimulus. How- 
ever, an associative connection may 
exist between the R, word and the 
word to be recalled, as in the case of 
the word sir. The stimulus TABLE 
does not evoke sir at all, but the R, 
(CHAIR) does, with p = .37. In this 
case, a second associative rule (Rule 
2) is required for the prediction of cor- 
rect recall and R, intrusion. In the 
example above, sır may be designated 
an R, word, signifying that its recall 
would be a correct response and that 
its probability of associative evocation 
by the stimulus is zero. 

Figure 1 integrates these alternative 
associative strategies for regulating 
correct recall into a general schema for 
a Two-Process model of retention and 
forgetting. While Blick demonstrated 


ASSOCIATIVE RULE FOR MISRECALL 
P(S>R,) and P(Ri>Ro) 
eet airy er B^) 


Vs 


RECALL 


RULE | RULE 2 RULE 3 
P(S5R;) P(Rj*Rg) p(S>R,) & 
PIR >R3) 


THREE ASSOCIATIVE RULES FOR 
CORRECT RECALL 


Fic. d. The relationship between rules and 
components of misrecall and recall. 
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that increasing preexperimental asso- 
ciative values of p(R,) lead to increas- 
ing amounts of recall interference 
through intrusion of the R, term, Fig. 
1 shows that evocation of the R, dur- 
ing recall may play a facilitating as 
well as an interfering role in the re- 
trieval of R, words. Misrecall is rep- 
resented above the chord and this area 
has been fractionated into three com- 
ponents of which two, R, and R,, 
(free-association responses R,-R,), 
have been shown by Bilodeau, Fox, 
and Blick (1963) to vary according to 
rules based upon #(R,). These au- 
thors reanalyzed íree-association data 
from Russell and Jenkins (1954) and 
divided 45 stimulus words into three 
categories operationally distinguished 
by the probability with which the stim- 
ulus evoked an R,, R, or Rg» re- 
sponse. The defining characteristics of 
these categories are shown below. 


Category A: Low /(R,), low 
P (R,), and high ?(Rs.,). 

Category B: Moderate p(R,), max- 
imum $(R,), and moderate P (Rs). 

Category C: High P(R), low 
P(R;), and low &(Rs ,). 


Figure 1 indicates that E may use one 
Set of associative rules based upon 
P(S>R,) to predict which compo- 
nent of misrecall will be dominant, 
high cultural p(R,) (Cat. C) leading 
to many R,’s and a few R;,/s; low 
P(R) (Cat. A) leading to few R,’s 
and many Rss. However, we have 
just hypothesized that it is more com- 
plicated than that, for no knowledge 
of p(R,) is adequate in itself to pre- 
dict level of correct and incorrect re- 
call completely. As. the associative 
tules in Fig. 1 indicate, a value of 
2 (R,) may convert to any level of cor- 
rect recall from p of O to 1.0 thereby 
increasing the obtained R, and de- 
creasing the obtained R, Note that 


ASSOCIATIVE RULES OF RECALL AND MISRECALL 


TABLE 1 


CULTURAL ASSOCIATIVE PROBABILITIES FOR 
RULE 1 anp RuLE 2 Worps 


Rule 1 Rule 2 


Cat. 


(Ri) | (Rs) | p(Ri > Ra)| 2(R9 | (Rx > Ro) 


A} 1 | 10 -00 -00 adit 
B | .40 | .23 .00 .00 34 
C | .70 | .07 .00 .00 152 


Rule 3 represents the combination of 
Rules 1 and 2. 

The orientation of the experiment 
reported below has been to hold the 
probability relations for R, and Rs, 
misrecalls constant for any word, while 
testing Associative Rules 1 and 2 for 
correct recall. The purpose is to in- 
vestigate verbal retention when one set 
of rules based upon stimulus categories 
is formulated for the prediction of 
dominant components of misrecall and 
a different set of rules is proposed to 
govern the level of correct recall. 


METHOD 


Subjects and materials. —Eighty male un- 
dergraduates at Tulane University were 
unsystematically divided into four groups 
of 20 Ss. The booklet was a modified form 
of the one described by Bilodeau et al. 
(1963). It consisted of an opaque cover 
page for S’s name, 1 training page of five 
words, 5 yowel-cancellation pages, and a 
recall page. 

Five R, and five Re words were selected 
according to Rules 1 and 2 illustrated in 
Fig. 1 and described in the example given 
above. The five Re words for Rule 1 were: 
person (Cat. A), FINGER and norse (Cat. 
B), smapE and FELLOW (Cat. C). The R. 
words for Rule 2 were: WOMEN, SHOE, HARD, 
DARK, and TALL. Table 1 displays the mean 
associative probabilities of these words ar- 
ranged by categories and rules. The Rs 
were thus either moderately (Cat. B) or 
weakly (Cat. A and C) evoked as associates 
to their stimuli; the R.’s were not evoked 
at all by these stimuli. Conversely, the R.’s 
ranged from strong to moderate associates of 
the Ri’s when the Ri: words were used as 


- imum R,’s (.28). 
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free-association stimuli. The Rəs were not 
evoked as associates by the Rv's at all. 

Procedure.—The S's task was to study five 
Rs or Re words (20 sec.), cancel vowels (7 
min.), and recall the words of training (2 
min.). The Ss were tested for written recall 
with or without verbal stimulation (S or 
NS). When Stimulated, the printed stimu- 
lus words were displayed to the left of five 
blanks on the recall page and Ss were in- 
structed to refer to them for help in remem- 
bering the words of Training. All Ss were 
instructed to fill in every blank and to write 
the first word they thought of if they could 
not remember. The experiment was carried 
out with one intact group and one proctor 
was used per 25 Ss. 

All responses were scored according to 
sources of variance developed by Bilodeau 
etal. (1963). Those especially important for 
understanding the data below are: position 
in the response hierarchy (Ri and Rs-n mis- 
recalls, Re or Re recalls) and Same List 
recalls (correct responses but written out of 
serial position). 


RESULTS 


The recall probabilities for Stimu- 
lated Ss under Rules 1 and 2 are pre- 
sented in Table 2. Rows represent the 
three word categories and columns the 
critical sources of correct recall (R, 
or R,) and misrecall (R, or Rs-n). 
Rule 1 produces phenomena typical of 
the earlier study of categories: Cat. A 
yields maximum Rs,'s (.56), Cat. B 
maximum R,’s (.67), and Cat. C max- 
The first column 
under Rule 2 shows that the critical 
component of misrecall (R,) varies ac- 
cording to the associative p(R,), from 


TABLE 2 


RECALL PROBABILITIES FOR RULES 1 AND 2 
CLASSIFIED BY WORD CATEGORY AND 
POSITION IN RESPONSE HIERARCHY 


Rule 1 Rule 2 
Cat. 
Ri R: Ran Ri Re Ran 
A HL |e P| 2555 [2:001] 5:80 Pre) 
B 203 |" SAN 22 |'108 1 235- |" 37 
Cc -28 |^.42 | .16 | .12 | .60 | 10 


PROBABILITY OF Rç IN RECALL 
4 a è s 
o 


720. 30 40 $0 86 


T0 80 
ASSOCIATIVE p(R, => Re) 


Fic. 2. Correct recalls under Rule 2 as a 
function of the associative P(ROR.). 


Cat. A lowest to Cat. C highest? The 
recall of R, shown in Table 2 is best 
brought out in Fig. 2. Correct recall is 
plotted on the ordinate and the asso- 
ciative value of p (R, > R,) is on the 
abscissa. It is apparent from Fig. 2 
that recall of R, is positively and 
strongly related to p(R, > Re). Thus, 
to recapitulate, when correct recall is 
based upon Rule 1, the category with 
the highest »(S — R,) (Cat. B) pro- 
duces the best recall. On the other 
hand, when recall is based upon Rule 
2, increasing associative 2(R,— R;)s 
lead to increasing values of Correct re- 
call and best recall can be obtained 
from Cat. A. Lastly, note (in Table 
2) the great range of correct recall ob- 
tained from a list of a mere five words. 
Rule 1 produces a low of 17, Rule 2 
a high of .80, and within Rules the cells 
of correct recall vary by 45 or more 
Percentage points according to their 
Tespective associative connections. 


2 This finding has consistently been ob- 
tained with the methodology of stimulated 
recall. The phenomenon, though not large 
in this iment, replicates that sum- 
marized by Bilodeau (1966) from data pro- 
vided by over 1,000 Ss, 
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The present experiment focuses upon 
recall and misrecall as functions of as- 
sociative rules under conditions of stim- 
ulation. Not Stimulated performance 
conformed in general to findings pre- 
viously reported by Bilodeau et al. 
(1963) and Fox, Blick, and Bilodeau 
(1964). That is, as compared with 
Treatment S, NS produces (a) few if 
any R, and Rs, misrecalls; (b) con- 
Siderably more responses not found in 
the Russell- Jenkins (1954) hierarchies; 
(c) some failures to respond at all; 
and (d) widely varying recalls with 
mean recall (including Same List re- 
sponses) approximating the mean of 
Treatment S summed over all cate- 
gories, 

The experiment nowhere exposed pri- 
maries which might serve to cue a 
correct recall, relying instead upon 
evocation of the primary by S’s preex- 
perimental associative habits during a 
relatively long recall period under in- 
structions favoring the associative 
processes. An additional experiment 
was subsequently performed with a 2- 
page training procedure modeled after 
that used by Blick (1965), where R, 
words were displayed on page 1. This 
supplementary experiment was con- 
sistent in supporting the results re- 
ported above, and also provided a test- 
ing of the facilitative effects of 
Associative Rule 3 when the cultural 
primaries are exposed during training. 


* A table presenting obtained recall prob- 
abilities for Not Stimulated Ss in this ex- 
periment and a table and explanatory material 
summarizing the method and results of the 
Supplementary experiment have been de- 
Posited with the American Documentation 
Institute. Order Doctiment No. 9013 
from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Con- 
gress, Washington, D. C, 20540. Remit in 
advance $1.25 for microfilm or $1.25 for 
Photocopies and make checks payable to: 
Chief, Photoduplication Service, Library of 
Congress, 
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Discussion 


This experiment and that placed on 
record with ADI represent only a limited 
approach to what appears to be an ex- 
tensive area of inquiry. Our incomplete 
knowledge of associative probabilities be- 
tween words within hierarchies has thus 
far limited the testing of many hypotheses 
and restricted the inquiry to the very few 
words with known probability networks. 
Examination of the 100 response hier- 
archies in the Russell-Jenkins (1954) 
norms revealed only one stimulus word 
with the associative connections de- 
manded by Rule 3 (i.e. BITTER — SWEET 
—>sour), where sour is both an R, to 
BITTER and a strong associate of the R,. 
The use of R, words in the present ex- 
periment partially circumvents this obsta- 
cle, but leaves the explication of Associa- 
tive Rule 3 relatively unexplored. 

Norms recently collected at Tulane 
University (Bilodeau & Howell, 1965) 
on both discrete and continued free asso- 
ciations for 45 networks should soon 
enable E to vary the associative proba- 
bility over a broad range for the top 
four terms of the network. Given such 
norms, E might well compose more 
thorough and improved experiments on 
various forms of recall and misrecall. 
At a minimum, such associative data will 
allow examination of the possible role of 
backward associations and will favor a 
more complete evaluation of the exact 
relationship linking (S — R,) and (R, 
— R,) terms, Given this information, it 
may be possible to reverse the Blick ef- 
fect and produce more intrusions of R, 
when the associative p(R,) is low than 
when it is high by using a zero associa- 
tive p(R, > R,) with Cat. A words and 
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a maximally high p(R,—>R,) with Cat. 
C words. The goal of such formulations 
is to enable E to vary correct recall over 
the widest possible range, and at the same 
time maintain control over the amount 
and type of intrusions produced. 
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16 practiced Os monitored an 8 X 8 matrix for 4 successive 100-min. 
sessions and detected, located, and identified additions and deletions of 


alpha-numeric stimuli. Signal fri 


equency (60 and 150 per session) 


was varied between Ss, and stimulus density (4, 8, 16, and 32 stimuli) 


within Ss. 


Identification accuracy 


and detection latency were the 


most sensitive measures, revealing a vigilance decrement followed by 
an end spurt Monitoring performance was poorest overall and the 
decrement largest under low frequency 32 density, was poorer for dele- 
tions than for additions, and was better at the mean than extreme inter- 


signal intervals. 


These trends were based on short-term wanderings 


of attention, operationally defined as missed signals and very long de- 
tection latencies, rather than on changes in absolute Sensitivity. 


Research on human attentiveness 
over extended periods of time has con- 
centrated primarily on the monitor’s 
ability to detect signals occurring in a 
simple display (Bakan, 1955; Mack- 
worth, 1948; Nicely & Miller, 1957). 
The classical vigilance decrement iden- 
tified in this work has appeared as a 
monotonic decline in detected signals 
with duration of watch. However, re- 
cent investigations of monitoring com- 
plex displays have consistently revealed 
a complete absence of a vigilance decre- 
ment in terms of detection accuracy 
and only an elusive one in terms of 
detection latency (Adams, 1963). 
Thus, Adams and Boulter conclude: 
“Although vigilance decrement is often 
substantial for very simple monitoring 
tasks . . . the attentive powers of the 
human $ in complex visual tasks ap- 


pear to be far greater than many would 
surmise [1964, p. 131]." 

Yet, despite the apparent stability 
of attention to complex displays, con- 
tinued investigation seems warranted 
in order to determine if complex dis- 
Plays foster more subtle forms of vigi- 
lance decrement than heretofore iden- 
tified. For example, while prolonged 
watch may not affect sheer detectability 
of signals, it may weigh heavily on the 
ability to describe or remember signals. 
Furthermore, the effect of continued 
Monitoring could be one of increased 
variability, rather than overall suppres- 
sion, of performance. Even if long- 
range effects are minimal, short-range 
deteriorations in attention which are 
confined to the intersignal interval 
(ISI) could be prominent. The pres- 
ent study sought to explore these possi- 
bilities and to identify factors which 
regulate attention to complex displays. 

The design permitted an examination 
of signal frequency (a classical variable 
in studies of vigilance) under varying 
levels of display complexity (stimulus 
density). While most major theories 
of vigilance (Frankmann & Adams, 
1962) predict maximal vigilance decre- 
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ment under low signal frequency, they 
yield different predictions of the effect 
of stimulus density. Thus, assuming 
that amount of novel stimulation and 
frequency of observing responses (Hol- 
land, 1958) are a direct function of 
stimulus density, maximal vigilance 
decrement is predicted under low 
density (low stimulus novelty) by 
Scott’s (1957) arousal theory and 
Broadbent’s (1953) filter theory, and 
under high density (low ratio of sig- 
nals to observing responses) by Jerison 
and Pickett’s (1964) reinforcement 
theory. The present study provided a 
test of these opposing predictions. 


METHOD 


Subjects —The Ss were 16 university un- 
dergraduate males who had from 2 to 5 hr. 
of practice on the basic task prior to the first 
experimental session. Practiced Ss were 
used to minimize confounding of learning 


and vigilance effects. Each S was paid 
$6.00 upon completion of his four 100-min. 
monitoring sessions. 

Apparatus and stimulus material —All 
programming of input and scoring of re- 
sponses was performed using a combined IBM 
1401/7094 digital computer system. Four 
cathode ray tube display consoles were linked 
directly to this system in such a way that 
four Ss could serve simultaneously under the 
same conditions. A 5-in. square, 8X8 
matrix of the basic form shown in Fig. 1 
was presented on each display. A detection 
button and a light pen served as response 
devices for each S. The stimulus population 
was composed of the 64 combinations of the 
letters A through H with the numbers 01 
through 08, Asterisks denoted empty cells 
and signals occurred either as additions 
(adds) or deletions (omits) of the alpha- 
numeric stimuli. 

Experimental design and signal program- 
ming—Two levels of signal frequency (60 
and 150 per session) and four levels of 
stimulus density (4, 8, 16, and 32 stimuli) 
were combined factorially to produce eight 
experimental conditions. Signal frequency 


ABCDEFGH OI 02 03 0405 06.07 08 


Desk Spoce Containing Response Devices 


Fic. 1. Front view of pertinent features of S's display. (See text for a full explanation.) 
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was manipulated between Ss and stimulus 
density within Ss yielding two groups of 
eight Ss each; all Ss served one session un- 
der each density level for a total of four 
sessions. The alpha-numerics appearing on 
the starting display and those added in a 
given session were selected randomly from 
the stimulus population. Those to be omitted 
were selected randomly from the subset 
chosen for the starting display and the adds, 
Adds and omits occurred with equal fre- 
quency in order to maintain the desired 
density levels, The temporal order of the 
signals was randomized with the obvious 
restriction that a stimulus could not be 
omitted unless it was already on the matrix, 
The selection of empty cells for placement 
of adds was also random. Since the tem- 
poral order of adds and omits was unsys- 
tematic, the density levels fluctuated from 
moment to moment and those specified above 
are actually averages. The duration of the 
ISIs varied normally about means of 40 sec, 
(high signal frequency) and 100 sec, (low 
signal frequency) with the restriction that 
all values were multiples of 10 sec, The 
temporal order of ISIs per condition was 
random so that at unpredictable moments in 
time randomly selected signals would appear 
in varying display locations. 
Procedure.—The eight Ss who served un- 
der each signal frequency condition were 
divided into two Squads of four Ss each, 
The Ss in each squad performed simultane- 
ously under identical conditions. The same 
random order of density levels was deter- 
mined for one Squad from each signal fre- 
quency group, and the reverse random order 
was administered to the other squad. The 


When $ saw a signal, he was to complete 
a response sequence of detection, location, 
and identification. 
Was to be made as rapidly as possible by 


mounted such that S could rest the index 
finger of his left 
throughout each session without accidentally 
activating it, 
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neath the matrix) of the stimulus or DNO. 
The DNO response was to be made only 
when $ was unaware of the exact location 
or complete identification of the signal. 
Partial identifications were accepted but were 
extremely rare. The computer provided im- 
mediate feedback to S after each step in the 
response sequence so that S could determine 
if his responses were Properly recorded. If 
a detection was not recorded, S merely re- 
peated the response; if a location or identi- 
fication was not Properly recorded, $ was 
instructed to erase the faulty record by il- 
luminating ERS (written to the right of the 
matrix), to repeat the correct response until 
Properly recorded, and finally to illuminate 
EXC (also written to the right of the 
matrix). The entire Sequence of detect, 
locate (or DNO), EXC, identify (or DNO), 
and EXC was to be completed before 5 
responded to a new signal. 


REsutts 


Major analyses.—Since a short-term 
memory requirement was more inti- 
mately involved in responses (particu- 
larly identifications) to omits than to 
adds, it was necessary to separate the 
two types of signals for analytical pur- 
Poses. Hence, the adds and omits of 
each condition were independently 
organized into successive blocks of five 
signals. This led to six blocks of each 
signal type for each low-frequency con- 
dition, and to 12 of each type for each 
high-frequency condition. Each block 
was scored in terms of the mean time 
of occurrence (into the Session) of the 
five signals, and these scores were used 
to match high-frequency blocks with 
low-frequency blocks so that corre- 
sponding high- and low-frequency con- 
ditions could be equated in terms of 
numbers of signal blocks, Among the 
measures obtained for each Ș per block 
were percentage of signals detected, 
Percentage of detected signals cor- 
rectly located, percentage of detected 
signals Correctly identified, and mean 
detection latency (in .001 sec). La- 
tencies of signal detections that oc- 
curred while $ was still responding to 
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previous signals were spuriously high 
and, therefore, did not enter the calcu- 
lation of detection latencies. An 
ordinal time scale in the form of signal 
blocks was preferred to the more com- 
monly used interval scale in order to 
avoid possible measurement artifacts 
produced by an uneven distribution of 
signal frequencies across actual time 
intervals. 

Detection and location scores were 
extremely high (90.4% and 93.2%, 
respectively) and largely invariant. 
Consequently, only identification and 
latency data will be discussed. Analy- 
ses of variance were performed on both 
measures, and Duncan’s test (Edwards, 
1963), using the .05 level of signifi- 
cance, was employed to identify the 
source of each significant effect. The 
analyses of variance are summarized in 
Table 1 and the latency trends are 
plotted in Fig. 2. The significant main 
effects of frequency were based on su- 
perior performance under high fre- 
quency. This result is consistent with 


previous findings of a positive rela- 
tionship between detectability and sig- 
nal rate (Jenkins, 1958). The signi- 
ficant main effects of density reflected 
an inverse relationship between moni- 
toring performance and density level. 
This result also agrees with previous 
data (Adams, Stenson, & Humes, 
1961). In all cases performance under 
Density 32 was inferior to that under 
all other levels, and in the case of iden- 
tification of omits, performance under 
Density 4 was superior to that under 
all other levels. The significant Fre- 
quency X Density interaction, which 
characterized both adds and omits, was 
defined by a restriction of the frequency 
effect to Density 32, and the density 
effect to low frequency. 

Blocks was a significant main effect 
only for the latency data and was gen- 
erally based on poorest performance in 
Blocks 3-5. This curvilinear vigilance 
function is in contrast to the classical, 
linear vigilance decrement. The Den- 
sity X Blocks interaction, which was 


TABLE 1 


SuMMARY OF ANALYSES OF 
IDENTIFICATION 


VARIANCE OF MEAN DETECTION LATENCY AND 
ACCURACY FOR ÁDDS AND Omits 


Between Ss 
Frequency (F) 
error 


Detection Latency 


q.89***| 237.83 
4.39** | 82.97 


Identification Accuracy 
cee 
Adds Omits 


1654.22| 2.99 |14751.03] 5.13* 
551.94 2874.02 


1436.51 | 4.62** |20236.48 23/12t** 
2.56* | 207.31 | 1.56 
4.13* | 1004.25 | 3.23* 
AT 51.37 | .39 
1.97* 88.40 | .74 
19 99.07 | .83 
310.75 
132.92 
119.47 
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lance function was most prominent un- 
der lowest and highest density levels in 
the case of omits. Figure 2 suggests 
that a magnification of the blocks effect 
under Density 32 also characterized 
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add signals. The Frequency X Blocks 
interaction barely missed significance 
for latency of detecting adds (F = 2.35 
was required and F = 2.32 was ob- 
tained), and Duncan’s test confirmed 
the indication in Fig. 2 that the blocks 
effect was most conspicuous under low 
frequency. In short, then, the locus 
of largest vigilance decrement was low 
frequency 32 density. 

The overall effects of signal type 
were ascertained by Frequency X Den- 
sity X Signaltype analyses of variance. 
Responses to adds were significantly 
superior to responses to omits in terms 
of both identification accuracy, F (1, 
14) = 238.46, p < .001, and detection 
latency, F (1, 14) = 19.05, p < .001. 
The Density X Signaltype interaction 
was significant in terms of identifica- 
tions, F (3, 42) = 9.01, p < .001, and 
was based on a magnification of the 
density effect with omits. In addition, 
the Frequency X Signaltype interac- 
tion was significant in terms of-detec- 
tion latencies, F (1, 14) = 8.04, p 
< .05, and was based on a restriction of 
the frequency effect to omits. Appar- 
ently, then, the short-term memory 
attribute of omits retarded monitoring 
performance and tended to augment 
the density and frequency effects. 

Fine-grain analyses —Being the locus 
of maximal vigilance decrement as 
well as a condition of high display com- 
plexity, the low frequency 32 density 
condition was selected for further 
analysis. 3 

In order to eliminate possible arti- 
facts incurred by analytically separat- 
ing omits and adds, the signals of hoth 
types were put back in their original 
serial arrangement and then reorga- 
nized into six blocks of 10 signals. 
Hence, a given block contained both 
adds and omits, The mean, longest, 
and shortest latencies were determined 
for each 10-signal block and were sub- 


mitted to trend analyses. A significant 
quadratic component was obtained only 
for mean, F (1, 35) = 11.10, p < .01, 
and longest latencies, F (1,35) = 5.73, 
p «.05. This set of results suggests 
that the curvilinear vigilance functions 
portrayed in Fig. 2 are not artifactual 
and are determined primarily by the 
larger latency scores. 

An effort was also made to specify 
the relationship between attention and 
ISI. This function is presented in Fig. 
3, and a trend analysis of these data 
exposed a significant quadratic compo- 
nent, F (1, 42) =6.05, p<.05. In 
agreement with the expectancy theory 
of Baker (1959) then, latencies were 
shorter at the mean than at the extreme 
ISIs. 

Tt was also of interest to test the 
notion that the memory element of re- 
sponses to omits augmented the size of 
the vigilance decrement. A greater 
decrement fostered by omits than by 
adds would be consistent with this 
hypothesis. However, while a Signal- 
type X Blocks analysis of variance of 
latencies exposed a significant blocks 
main effect, F (5, 35) = 2.96, p < .05, 
the interaction did not attain signifi- 
cance (p > .05). Neither was a sig- 
nificant interaction revealed by the 
same analysis applied to identification 


Mean Detéction Latency (in sec) 


His eo 30 0 O —15— 39— 
Intersignal Interval (in sec.) 
Fic. 3. Detection latency as a function of 


ISI in the low frequency 32 density condition 
(adds and omits pooled). 


742 


data. Therefore, acceptance of the 
hypothesis is not warranted. 

Finally, error analyses of the low 
frequency 32 density condition showed 
that missed signals made up the bulk of 
the errors (82%) and that the prob- 
ability of these errors varied directly 
with the functions in Fig. 2 and 3. 
Thus, occasional long wanderings of at- 
tention, operationally defined as long- 
est latencies and missed signals, were 
most likely to occur during the extreme 
ISIs and in Blocks 3-5, Furthermore, 
these periodic lapses in attention did 
not seem to be uniquely affected by 
the memory requirement underlying 
omits, 


Discussion 


Of the major theories of vigilance, the 
present data are most consistent with 
Jerison and Pickett’s (1964) reinforce- 
ment theory which is based on the as- 
sumptions that signals are reinforcers, 
that monitoring is composed of observing 
responses, and that performance improves 
with increases in percentage of reinforce- 
ment (ie. ratio of signals to observing 
responses). If it can also be assumed 
that observing rate increases with stimu- 
lus density, then increasing signal fre- 
quency or decreasing density should in- 
crease percentage of reinforcement and 
thereby facilitate performance, The main 
effects of frequency and density are con- 
sistent with this reasoning. Furthermore, 
the Frequency x Density interaction agrees 
with the previous finding that the effect 
of a change in percentage of reinforce- 
ment is greater for low, than for high, 
initial percentages (Michels, 1955). 
Thus, the frequency effect is greater 
under the low initial percentage of re- 
inforcement assumed to accompany high 
density, and the density effect is greatest 
under the low initial percentage assumed 
to accompany low frequency. 

In discussing the obtained vigilance 
functions, it is convenient to distinguish 
between short-range effects limited to the 
ISI and long-range effects accruing over 
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the entire monitoring session (Frank- 
mann & Adams, 1962). The obtained 
short-range vigilance function is appar- 
ent in Fig. 3 and is entirely consistent 
with the reinforcement theory of vigilance 
which predicts that S is more alert around 
the most frequent ISI. 

The curvilinear long-range vigilance 
function may be described as a decrement 
followed by an end spurt. An end spurt 
is to be expected when practiced Ss are 
aware of session length (Bergum & Lehr, 
1963). More important is the decrement 
component of the vigilance function. The 
fact that the decrement was largest under 
low frequency 32 density conforms to 
Jerison and Pickett's theory. However, 
the additional fact that the decrement in 
responses to omits was second largest 
under Density 4 is incompatible with all 
prominent theories of vigilance (Frank- 
mann & Adams, 1962). Perhaps as dis- 
play complexity increases, the underlying 
determinant of the vigilance decrement 
shifts from a paucity of novel stimulation 
(Broadbent, 1953; Scott, 1957) to a low 
ratio of signals to observing responses. 
Unfortunately, the lack of consistent in- 
teractions of blocks with frequency and 
density renders these theoretical impli- 
cations only tentative, 

The fundamental nature of the short- 
and long-range decrements obtained in 
the present study were identified as short- 
term lapses in attention which were most 
likely during the extreme ISIs and in 
Blocks 3-5. The analyses of longest la- 
tencies and missed signals provide firm 
evidence favoring this conclusion. Since 
Shortest latencies were invariant with 
blocks, absolute sensitivity to signals was 
apparently not impaired by prolonged 
monitoring. Furthermore, the vigilance 
decrement appears to have been specific 
to attention. A special intrasession mem- 
ory decrement would be expected on the 
basis of an accumulation of proactive in- 
hibition, particularly when items of the 
same form-class (alpha-numerics in the 
Present case) are successively retained 
(Postman, 1961; Postman, Keppel, & 
Stark, 1965). However, this possibility 
Was supported neither by the Signaltype 
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x Blocks fine-grain analysis (which 
failed to reveal a significant interaction), 
nor by the fact that the vigilance decre- 
ment tended to be largest under low sig- 
nal frequency (maximum proactive in- 
hibition would be expected under high 
frequency). 

Finally, it is noteworthy that the 
latency decrements shown in Fig. 2 are 
considerably larger than ones found in 
previous studies of complex display moni- 
toring (Adams, 1963), and are correlated 
with a mean decrement of 50% in accu- 
racy of identifying omits. Contrary to 
the conclusions of Adams and Boulter 
(1964), then, vigilance decrements in the 
framework of complex displays may well 
be of practical significance. 
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A PSEUDOCONDITIONING EFFECT ON REACTION TIME! 
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AND EVERETT A. GARVIN 
North Central Massachusetts Mental H, ealth Center 
This experiment was performed (a) to compare the effect upon re- 


action time of paired and unpaired Presentations of the reaction-time 
signal and electric shock and (b) to determine whether use of a ready 


a means of gauging the progress of 


by the reaction-time signal The major findings were: (a) The slow- 
ing of reaction time occurs as a pseudoconditioning Phenomenon. (b) 
This slowdown is reduced or eliminated by the use of a ready signal, 


(c) The ready signal also prevents 


In a recent study Goerk and Kamin 
(1959) reported a slowing of reaction 
time to a signal previously paired with 
unavoidable shock, This was the case 
whether or not the RT response ter- 
minated the signal, They concluded 
that the phenomenon was one effect of 
an “anxiety-eliciting” CS. On the 
other hand, no latency changes were 
Observed by Bersh, Notterman, and 
Schoenfeld (1957a) when each stimu- 
lus term in a word-association series 
was interpolated between a tone signal 
and an unavoidable shock. Among the 
many procedural differences between 
the two experiments, one in particular 
suggests itself as a possible basis for 
the contrasting results, The tone in 
the investigation by Bersh et al, 
(1957a) might very well have acted as 
a ready signal, since the stimulus word 


ne authors and do not necessarily reflect 
Official Department of Army Policy, 
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always occurred 3 sec. after its termi- 
nation. It is well established that a 
ready signal interferes with eyelid con- 
ditioning (Dufort & Kimble, 1958; 
Dufort & Rollins, 1965; McAllister & 
McAllister, 1960a, 1960b ; Prokasy & 
Whaley, 1962). Presumably any fac- 
tor which detrimentally affects classical 
conditioning may impede the acquisi- 
tion of aversive Properties by a neutral 
stimulus paired with an aversive UCS. 

he present experiment was per- 
formed to determine whether a ready 
signal does, in fact, play the role sug- 
gested above. At the Same time con- 
trols were run for Pseudoconditioning. 
Kimble ( 1961) has noted that “pseudo- 
conditioning may be a Part of all condi- 
tioning in which a noxious stimulus is 
employed.” While evidence was pro- 
vided by Goerk and Kamin (1959) 
against a Pseudoconditioning basis for 
the latency rise they obtained, this evi- 
dence is somewhat equivocal, 


quisition by the RT signal of aversive 
Properties. The underlying assump- 
tion, as already implied, is that acqui- 
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sition of such properties by the stimulus 
is related to its development, through 
classical conditioning, of control over 
respondents. 


METHOD 


Subjects—The Ss were 40 male under- 
graduates, screened for cardiac and psychi- 
atric history and paid for their services. 
Four groups of 10 Ss each were run in 
sequence, with assignment to groups on a 
random basis. 

Apparatus—Heart rate was recorded by 
a Sanborn twin-viso recorder, Model 60- 
1300B. Reaction time, which involved the 
pressing of a telegraph key, was measured 
by a Meylan millisecond timer. The Ss re- 
ceived the RT stimulus, a 2,000-cps tone well 
above threshold, through a headset. A red 
pilot light, positioned at eye level, served as 
a ready signal for two of the groups. Shock, 
set at 25 v. de, was provided by an Electro 
Model 612 filtered de power supply. A 
small room within a larger one constituted 
the S cubicle, The E and control apparatus 
were outside the larger room, an arrange- 
ment which eliminated apparatus cues. 

Procedure.—Al| Ss were run on 2 con- 
secutive days, the daily sessions lasting ap- 
proximately 2 hr. The four groups involved 
were conditioning, no ready signal (Group 
A); conditioning, ready signal (Group B); 
pseudoconditioning, no ready signal (Group 
A'); and pseudoconditioning, ready signal 
(Group B'). Except for the presence or 
absence of a ready signal, the two condi- 
tioning groups were treated exactly alike. 
With the same exception, Groups A' and B' 
were also treated alike. For all groups the 
RT signal was a l-sec. tone. The ready 
signal for Groups B and B’, a 5-sec. light, 
preceded the tone at intervals varying ran- 
domly between 3 and 6 sec. 

Preparation of S for heart-rate record- 
ing and shock application is described in 
earlier papers (e.g, Notterman, Schoenfeld, 
& Bersh, 1952). The telegraph key was 
placed on the right armrest of a chair con- 
venient to S’s hand. Standard instructions 
pertaining to tone, shocks, and heart-rate 
recording were read to all Ss (see Notter- 
man et al., 1952). In addition, all groups 
received RT instructions, which for Groups 
B and B' included information about the 
role of the ready signal. 

For Groups A and B, the first day con- 
sisted of 60 basal trials, involving tone- 
alone presentations for Group A and light- 
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tone sequences for Group B; no shock was 
administered. Twenty additional basal trials 
were given on the second day. The 80 basal 
trials were designed to eliminate any precon- 
ditioning heart-rate response, and, at the 
same time, to provide RT practice. These 
were followed by 10 regular reinforcement 
trials, on which a 6-sec. shock was applied 
6 sec. after the end of tone or light-tone 
presentation. Finally, a series of random 
reinforcement trials was given, with 10 
reinforced trials randomly interspersed 
among 30 trials. 

For Groups A’ and B', the first day also 
consisted of 60 basal trials. On Day 2, 20 
basal trials were followed by 40 trials dur- 
ing which 20 shocks were randomly inter- 
spersed among 40 tone or light-tone presen- 
tations. An effort was made to preserve 
the same distribution of shocks as for the 
two conditioning groups. The series began 
for each S with a shock presentation, so that 
any immediate effect might be reflected in 
the first RT trial of the phase. Then, 9 
shocks were randomly interspersed among 
the first 10 tone or light-tone presentations, 
while the remaining 10 shocks were ran- 
domly interspersed among the last 30 tone 
or light-tone presentations. To minimize 
the possibility of conditioning, no tone or 
light-tone presentation was closely followed 
by shock. The interval between successive 
stimuli was varied randomly between 30 and 
120 sec., with a mean of 75 sec. Thus, shock 
never followed any tone or light-tone com- 
bination by less than 30 sec., an interval too 
long to produce heart-rate conditioning, as 
the data show. 


RESULTS 


The reaction-time data are graphed 
in Fig. 1 and 2. Each value is a mean 
of S medians. The division labels on 
the abscissas of the two graphs repre- 
sent equivalent trials for all groups, but 
apply directly to the conditioning 
groups, A and B. Thus “C1,” the last 
basal trial, is also the first trial on 
which tone or light-tone is paired with 
shock. Similarly “R1,” a trial in the 
regular reinforcement series as far as 
RT or heart rate is concerned, is also 
the beginning of the random reinforce- 
ment phase. For any given S in the 
conditioning groups, tone (or light- 
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Fic. 1, Mean RT by five-trial blocks for 
groups without a ready signal. (The verti- 
cal dashed lines demarcate the regular rein- 
forcement trials for the conditioning group. 
The remaining trials of the shock phase 
involved a random reinforcement schedule 
for that group.) 


tone) may not have been followed by 
shock on that trial. 

Figure 1 depicts roughly parallel 
curves for Groups A and A’. In the 
absence of a ready signal, neither group 
showed any trend during the basal 
phase. A nonsignificant difference in 
basal RT, about equal to that shown 
in Fig. 1, existed between the groups 
from the beginning of the phase. Ex- 
posure to shock produces a Sharp rise 
in RT followed by a gradual recovery. 
For Group A' the rise was immediate, 
since RT was significantly higher (.02 
> p >.01)? on the first trial after 
shock than on the last basal trial. In 
fact, with the exception of Random Re- 
inforcement Trial Block 21-25, all 
Shock phase values shown in Fig. 1 
for Group A’ were significantly higher 
than the value for the last basal trial 
block. For Group A the initial shock 
phase value, as well as the values for 
Random Reinforcement Trial Blocks 
24, 7-11, and 12-16, were also sig- 
nificantly higher than its last basal 


signed-ranks test (Siegel 


value. Note that recovery is far from 
complete after 40 shock phase trials, 
For both groups, every value for the 
shock phases is higher than any basal 
value. In each group, the initial rise 
was manifested by 9 of 10 Ss. More- 
over, some Ss failed to show any re- 
covery. 

Figure 2 indicates clearly that expos- 
ure to shock has a much smaller effect 
upon RT for the ready signal groups. 
In fact, Group B shows virtually no 
effect; none of the shock phase values 
was significantly higher than the last 
basal value. The initial rise in RT for 
Group B' is significant (p< .05). In 
addition, the second and last shock 
phase values were significantly higher 
than the final basal value. However, 
the rise merely constitutes a return to 
the level prevailing for that group 
during the first day of basal, Not even 
the elevated value for the last five trials 
is significantly greater than the final 
five-trial value for the first day of basal. 
(For Groups A and A’, the initial 
shock phase values are significantly 
greater than any values for the first 
day of basal.) It is, of course, possible 
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TABLE 1 
P VALUES FoR RT CHANGE COMPARISONS 
BETWEEN GROUPS WITH AND WITHOUT 
READY SIGNAL 


Shock Phase 
"Trials Probability 
2-6 02 >p>.01 
7-11 ‘02 $ b > .01 
2-6R* > <.05 
7-AIR p<.0l 
12-16R p <.05 
17-21R > > .05 
22-26R b > .05 
27-30R p.05 


Note.—Where p is significant, the increase in RT 
for the groups without ready signal Was tly 
greater than for the groups with ready 

aR signifies trial blocks during the phase when 
reinforcement was random for the conditioning groups. 


that exposure to shock precluded a 
further drop in basal RT for Groups B 
and B’, since 80 trials may have been 
inadequate to reduce RT to an asymp- 
totic level. 

Intergroup comparisons provide fur- 
ther evidence that the ready signal 
reduces the RT effect of exposure to 
shock. These comparisons involved 
the changes in RT from the last basal 
trials to five trial blocks during the 
shock phase. Neither conditioning 
group was significantly different from 
its pseudoconditioning control for any 
basal-shock change. Accordingly the 
change distributions for the two groups 
with ready signal (B and B^) were 
combined and compared with the com- 
bined distributions for the groups with- 
out ready signal (A and A. Two- 
tailed p values are presented in Table 1. 

Finally, analogous comparisons be- 
tween the two conditioning groups and 
between the two pseudoconditioning 
groups reveal a number of significant 
differences, always in favor of the 
groups without ready signal. 
Group A shows a significantly greater 
slowdown in RT than Group B for the 

S All i comparisons reported in 
this miei Me Mann-Whitney U 
test (Siegel, 1956). 


first 5 trials of the conditioning phase 
(p < .05), while the slowdown for 
Group A’ as compared with Group B’ 
becomes significant during the second 
10 trials of the shock phase (p < 05 
for Trials 11-15; p < 02 for Trials 
16-20). 

Table 2 summarizes the heart-rate 
data. The paired values are the group 
mean pretone and group mean post- 
tone heart rate in beats per minute, 
respectively. Measurement of pretone 
and posttone heart rate is described in 
Notterman et al. (1952). To provide 
comparability with the conditioning 
groups, a 6-sec. posttone interval was 
selected for Groups A’ and B', though 
use of any specific posttone interval for 
these groups is more or less arbitrary. 

In Table 2 the p values for the last 
basal trials are the result of applying 
the Wilcoxon test to the differences 
between S mean pretone and mean 
posttone heart rate for these trials. 
The remaining p values were obtained 
by computing S mean heart-rate 
changes for the indicated trials of the 
shock phases and by comparing them 
with the mean changes for the last basal 
trials. The table reveals that condi- 
tioning was successful for Group A, 
but not for Group B. Even for Group 
A the typical conditioned heart-rate de- 
pression is delayed until the random 
reinforcement phase. This agrees with 
previous work (Bersh et al., 1957a ; 
Bersh, Notterman, & Schoenfeld, 
1957b), which suggested that the oc- 
currence of motor (including verbal) 
responses in the posttone interval inter- 
feres with heart-rate conditioning. 
Further support for this suggestion is 
provided by the shock phase data for 
Group A’ which show a tendency for 
the heart to accelerate when RT stim- 
ulus (tone) and shock are unpaired. 

The addition of a ready signal in- 
creases the tendency. Groups B and B’ 
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both show a significant unconditioned 
acceleration during the last basal trials 
(see Table 2). The normal uncondi- 
tioned effect of a tone by itself (i-e., un- 
complicated by the RT task) is a small 
deceleration which disappears by the 
end of a 20-trial basal series (Notter- 
man et 214 1952) RN the present 
experiment, the acceleration. for the 
two groups with ready signal persisted 
through 80 basal trials, and, in fact, 
throughout the shock-phase trials, de- 
spite the use of a conditioning pro- 
cedure for Group B. Comparisons of 
the prelight heart rate (computed for 
the two beats immediately before light 
onset) with the pretone rate reveal no 
significant differences, so that the post- 
tone acceleration cannot be attributed 
even in part to a prior deceleration to 
the light. 


Discussion 


The results clearly indicate that the 
slowing of RT observed in this experi- 
ment was à pseudoconditioning phenome- 
non. Even the marked slowdown for 
Group A appears to stem from exposure 
to shock rather than from acquisition of 
aversive properties by the RT signal. 
The slowdown was maximal at the outset 


the shock phase. Since this trial followed 
at least the first shock to each S, a back- 
ward trace interval of 30 sec. or longer 
was involved. 

In view of these findings, the condi- 
tioning interpretation of Goerk and 
Kamin (1959) is suspect. This was 
based in part upon the absence of a sig- 
nificant slowdown for à sensitization con- 
trol group which received five shocks, 


followed by five RT signals with no op- 
portunity for the RT response, and finally 
by 25 RT trials. Shock was withheld 
during the latter two phases. Experi- 
mental groups, given five buzzer-shock 
pairings, followed immediately by 25 RT 
trials, manifested significantly slower RT 
than the control group. Their control 
procedure was quite different from that 
used in the present experiment, where 
shocks were interspersed with RT sig- 
nals throughout the postbasal phases. 
Possibly the insertion of the five RT sig- 
nals between the shock series and the test 
trials established a discrimination of these 
signals as insuring against shock pres- 
entation. At any rate, it is unfortunate 
that they report only summary data. 
Trial-by-trial or trial-block data might 
have revealed an jnitial slowdown for 
the sensitization group, thus suggesting 
some effect of prior exposure to shock. 
(The mean RT for this group was some- 
what, though not significantly, higher 
than for a second, never shocked, control 
group.) 

Acquisition of aversive properties by 
the RT signal cannot, therefore, be con- 
sidered a major influence on RT in this 
experiment. However, the opposite con- 
clusion may be drawn for the ready sig- 
nal, since its use counteracts significantly 
the increase in RT resulting from expo- 
sure to shock. One possible basis for 
both the slowdown and its reduction or 
elimination by the ready signal may be 
the relationship between the RT response 
and the unconditioned effect of the shock. 
A sufficiently intense shock to the finger 
or hand produces an unconditioned with- 
drawal response (e.g. Kimble, 1961; 
Woodworth & Schlosberg, 1954). After 
exposure to shock, the withdrawal tends 
to be triggered by any sudden stimulus, 
ie, sensitization occurs (Woodworth & 
Schlosberg; 1954). It is not unreason- 
able to assume that there may be a simi- 


. This would oppose the 
key press specified as the RT response 
in the present experiment. On the other 
hand, the occurrence of a ready signal is 
accompanied by à build-up of appropriate 
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muscular preparation for quick reaction 
(Woodworth & Schlosberg, 1954), and 
the muscle-action potentials generated by 
such a signal would, accordingly, tend to 
cancel antagonistic potentials generated 
by shock sensitization, While admittedly 
speculative, this analysis has readily test- 
able implications. It implies, for exam- 
ple, that if the RT response were key 
release, exposure to shock would lower 
or at least fail to increase RT even in 
the absence of a ready signal. 

A second effect of the ready signal was 
to prevent the occurrence of heart-rate 
conditioning. This is similar to, though 
considerably more severe than, the inter- 
ference by a ready signal with eyelid con- 
ditioning (e.g., Dufort & Kimble, 1958). 
In the latter case, a CR of appreciable 
magnitude does develop. A parsimonious 
explanation for the failure to condition 
a heart-rate depression with the ready 
signal procedure is the significant uncon- 
ditioned acceleration to the light-tone se- 
quence. Perhaps a similar antagonistic 
unconditioned response tendency under- 
lies the interference with eyelid condi- 
tioning. Dufort and Rollins (1965) have 
indeed suggested that such interference 
may result from the elicitation of compet- 
ing responses by the ready signal. 
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3 probability-learning experiments were run to test predictions made 
by 2 models—one developed by Restle (1961), called the run model, 
and the other, an extension of a model proposed by Burke and Estes 


(1957), called the k-span model. 


each trial, S remembers a fixed number of the preceding events. The 
run model assumes that S remembers the run in progress—i.e., the last 
event and the number of such events that have occurred since the last 
event alternation. Although neither accounted for all of the findings, 
the predictions derived from the run model were considerably more 


accurate. 


The most frequently performed 
probability-learning experiment is a 
two-choice experiment with a simple 
noncontingent event schedule. On a 
typical trial, S predicts which one of 
two events will occur, and E indicates 
whether the prediction was correct. 
An event schedule is a simple noncon- 
tingent one if the probability of each 
event is independent of the trial num- 
ber and of the preceding sequence 
of responses and events. Goodnow 
(1958) and Estes (1964) have re- 
viewed the principal experimental stud- 
ies of this situation that were published 
prior to 1962. 

The two models most commonly sug- 
gested to describe the data from these 
experiments are an equal-alpha experi- 
menter-controlled-events linear model 
and a finite-state Markov model (see, 
eg. Atkinson & Estes, 1963; Bush & 
Mosteller, 1955, Ch. 13; Estes & Strau- 
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ghan, 1954; Friedman, Burke, Cole, 
Keller, Millward, & Estes, 1964). 
Both make a considerable number of 
predictions that are substantiated ex- 
perimentally. However, alternative 
approaches have been suggested be- 
cause these models fail to account for 
the following two properties of the 
data observed early in learning. 

1. These models do not accurately 
predict the first-order sequential effects 
observed during the initial trials of 
learning. Both predict that, if Event 
E; occurs on Trial n, the probability of 
predicting E, on Trial n + 1 will be 
not less than the probability of pre- 
dicting E; on Trial » Brown an 
Overall (1959) and Bush and Mor- 
lock (1959) report data that permit a 
direct test of this prediction. Within 
each group of Ss in these experiments, 
the event sequences were identical and 
because of this a test of the simple re- 
ward assumption can be made for 
every pair of trials: whenever E, oc- 
curs on Trial n, we expect the propor- 
tion of Ss who predict E; on Trial 
n+ 1 to be not less than the propor- 
tion predicting E; on Trial n. Of 418 
independent tests, 262 are not in the 
direction expected on the basis of the 
simple reward assumption. The prob- 
ability that the number of disconfirm- 
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ing tests will be this large is less than 
0001. 

An alternative method of analysis of 
sequential changes in response proba- 
bilities reveals more clearly the nature 
of the discrepancy between the models’ 
predictions and data from the initial 
trials in the probability-learning situ- 
ation. Jarvik (1951), Nicks (1959), 
Derks (1962), and others present their 
sequential data in the form of recency 
functions; i.e., for each Event E; they 
plot the proportion of predictions of E; 
on trials preceded by exactly k pres- 
entations of E; as a function of k. The 
simple reward assumption implies that 
this proportion is an increasing func- 
tion of k, but it has been found that 
early in learning this function increases 
to a maximum, usually at k = 2, and 
then decreases. 

2. Neither of these models can ac- 
count for the fact that Ss respond ap- 
propriately when the probability of an 
event is made conditional on the pre- 
ceding event. Hake and Hyman 
(1953), Engler (1958), and Anderson 
(1960) found that the proportion of 
trials on which Ss predicted event 
Tepetition changed in the direction 
of changes made in the first-order 
conditional probabilities in the event 
schedule. This was true even for 
schedules in which the probability 
of event repetition was less than 1/2; 
in these cases, the Proportion of E, 
predictions was larger if an E, event 
had occurred on the Preceding trial 
than it was following an E, event. 
Such a result cannot be handled by any 
model that makes both the simple re- 
ward assumption and the independence- 
of-path assumption, Further, it is 
well known that Ss can learn to re- 
spond perfectly if the event schedule 
consists of a repeating periodic se- 
quence, such as simple alternation, or 
even repetition of subsequences as long 
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as 12 (see, e.g., Galanter & Smith, 
1958; Keller, 1963; Schoonard & 
Restle, 1961). Clearly, Ss who are 
correctly predicting a repeating peri- 
odic sequence must be influenced by 
the immediately preceding pattern of 
events. Even when a simple noncon- 
tingent event schedule is used, Ss re- 
port after the experiment that they 
were searching for patterns. For ex- 
ample, in studies by Nies (1962), 
using 96 Ss, and by Jarvik (1951), 
using 57 Ss, approximately half the Ss 
reported that they had searched for a 
pattern in the event schedule. Feld- 
man (1963) instructed his Ss to tell 
him why they were making each pre- 
diction. He presents the entire proto- 
col of one S. The reasons given by 
this S indicate that he thought there 
were regularities in the schedule. 

Two models have been proposed to 
account for these findings. In each, 
the independence-of-path assumption 
is weakened, but a simple reward as- 
sumption is made. Both assume that 
Ss remember several of the preceding 
events, and that these influence their 
choices; they differ in what they as- 
sume that Ss remember. i 

The first of these models is an ex- 
tension of the simplest stimulus- 
sampling process. It was first stated 
by Burke and Estes (1957) and later 
extensively elaborated by Restle (1961, 
Dp. 109-111). This model assumes 
that on each trial S's memory spans 
the events of the last p trials. Thus, 
the stimuli from which S samples con- 
sist of the last k events plus some 
constant or background stimulation. 
Some fraction, 6, of these stimuli is 
sampled on each trial, and these be- 
come conditioned to the event that oc- 
curs on that trial. Hereafter, this 
model is referred to as the k-span 
model, 

The second model was originally 
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suggested by Hake and Hyman 
(1953, p. 72). They observed that 
“some previous events, especially ho- 
mogeneous runs of the same symbol, 
were more easily discriminated and 
consistently responded to than others.” 
This observation has been reiterated 
by Goodnow (1955, 1958), Lawlor 
(1956), and Nicks (1959); Restle 
(1961, Ch. 6) has developed a detailed 
model based on it. He assumes that 
Ss remember the run in progress—i.e., 
on each trial, they can recall what the 
previous event was and how many 
such events have been presented since 
the last alternation in the event se- 
quence. The Ss predict that the run 
in progress will either continue or end, 
and their choice probabilities are a 
function of the run structure of the 
event sequence. Restle (1961, 1966) 
has compared this model's predictions 
of sequential changes in response prob- 
abilities with those estimated from a 
variety of experiments. His model, 
which is referred to below as the run 
model, receives support from studies 
that have shown that Ss respond ap- 
propriately when the run structure of 
the event sequences is varied (Derks, 
1963; Goodnow & Pettigrew, 1955; 
Goodnow, Rubinstein, & Lubin, 1960; 
Nicks, 1959). 

Despite the success of the run model 
in handling these data, it is not pos- 
sible to reject the k-span model be- 
because that model also predicts, al- 
though for different reasons, that Ss 
will respond appropriately to vari- 
ations in the run structure. For ex- 
ample, suppose a sequence were con- 
structed with no runs of length 3516: 
a sequence in which the event four- 
tuple E,E,E,E,, or 0111, is always fol- 
lowed by 1, and 1000 is followed by 0. 
The k-span model predicts that the 
four-tuple 0111 will become condi- 
tioned to the response “1,” and 1000 to 


“0,” The following three experiments 
provide a direct confrontation of the 
k-span model with the run model. 


GENERAL METHOD 


Subjects.—Thirty-six undergraduates 
served as Ss, 12 in each experiment. 

Procedure and apparatus.—The S sat in 
a booth and viewed the event sequence 
through an open window. Each event was 
presented on a 3 X 5 in. card on which was 
printed either the number “0” or “1.” The 
cards were prearranged and stacked in a box 
visible to S. Thus, it was clear to S that 
the order of the stimuli did not depend on 
his responses. The S was told that he was 
taking part in an experiment on problem 
solving and that his task was “to predict 
correctly the number that will appear on 
each card.” He was paid $1.00 at the start 
of the experiment and told, “You will re- 
ceive 3¢ for every correct prediction and lose 
3¢ for every incorrect prediction. You need 
not worry about losing money as no student 
ever has. Keep working hard because at 
3¢ for every card it is possible to make a 
considerable amount" The S indicated his 
first prediction by saying either «0" ori" 
and E then held up the first card in the se- 
quence; this process was repeated until S 
had seen each card once. Because the box 
of cards was visible, S knew approximately 
the proportion of trials remaining at any 
given time. The experiment was paced 
by S; the majority completed all trials in 
35-45 min. There were 700 trials in Exp. I 
and III; 720 trials in Exp. IL Two 
sequences of events were generated for each 
experiment. Interchanging 0 and 1 in these 
provided two additional sequences. Each 
was presented to three Ss. 

At the end of the experiment each S filled 
out a questionnaire listing any regularities 
he had observed in the order in which he dis- 
covered them. 


EXPERIMENT I 
Method 


In this experiment the event schedule was 
constructed by modifying a simple noncon- 
tingent event schedule with r = 0.5 according 
to the following four rules :? 


2 For brevity, we refer to trials on which 
Rule 1 applies as Rule 1 trials, errors in 
prediction made on Rule 1 trials as Rule 1 
errors, etc, 
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1. No runs of length 3 occur; ie. the 
four tuple 0111 is always followed by 1, and 
1000 is followed by 0. 

2. No runs longer than 4 occur; ie, the 
five tuple 01111 is followed by 0, and 10000 
is followed by 1. 

3. The four tuple 1101 is followed by 0, 
and 0010 is followed by 1. 

4. The five tuple 11010 is followed by 0, 
and 00101 is followed by 1. On half the 
trials, none of these rules applied; in such 
cases, the next event was independent of 
preceding events, and both 0 and 1 were 
equally frequent. 

This particular process was selected be- 
cause the unconditional probability of each 
event and the probabilities conditional on the 
preceding one, two, or three events are all 
3. Thus, the S who adopts a simple strategy, 
such as always predicting one event, always 
predicting event repetition, or predicting 
event alternation, cannot be correct on sig- 
nificantly more than half the trials. The 
process generates sequences in which half 
the runs are of length 1, a quarter are of 
length 2, a quarter of length 4, and no 
other run lengths occur. 

Learning these rules requires that Ss re- 
member, at least, the run in progress, but 
learning Rules 3 and 4 requires that Ss 
also remember some events that precede the 
run in progress. Hence, the run model pre- 
dicts that Ss will learn Rules 1 and 2, but 
not 3 and 4, If S always recalls the four 
preceding events, then, according to the k- 
span model, he can learn all four rules. This 
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model does not assume that any four tuple 
is more easily remembered than another and, 
therefore, it predicts that all four rules will 
be learned at approximately the same rate, 


Results 


The data that allow the clearest com- 
parison of the two models are pre- 
sented in Table 1. Column 1 shows 
the number of Rule 1 errors made by 
each S. Columns 2, 3, and 4 show the 
number of errors made when Rules 2 
3,and 4 applied. Only the first 80 in- 
stances of each rule were considered 
so that the columns would be directly 
comparable; this procedure led to 
omission of less than 5% of the data. 
The Ss are ranked by the sum of the 
errors they made on Rule 1 and Rule 2 
trials. 

Table 1 strikingly reveals the major 
finding: each S made fewer errors 
when Rules 1 and 2 applied than when 
Rules 3 and 4 did. Further, these dif- 
ferences are not small; the average 
difference is 42.4 errors. This result 
is predicted by the run model but not 
by the k-span model. The postexperi- 
ment questionnaires also indicate that 
Rules 1 and 2 were more easily learned 


TABLE 1 


Tuple Determining Next Event in Sequence 


ERRORS IN PREDICTION AT DETERMINED Pornts IN EVENT SEQUENCE or Exp. I 


Total Errors 
S 
0111 or 1000 01111 
Wwe)" ed mue | pue 
1 4 4 25 61 
8 86 
A ? d 38 28 10 66 
1 A a 23 45 11 68 
E a ; 50 15 12 65 
P : É 24 26 14 50 
° na 36 32 18 68 
i 2 6 30 48 29 78 
j 1 12 57 24 31 81 
di 5 28 31 17 41 48 
it 46 21 47 25 42 72 
u Je 11 53 33 57 86 
Mean 18.6 33 41 31 58 72 
R 11.9 37.9 32.1 27.6 70.0 


Note.—This table is based 9n the first 80 instances or each rule, 
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than Rules 3 and 4. Specifically, 6 Ss 
stated Rule 1, 10 stated Rule 2, but 
none stated either Rule 3 or 4. 
Because these results support the 
run model, it is of interest to consider 
some of the more detailed predictions 
made by this model. First, it predicts 
that, asymptotically, Rules 1 and 2 will 
be learned perfectly. Eleven of the 12 
Ss made errors on only 7% of Rule 1 
or Rule 2 trials in the second half of 
the experiment. Many of these errors 
appeared, from Ss’ comments, to be 
due to mistakes in counting the length 
of the run in progress. Only S 12 
made an appreciable percentage of 
errors (35%) on these rules in the 
second half of the experiment. 
Approximately half the instances of 
10 and 01 were followed by Rule 3 or 
Rule 4 trials. The run model assumes 
that Ss remember only the run in 
progress and, thus, will fail to differ- 
entiate Rule 3 or Rule 4 trials from 
other trials preceded by 10 or 01. 
These predictions apply to asymptotic 
response probabilities; hence, in order 
to test them, we have examined only 
the second half of each S’s protocol. 
Correct application of Rule 3 required 
prediction of event alternation after 
0010 or 1101. We find that Ss pre- 
dicted event alternation after 10 or 01 
more often on Rule 3 trials than they 
did when no rule applied, t(11) = 48, 
p «.001. Correct application of Rule 
4 required prediction of event repeti- 
tion after 1010 or 0101. However, Ss 
predicted event repetition after 10 or 
O01 just as often on Rule 4 trials as 
they did when no rule applied. 
Several Ss reported that, after they 
had mastered Rule 2, they tried to 
identify regularities in the sequence 
that followed Rule 2 trials—i.e., that 
followed runs of length 4. Approx 
mately half the Rule 3 trials occurred 
one trial after the end of a run of length 
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4 (i.e. after 1000010 or 0111101), and 
half occurred one trial after the end of 
a run of length 2 (i.e., after 10010 or 
01101). The Ss responded correctly 
on 71% of the Rule 3 trials that fol- 
lowed a run of length 4. They applied 
this rule correctly significantly less 
often, 48% of the time, following a run 
of length 2, £(11) = 4.4, p < .01. Fur- 
ther, they predicted event alternation 
after 10 or 01 significantly less often, 
34% of the time, when no tule was 
applicable, £(11) 224, p< 055. 
similar analysis performed on Rule 4 
trials found neither of these differences 
to be significant. 

We summarize these findings as fol- 
lows: contrary to the prediction of the 
run model, Ss showed evidence of 
learning Rule 3, and they learn it most 
rapidly when it occurs immediately 
after a run of length 4. They did not 
learn Rule 4. 


EXPERIMENT IT 


Method 


A possible criticism of Exp. I is that the 
results could be peculiar to the particular 
set of rules chosen. Therefore, another set 
of rules was used to generate a second event 
schedule similar to the one used in Exp. I. 
This schedule was a simple noncontingent 
one with r — 0.5, modified according to the 
following rules: 

1. No runs of length 3 occur; ie, the 
four tuple 0111 is followed by 1, and 1000 is 


0. 
2. No runs of length 4 occur; ie, the 
five tuple 01111 is followed by 1, and 10000 


3. No runs longer than 5 occur; ie, the 
six tuple 011111 is followed by 0, and 100000 
is followed by 1. 

4, The four tuple 1100 is followed by 1, 
and 0011 is followed by 0. 

5, The five tuple 11001 is followed by 0, 
and 00110 is followed by 1. 

6. The six tuple 110010 is followed by 0, 
and 001101 is followed by 1. 

Whenever one of these six rules did not ap- 
ply the procedure used to determine the next 
event was identical to that used in Exp. I. 
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This process generates sequences in which 
half the runs are of length 1, a third are of 
length 2, a sixth are of length 5, and no 
other run lengths occur. The unconditional 
probability of each event and the probabili- 
ties conditional on the preceding one or two 
events are all 4. However, the event proba- 
bilities conditional on the preceding three 
events are either $ or à. Thus, Ss who 
adopt a strategy based on the last one or two 
events cannot be correct on significantly 
more than half the trials, However, Ss who 
adopt a strategy based on the three most 
recent events can be correct on, at most, $ 
of the trials. This schedule differs from 
that used in Exp. I primarily in that Ss must 
remember the preceding six events in order 
to respond correctly on Rule 3 and Rule 6 
trials. 


Results 


The analysis of this experiment 
closely parallels that of Exp. I, and 
our findings are similar, The k-span 
model predicts that all six rules, if 
learned at all, will be learned at ap- 
proximately equal rates, whereas the 
run model predicts that Ss will learn 
Rules 1, 2, and 3, but not the others. 
Table 2 shows the number of errors 
made on each rule by each S. As the 
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run model predicts, each S made fewer 
errors when Rules 1, 2, and 3 applied 
than when Rules 4, 5, and 6 did. The 
average difference is 38.5 errors. Fur- 
thermore, in the questionnaires eight 
Ss stated Rule 3, and, of these, five 
also stated Rules 1 and 2. Only $ 9 
stated Rules 4, 5, and 6, but, curiously, 
her responses did not indicate that she 
made use of this knowledge. 

In this experiment, Rules 1, 2, and 3 
were not learned as well as Rules 1 
and 2 in Exp. I. Eight of the 12 Ss 
appeared to have mastered Rules 1, 2, 
and 3; on the average, in the second 
half of the experiment, they made 
errors on only 10% of the trials on 
Which these rules applied. Subjects 8, 
10, and 11 learned Rules 1 and 2 ap- 
proximately as well, but did not learn 
Rule 3. Subject 12 mastered none of 
these rules. 

The run model predicts that Ss will 
not learn Rules 4, 5, and 6 perfectly. 
Subject 4 appeared to be on the way. 
to mastering these rules as well as 
Rules 1, 2, and 3; in the second half 


BLE 2 


ERRORs IN PREDICTION AT DETERMINED Ports IN EvENT SEQUENCE or Exr. II 


Tuple Determining Next Event in Sequence Total Errors 
s Prem) 
Moe | tior | onnror| inoo | nioo or | 110010 
Ott 1 1 E | Rul Ru 
Quien | Rue) | Rues | qure | gus, Quid Juzas |as &6 
1 1 3 7 26 39 
48 11 113 
4 d d 5 35 24 26 12 85 
j 15 6 9 14 19 27 42 
12 1 12 23 14 28 49 
> 1n 6 13 18 25 21 29 64 
; ie 14 10 19 29 38 32 86 
i n 13 16 42 28 24 47 94 
3 n 13 36 49 49 19 60 117 
i i 28 18 32 29 21 66 82 
i 5 18 35 35 43 19 66 97 
1 2 13 36 28 34 14 70 76 
Min 102 30 24 32 27 24 78 83 
eai 5 13.7 17.2 28.1 30.3 23.9 43.8 82.3 


Note.— This table is based on the first 60 instances of each rule, 
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of the experiment, he made errors on 
only 4% of the Rule 4, 5, or 6 trials 
that followed shortly after a run of 
length 5, and on 33% of the Rule 4, 
5, or 6 trials that followed shortly after 
a run of length 2. No other S per- 
formed as well on these rules as S 4, 
but, in general, fewer errors were made 
on them than the run model predicts. 
We now consider the overall perform- 
ance on each of these rules during the 
second half of the experiment. 

About one third of the instances of 
100 and 011 were followed by Rule 4 
trials. Correct application of Rule 4 
required prediction of event alterna- 
tion. The Ss predicted event alter- 
nation after 100 or 011 more often on 
Rule 4 trials than they did when no 
rule was applicable. Approximately 
half the Rule 4 trials occurred one 
trial after the end of a run of length 5 
(ie. after 01111100 or 10000011), 
and half occurred one trial after the 
end of a run of length 2 (ie. after 
01100 or 10011). The Ss responded 
correctly on 65% of the Rule 4 trials 
that followed a run of length 5. How- 
ever, they responded correctly signifi- 
cantly less often, 48% of the time, 
when a Rule 4 trial followed a run of 
length 2, #(11) =3.2, p < 01. More- 
over, on trials that followed 100 or 011 
and on which no rule applied, they 
predicted event alternation 41% of the 
time. This is not significantly different 
from the probability of correctly apply- 
ing Rule 4 after a run of length 2. 
Therefore, we conclude that most Ss 
learned Rule 4 only when it occurred 
one trial after a run of length 5. 

An analysis like the one above was 
performed on the responses made on 
Rule 5 trials and Rule 6 trials. There 
was no evidence that Ss learned Rule 
5 when the rule was immediately pre- 
ceded by either a run of length 5 or a 
run of length 2. 


Surprisingly, performance on Rule 
6 was much better. Correct applica- 
tion of Rule 6 required prediction of 
event repetition after 110010 or 001101. 
Half the Rule 6 trials followed three 
trials after the end of a run of length 
5, and half followed three trials after 
the end of a run of length 2. Perform- 
ance on Rule 6 following a run of 
length 5 was not significantly different 
from that following a run of length 
2. However, Ss responded correctly 
on 69% of all Rule 6 trials, whereas 
event repetition after 10 or 01 when 
no rule applied was predicted only 
5396 of the time. This difference is 
significant, t(11) = 3.55, p < .01. 

Unfortunately, the event sequences 
were generated in such a way that 
event repetition occurred on 3 of the 
trials that followed 010 or 101, and on 
3 of the trials that followed 0010 or 
1101. Hence, it is possible that, in- 
stead of learning Rule 6, Ss were learn- 
ing to predict event. repetition after 
010 and 101 or after 0010 and 1101. 
However, Ss did not predict event 
repetition after these event tuples 
when no rule applied significantly more 
often than they did after 10 or 01 when 
no rule applied. In addition, there 
was a higher percentage of correct 
responses made on Rule 6 trials than 
there was of predictions of event repe- 
tition after 010 or 101, #(11) = 4.5, P 
« .001, and more correct responses on 
Rule 6 than predictions of event repe- 
tition after 0010 or 1101, #(11) = 3.3, 
p<.02. For these reasons, we con- 
clude that Ss actually showed that they 
had partially learned Rule 6. 

To summarize: one 5 learned Rules 
4, 5, and 6. Performance on Rule 4, 
when it was immediately preceded by a 
run of length 5, and on Rule 6 was 
better than that predicted by the run 
model. Rule 5 was only learned by 
S 4. 
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Exrertment III 
Method 


The run model assumes that Ss remember 
only the run in progress, However, in Exp. 
I, Ss made fewer errors than the run model 
predicts on Rule 3 trials that followed 
shortly after a run of length 4. The same 
pattern of responses was observed on Rule 4 
trials in Exp. II, These two findings sug- 
gest that the end of a long run is a particu- 
larly well-defined point in the sequence, and 
that it is easy to identify regularities in the 
Sequence that consistently follow such a 
point. Experiment III provides a test of this 
hypothesis, as well as another comparison 
of the two models. 


A simple noncontingent schedule with 
7T—0.5 was modified according to the fol- 
lowing four rules: 

1. No runs of length 3 occur; ie, the 
four tuple 0111 is followed by 1, and 1000 is 
followed by 0. 

2. No runs longer than four occur; ie, 
the five tuple 01111 is followed by 0, and 
10000 is followed by 1. 

3. Runs of length 4 are not followed by 
runs of length 1; ie, the six tuple 011110 
is followed by 0, and 100001 is followed by 1. 

4. Runs of length 2 arepfollowed by runs 
of length 1; ie, the four tuple 0110 is fol- 
lowed by 1, and 1001 is followed by 0. 


On half the trials no rule applied, and the 
procedure used to determine the next event 
was identical to that used in Exp. I. 
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This process generates sequences in which 
half the runs are of length 1, a quarter are 
of length 2, a quarter are of length 4, and 
no other lengths occur. The unconditional 
probability of each event and the probabilities 
conditional on the preceding one, two, or 
three events are all 3, 

Rule 3 trials occurred on the trial follow- 
ing the end of a run of length 4, and Rule 4 
trials occurred immediately after runs of 
length 2. On the basis of our findings in 
Exp. I and II, we expect that, if S's learn 
either of these rules, they will learn Rule 3 
faster than Rule 4, 


Results 


Table 3 presents the number of 
errors made on each rule by each S. 
These results are not in agreement 
with either model, The k-span model 
predicts that all four rules will be 
learned equally rapidly ; obviously, this 
did not occur. 

The run model predicts that Rules 1 
and 2 will be learned perfectly, but 
that Rules 3 and 4 will not. However, 
performance on Rule 3 was better than 
performance on any other rule. By 
the end of the experiment, all Ss were 
making only an Occasional error on 
Rule 3 trials; the average error rate 
on Rule 3 trials in the second half of 


Tuple Determining Next Event in Sequence Total Errors 
S 
0111 or 1000 01111 or 1i 
(Rute) Chale” argo! | 0110 or 100 reo Ra 
1 5 11 24 52 
16 76 
E A 8 16 44 17 60 
H A 10 11 40 19 51 
EXE. 2 10 12 29 23 41 
H fi s 11 42 29 53 
7 36 1 8 42 36 50 
$ do 1 12 38 47 50 
ai an 18 13 50 61 63 
i 1 29 19 50 63 69 
n 2 21 22 50 68 72 
17 3 26 23 49 76 72 
Mea 2 64 19 49 82 68 
n 3 18.5 15.8 44.6 44.8 60.4 


Note.—This table is based on the first 80 instances of each rule, 
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EVENT RUNS IN PROBABILITY LEARNING 


the experiment was 10%. Perform- 
ance on Rule 2 was only slightly worse 
than on Rule 3; however, performance 
on Rule 1 was considerably worse than 
on either Rule 2 or Rule 3. Near the 
end of the experiment errors were 
made on approximately half the Rule 1 
trials by Ss 7, 8, 10, and 11 and on 
approximately half the Rule 2 trials 
by Ss9 and 12. The poor performance 
on Rule 1 is surprising in view of the 
success with which the run model 
identified the rules that were most 
rapidly learned in Exp. I and IT. 

The data support our conjecture that 
it is easier to identify a regularity that 
consistently follows the end of a long 
run than one that follows a short run. 
Each S made fewer Rule 3 errors 
than Rule 4 errors. Only Ss 2 and 4 
appeared to be on the way to learning 
Rule 4 by the end of the experiment. 
The others made errors on over half 
the Rule 4 trials in the last half of the 
experiment. 


DISCUSSION AND CONCLUSION 


We have considered several models that 
have been proposed for the probability- 
learning situation. On the basis of pre- 
viously existing evidence, two models that 
embodied the independence-of-path as- 
sumption were rejected. The experi- 
ments reported here were designed to test 
the adequacy of two models that have 
been proposed more recently and that do 
not assume path independence. 

The k-span model assumes that, on 
each trial, S remembers the last k events. 
Each of our experiments indicates that 
this is not an adequate characterization 
of S’s memory. 

The run model assumes that Ss remem 
ber only the run in progress. This mode 
generates predictions that are consider- 
ably more accurate than the correspond- 
ing predictions generated by the k-span 
model, In particular, the run model ac- 
counts for the major findings of Exp. I 
and II—i.e., Ss learn simple modifications 
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of the run structure much more easily 
than other modifications of the sequence 
that require the same memory length for 
correct prediction. Furthermore, the re- 
sponses to our questionnaires indicate 
that almost all Ss characterized the se- 
quence as a succession of runs of events. 

However, our data indicate that the 
run model’s memory assumption is not 
adequate. Under some conditions, Ss re- 
member runs that preceded the run in 
progress. In Exp. I, Rule 3 was partially 
learned; in Exp. II, Rules 4 and 6 were 
partially learned; and in Exp. III, Rule 
3 was learned. To apply any of these 
rules correctly, Ss must remember at 
least the run that preceded the run in 
progress. Although we cannot specify 
precisely the conditions under which the 
preceding run is remembered, we can 
conclude that such a run is more likely 
to be remembered if it is long. This is 
based on the following findings: Rule 3 
of Exp. I and Rule 4 of Exp. II were 
learned particularly well when they were 
applicable immediately after long runs; 
in Exp. III, performance was much bet- 
ter on Rule 3 trials, which always fol- 
lowed runs of length 4, than on Rule 4 
trials, which always followed runs of 
length 2. 

A second inadequacy in the run model's 
memory assumption is the absence of any 
mechanism for the confusion of two pat- 
terns of events. It was clear that such 
confusion did occur in Exp. II. Towards 
the end of the experiment, several of the 
faster learners made an occasional error 
on Rule 3 (i.e., 011111 is followed by 0, 
and 100000 is followed by 1), and often 
these errors were accompanied by com- 
ments such as, “I must have miscounted.” 
The same pattern of errors and com- 
ments occurred, but less frequently, in 
Exp. I and III. 

It now is clear that a model that ade- 
quately describes human probability- 
learning data will not only include learn- 
ing axioms, but will also involve assump- 
tions that deal with short-term memory 
and simple coding or chunking. The run 
model makes such assumptions, and it has 
been shown both here and by Restle 
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(1961) that these assumptions are a step 
in the right direction. Nevertheless, they 
fail to describe all our findings, and it 
appears that a more adequate model will 
require more complex assumptions. 
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RESPONSE TIMES WITH A MEMORY-DEPENDENT 
DECISION TASK* 


RAYMOND S. NICKERSON 
Decision Sciences Laboratory, L. G. Hanscom Field, Bedford, Massachusetts 


4 experiments were conducted to determine the time required to make 
some simple memory-dependent decisions. S’s task was to decide 
whether any of the items of a memorized check list were contained in 
a visually displayed search list, and to register his decision as quickly 
as possible by pressing 1 of 2 response keys. RT varied directly both 
with the number of items in the check list and the number in the search 
list, and inversely with the number of items common to both lists. 
Practice reduced RT across conditions, and it also decreased, but did 
not eliminate, the effects of the independent variables. Decreases in 
RT with practice were accompanied, in most cases, with increases in 


the frequency of errors. 


How does the search of a memory 
representation of a visual display com- 
pare with the search of the visual dis- 
play itself? Suppose that S is given 
the task of deciding whether two 
visually presented sets of characters 
have any items in common. If one 
set is presented only after the other 
has been erased, then the task is mem- 
ory dependent and may be described 
either as that of determining whether 
any of the items in the memory repre- 
sentation are included in the visual 
display, or, conversely, as that of de- 
ciding whether any of the displayed 
items are included in the memory 
representation. 

Consider in particular two contrast- 


1 This report is identified as ESD-TR-65- 
414 of the AF Electronic Systems Division, 
Air Force Systems Command. This re- 
search was performed at the Decision Sci- 
ences Laboratory as part of Project 7682, 
Man-Computer Information Processing and 
was reported in abbreviated form at the 
American Psychological Association Conven- 
tion in September 1965. Further reproduc- 
tion is authorized to satisfy the needs of the 
United States Government. 

The author wishes to thank Alice Gilooly 
for her assistance in collecting and organiz- 
ing data and Saul Sternberg for several 
helpful criticisms of a draft of this report. 
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ing instances of this task: (a) a single 
item in memory and several on the dis- 
play, and (b) several items in memory 
and one on the display. The two tasks 
are formally similar; in both cases S 
must determine whether a single item is 
included in a set of items. We might 
say that the task is to search a set for 
the occurrence of a specified item. In 
one case, however, the area of search is 
recent memory; in the other, it is 
an externally available visual dis- 
play. The question of interest is how 
the search process may differ depend- 
ing on the area of search, The ex- 
periments described in this report were 
addressed to a particular aspect of this 
problem, viz., the time required to 
perform the decision task. 


GENERAL METHOD à 


Task—On each trial S was shown two 
sets of letters on a visual display. De task 
was to decide whether any of the first set of 
letters (check list) were included in the 
second set (search list), and to register his 
decision as quickly as possible by pressi 
one of two response keys. The offset ol 
the check list preceded the onset of the 
search list by 1 sec. 

Stimuli—The stimuli were the 16 English 
consonants B, C, D, F, H, J, K, L, N, P, R, 
S T, W, X, Z. The letters were block 
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capitals, constructed from the elements of a 
5X7 dot matrix, vs in. in height, and 
were displayed on a 14 in. cathode ray tube 
(CRT) which S viewed from a line-of-sight 
distance of approximately 3 ft. 

General procedure—The experiments were 
run under the control of a PDP-1 com- 
puter coupled with a typewriter, a CRT, and 
a pair of telegraph keys. Prior to each ex- 
perimental run E could Specify, via type- 
writer input, the following: experimental 
Parameters: the number of items in the 
check list (c), the number of items in the 
search list (s), the probability that the two 
lists would contain at least one item in com- 
mon ($), and the maximum number of items 
that the two lists could have in common 
(m). The latter, of course, could not be 
greater than the number of items in the 
smaller of the lists, but it could be less if E 
so specified. The Probability that the two 
lists would have exactly b items in common 
was p/m for 15 b - m. To clarify the last 
point, if c, s, and m were all set to 2 and $ 
to .5, then the probability that the two lists 
would have 0, 1, or 2 items in common would 
» respectively, 

The computer was programmed to execute 
the following procedure on each trial: (a) 
randomly select c letters for the check list 
from the 16 Possible, (b) determine by 
simulating the toss of a coin (with bias $) 
whether the search list was to contain any 
items in common with the check list, (c) 
if the toss came up “heads,” determine, by 
drawing a random number, 
ie, how many items the two lists were to 


(9) present the h li 
CRT mt search list on the 


ruled just often enough to satisfy the de- 
sired constraint. In each of the experiments 


* 
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described in this report, the option was exer- 
cised, and the segment over which the rela- 
tive frequencies were forced was 16 trials, 
For example, consider again the case in 
which c, s, and m all equal 2 and p equals .5. 
In each successive block of 16 trials, 8 pairs 
of lists had no items in common (b=0), 4 
had 1 item in common (b=1), and 4 had 
both in common (b—2). Within this con- 
straint, the order of occurrence of the differ- 
ent b values was determined by a pro- 
grammed random number generator. 

On each trial the check list was displayed 
for 4 sec. in a horizontal row, approxi- 
mately 2 in. directly above the center of the 
CRT. (The CRT had a standard P7 screen 
with a short persistence blue fluorescence and 
a long persistence yellow phosphorescence. 
A blue pass filter masked out the yellow 
component completely; the blue component 
decayed to 2%, not visible, of its peak 
intensity within 200 usec, following intensi- 
fication.) The Search list appeared, also in 
a horizontal row, approximately 2 in. below 
center, 1 sec, following the offset of the 
check list; it remained illuminated until 5 
Pressed a response key, The intertrial in- 
terval, ie, the time from a response to the 
onset of the following checklist, was con- 
Stant at 1 sec. A single experimental ses- 
sion consisted of 64 trials and usually took 
Toughly 8 min. to complete, 

At the end of each session the computer 
Produced a punched Paper tape trial-by-trial 
record of that 
concerning the values of the experimental 


Experiment I 


_ The purpose of Exp. I was to inves- 

tigate the effects of three variables: 
(a) the number of items in the check- 
list, c, (5) the number of items in the 
search list, s, and (c) the number of 
items common to both lists, b. Tt was 
of particular interest to make compari- 
Sons of the type c = x, s = y vs. C = y, 
3-4. Hence, both c and s assumed 
the values 1, 2, and 4, each of the nine 
Possible combinations constituting a 
different experimental Condition. 


RT IN A MEMORY DEPENDENT DECISION TASK 


Method.—Under each condition p was .5, 
and m was equal to the number of items in 
the smaller of the two lists. The relative 
frequencies of occurrence of the possible 
values of b were constrained as explained in 
the description of the general procedure. 

Twenty-one female college undergraduates 
participated in the experiment, their dis- 
tribution over conditions being dictated by 
the coincidence of $ and computer availabil- 
ity. Most Ss participated in several, but 
not all, of the nine conditions. An effort was 
made to overlap as much as possible the 
particular sets of conditions in which differ- 
ent Ss participated and to vary the order 
in which they encountered the different 
conditions. 

Twelve runs (2 apiece for each of 6 Ss) 
were conducted under each condition. In 
each case 3 Ss used the dominant hand, and 
3 the nondominant, to press the "yes" key. 
Any given S used the same hand to indicate 
“yes” in each of the conditions under which 
she ran. 

Results.—Table 1 shows RT (means 
of RTs pooled over Ss) and RFE 
(relative frequency of errors) for each 
c-s-b combination. All RTs in this 
report represent correct responses only. 
RFE represents the ratio of the num- 
ber of erroneous responses with a par- 
ticular c-s-b combination to the total 
number of times that combination oc- 
curred. 
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It is clear that RT increased both 
with the number of items in the check 
list and with the number of items in the 
search list. Moreover, with the excep- 
tion of the condition c=s=2, for a 
given check list-search list combina- 
tion, RT varied inversely with the 
number of items common to the two 
lists. 

It may be noted that for a given 
value of b, RT was at least as sensi- 
tive to differences in c as to differences 
ins. However, a graph showing RT 
as a function of c with s as the param- 
eter would not differ greatly in appear- 
ance from one showing RT as a func- 
tion of s with c as the parameter. RT 
was more sensitive to the number of 
pairwise comparisons that could be 
made between check list and search list 
(c X s) than to the particular distribu- 
tion of items between the two lists. 

In this experiment, as in any yes-no 
choice situation, two types of errors 
could occur: the response “yes” when 
it should have been “no” (false posi- 
tives) and the response “no” when it 
should have been “yes” (false nega- 
tives). Generally, the two would not 
be considered equally probable; their 


TABLE 1 


RT (MsEc.) AND 


RFE (RELATIVE FREQUENCY OF ERRORS) UNDER 
EacH c-s-b COMBINATION 


0 1 2 3 4 

Cee ee Le ME GREET a 

. “no” tyes” 
RT RFE RT RFE RT RFE RT RFE RT RFE 

1 1 348 | (.02) | 488 | (-04) 
1 2 620 (04) 572 ibd 
15 85 | 0s} | eo | (62) 
2 | 2 15; ton 789 et 677 | (.06) 
2 | 4 | 95 (09 al (43) 677 | (.03) 
4 1 751 | (02. 5 Pa 
4 > | 1020 | C04) | 820 | (12) | 677 (4 
4 4 | 1244 (04) 1001 C5) 359 | (05) | 717 | C02) | 651 | COD) 
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relative frequencies of occurrence 
would be expected to vary both with 
the physical aspects of the decision en- 
vironment and with strategic and criter- 
ial variables which are dependent in 
Part on the costs and values associated 
with the possible decision outcomes, 
In this experiment false negatives out- 
numbered false positives by about 2 to 
1, their relative frequencies being .06 
and .03, respectively. As an examina- 
tion of Table 1 will reveal, this differ- 
ence is largely attributable to the rela- 
tively high incidence of false negatives 
which occurred when a multiitem 
search list contained only a single item 
in common with a multiitem check list, 
Apparently Ss’ Propensity to say “no” 
when they should have said “yes” was 
rather sharply increased when the sit- 
uation allowed the Possibility of more 
than one point of correspondence be- 
tween check list and search list, but 
only one in fact obtained. More gen- 
erally, it appears from Table 1 that in 
those cases in which there existed the 
possibility of multiitem correspondence 
between the two lists, the relative fre- 
quency of false negatives was inversely 
related to the degree of correspondence 
(number of common items) which 
actually occurred, 


ExPERIMENT II 


relatively Prolonged practice on both 
the least and the most difficult of the 
conditions studied in Exp. I. 


of 4 Ss (female college 
undergraduates), none of whom had partici- 


1 run per day for 22 
days under each of the 2 conditions: 
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Fic. 1. Mean response time as a function y 
of number of day’s practice, for both “yes” 
and “no” responses, and one-item and four- | 
item lists, | 
| 
assumed each of the values 1, 2, 3, and 4 
exactly twice in each successive block of 16 
trials. Two Ss used the dominant hand to 
indicate “yes” and two the nondominant. 

Results—Figure 1 shows RT for both - 
“yes” and “no” responses and both check 
list-search list combinations as a function 
of days of Practice, each point representing . 
a mean of RTs pooled over Ss. RT de- 
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Fic. 2. Mean response time as a function 
of the number of items common to the two 
four-item lists, with leve] of practice as the * 
parameter. 
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RELATIVE FREQUENCY OF ERRORS 


FALSE ___ 
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Fic. 3. Relative frequency of errors as a 
function of the number of items common to 
the two four-item lists, with level of practice 
as the parameter. 


creased with practice, at least over the first 
few days, under both experimental condi- 
tions. The decrease was markedly greater 
and continued over a longer period of time 
in the case of the more difficult task; the 
initially large differences between RTs ob- 
tained with the one vs. the multiitem lists 
had diminished greatly by the last few days 
of practice. Although the possibility that 
differences would have decreased even more 
with additional practice cannot be ruled 
out, the relative stability of performance over 
the last few days suggests that detectable 
differences would have persisted from some 
time, if not indefinitely. Under both of the 
check list-search list conditions, no RTs 
were consistently longer than "yes RTs. 

Figure.2 shows RT as a function of the 
number of items common to both of the four- 
item lists at various stages of practice. The 
results corroborate those of the prior ex- 
periment in that RT tended to vary inversely 
with the number of items common to both 
check list and search list (b); however, it 
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is clear that practice was effective not only 
in reducing RT in general, but also in de- 
creasing the effects of this variable. 

Error rates with the single-item lists were 
low (the relative frequency of false negatives 
and false positives being about .04 and .03, 
respectively) and relatively constant over 
practice. Figure 3 shows the relative fre- 
quency of errors for each number of items 
common to the two lists at different stages of 
practice with the four-item lists. As in the 
prior experiment, false negatives were the 
more common type of error and their fre- 
quency of occurrence was inversely related to 
the degree of correspondence between the 
two lists. Moreover, not only did the fre- 
quency of errors increase with practice, 
but the increase in the frequency of false 
negatives was considerably greater than the 
increase in the frequency of false positives, 
and the increase appears to have been some- 
what greater for b equal to 1 or 2 than for 
b equal to 3 or 4. 


EXPERIMENT III 


Method—The same four Ss who partici- 
pated in Exp. II had two runs apiece under 
each of the conditions of Exp. I, except those 
conditions which comprised Exp. II. 


Results.—Table 2 shows mean RT 


and RFE for each of the c-s-b combi- 
nations used. The practice received in 


TABLE 2 


RT (Msec.) AND RFE (RELATIVE 
FREQUENCY OF ERRORS) UNDER 
EacH c-s-b COMBINATION 


0 1 2 
e| s J | 
‘no’ "yes" 
RT|RFE|RT | RFE| RT RFE 
2 |429|(.06) | 366 | (.11) 
1| 4 |524|(.08) | 449 | (.13) 
2 | 1 | 433} (.06)| 386 (.09) 
2| 2 |440|(.07)| 425 (.22) | 368 | (.04) 
2| 4 |536|(13)| 475 (.35) | 434 | (.12) 
4 | 1 |455|(-15)| 407 (.18) 
4 | 2 |493|(.09)| 474 (.37) | 399 | (.14) 
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Exp. II was apparently effective not 
only in substantially reducing RT 
across the conditions of Exp. III, but 
also in decreasing the magnitude of 
the effects of the experimental vari- 
ables. However, the relatively high 
error rates shown in Table 2, as com- 
pared to those in Table 1, suggests that 
the decrease in RT was realized at the 
expense of a proliferation of errors, 
particularly in the case of two multi- 
item lists. 


ExrPERIMENT IV 


Method.—Experiment IV was similar to 
Exp. II except that the two conditions used 
were c— s— land c=4, s—1. The intent 
was to provide prolonged practice on two 
conditions which differed only with respect to 
their demands on memory, hence the pairing 
of a single-item search list with both a 
single-item and a multiitem check list. 

Each of 4 Ss (female college undergrad- 
uates) received one 64-trial run per day 
under both conditions. Two Ss used the 
dominant hand, and two the nondominant, 
to indicate “yes.” One of the Ss in each of 
the “dominance” categories received the runs 
in the order single-item check list, multi- 
item check list; the other S in each case 
received them in the reverse order. Runs 
were conducted on as nearly consecutive days 
as work schedules and computer availability 
would permit. The experiment had to be 
terminated after 17 days, when Ss, who 
were summer employees of the laboratory, 
were lost to the beginning of the fall 
semester. 


Results —RT decreased with prac- 
tice under both conditions. With the 
single-item check list, mean Positive 
RT went from 454 msec. on the first 
day to 304 msec. on the seventeenth ; 
mean negative RT dropped from 494 
msec, to 361 msec. With the four-item 
check list, mean positive and negative 
RT went from 550 msec. to 412 msec, 
and from 581 msec. to 461 msec., re- 
spectively. It may be noted that with 
the single-item lists, RTs obtained on 
the first day of practice were consider- 
ably shorter than those obtained during 
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the first day of practice under the same 
conditions in Exp. II (Fig. 1). This 
may be attributed, at least in part, to í 
the fact that all four Ss of this experi- ^ 
ment had had at least one run in a 
similar experiment and two had had | 
several; whereas, none of the Ss in 
Exp. II had had prior experience with 
the task. 

For both “yes” and “no” responses 
RTs were roughly 100 msec. greater 
with a four-item than with a one-item 
check list over the entire course of the 
experiment. In addition as in Exp. 
II, "no" RTs were consistently greater 
than “yes” RTs, even when both check 
list and search list contained only a 
single item. The latter result seems 
somewhat surprising since a determina- 
tion that two things are identical would 
seem to require a more thorough per- 
ceptual analysis than the determination 
that they differ. The finding is not 
unique, however, to this study, but has 
been obtained in subsequent experi- 
ments, some of which were designed 
specifically to test it (e.g., Nickerson, 
1965). It has also been observed inde- 
pendently by Sternberg (1963), when 
the relative frequency of positive and 
negative cases was equal. 

RFE with the one-item check list 
was about .06, and was quite stable 
over the course of the experiment. 
False negatives and false positives oc- 
curred with almost exactly the same 
frequency. With the four-item check 
list, the error rate increased from .09 
during the first four sessions to .15 
during the last four; the relative fre- 
quency of false negatives and false 
positives was .15 and -11, respectively. 


Discussion 


The results confirm the importance of 
each of the three variables, the number of 
items in the check list (c), the number 
in the search list (s), and the number in 
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common to the two lists (b), as deter- 
minants of the time required to perform 
the decision task. Moreover, at least 
with relatively unpracticed Ss (Exp. I), 
RT was at least as sensitive to differ- 
ences in c as to differences in s; i.e., the 
time required to perform the task was as 
much affected by what S had to look for 
as by what she had to look af. Error 
rates were particularly high when a 
multiitem search list and a multiitem 
check list had only a single item in com- 
mon. Practice was effective in decreas- 
ing RT across conditions, and in attenu- 
ating, but not eliminating, the effects of 
the variables c, s, and b; however, de- 
creases in RT were in most cases accom- 
panied by sizable increases in the fre- 
quency of errors. 

Neisser (1964; Neisser, Novick, & 
Lazar 1963) has conducted a variety of 
experiments with a list-searching task to 
study certain aspects of the analysis of 
visually presented information. He has 
described character recognition as a 
multilevel process in which the outcomes 
of rudimentary analytic operations (e.g. 
detection of basic distinctive features 
such as angularity, curvature, slopes of 
lines) feed into higher order systems 
which in turn perform more complex 
recognitive functions (eg., identification 
of letters as such). On the basis of his 
finding that after several days of practice 
Ss can scan through an array of alpha- 
numeric “context” characters, looking for 
the occurrence of any of 10 “target” 
characters, at about the same rate and 
with the same degree of accuracy as 
when looking for only one, he has con- 
cluded that at least at the level of feature 
detection several operations may occur 
simultaneously, or in parallel. 

The present results clearly do not sup- 
port the notion that S can search an 
array for any of several characters as 
efficiently as he can search for a single 
one, even after fairly extensive practice. 
Among the procedural differences be- 
tween the present experiments and Neis- 
ser’s, one seems particularly likely to 
have been the source of the difference in 
results. In Neisser’s procedure, mutu- 
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ally exclusive context and target char- 
acter sets were defined prior to the ex- 
periment, the target characters for any 
particular experimental run being chosen 
from the latter set. No character served 
as both target and context in the course 
of the same experiment. In the present 
case the set of characters comprising the 
check list varied from trial to trial, and 
characters included in the check list on 
one trial were certain to occur as non- 
critical characters in the search list on 
subsequent trials. One tentative conclu- 
sion that may be drawn from a compari- 
son of the results of the two sets of ex- 
periments is that the possibility of paral- 
lel processing as hypothesized by Neisser 
is contingent not only on sufficient prac- 
tice with the task, but also on the use of 
a paradigm which requires activation of 
the same set of feature-detecting opera- 
tions trial after trial. 

Sternberg (1963, 1964) has presented 
evidence favoring the hypothesis that 
when S must decide whether or not a 
stimulus is one of several represented in 
memory: (a) the items represented in 
memory are checked serially, one at a 
time, and (b) the search of memory is 
terminated only after all relevant items 
have been checked, ie, it is not termi- 
nated as soon as the target item is found. 
The present experiments were not de- 
signed to be a test of Sternberg's hy- 
pothesis, and in fact were conducted 
without knowledge of his work. Nu- 
merous procedural differences dictate 
caution in a direct comparison of results. 
Nevertheless, the present findings clearly 
are relevant to Sternberg's hypothesis, 
and should be assessed in terms of it. In 
general, they are consistent with a serial- 
search hypothesis, whether the area of 
search is considered to be a visually dis- 
played array, or a memory representation 
of such an array. 

With respect to the hypothesized ex- 
haustiveness of the memory search, the 
most salient aspect of Sternberg's results 
was his finding that with a single-item 
search list the increase in RT per check- 
list item was about the same for positive 
as for negative instances. (If the search 
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were exhaustive only when it logically 
was required to be, ie, when the object 
of search was not in the check list, one 
would expect an increase in the number 
of check-list items to have a greater ef- 
fect on RTs associated with negative re- 
sponses than on those associated with 
positive responses.) The present results 
bear out Sternberg’s findings in this re- 
gard. In Exp. I, with a single-item 
search list, the difference between posi- 
tive (b=1) and negative (b—0) RT 
did not increase with the number of items 
in the check list. This result was corrbo- 
rated in Exp. III and IV. The latter 
case is particularly noteworthy since the 
difference between "yes" and "no" RTs 
was as great with a single-item check list 
as with a multiitem check list even after 
several days of practice with the task. 
The data for s>1 are more difficult to 
relate to Sternberg's hypothesis, since the 
latter is concerned explicitly with the 
Situation in which several items in mem- 
ory must be examined for the occur- 
rence of a single item on the display. 
The meaning of exhaustiveness is less 
straightforward when both the display 
and the memory representation contain 
several items. What the present results 
do indicate is that with respect to the 
number of pairwise check list-search list 
comparisons that can be made exhaus- 
tiveness is fairly conclusively ruled out, 
The fact that for a specific number of 
items in each list RT varied inversely 
with the number common to both lists is 
Suggestive of a search that is terminated 
as soon as a matching pair of items is 
found, ie, is not 
not be. 


but had only one item in common sug- 
gests that the decision that there is not 
à common item is sometimes made on the 
basis of less than an exhaustive compari- 
Son of the lists, i.e., that the Search is not 
always even as exhaustive as it should be, 
both error rate and RT would 
be expected to relate directly to the diffi- 
culty of the decision task; however, for 
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any given situation there must exist some 
trade-off relationship between time and 
errors. The “exchange rate” would be 
expected to depend on the specific experi- 
mental conditions, and the particular 
compromise struck to reflect the relative 
importance ascribed to the speed of deci- 
sions as opposed to their accuracy. In 
these experiments, Ss were given the 
somewhat ambiguous instructions to be 
as fast as possible, while trying not to 
make errors. No effort was made to 
relate time and errors to an explicit 
payoff structure. However, if it may be 
assumed that in the absence of explicit 
instructions, S' establishes a time-error 
compromise that reflects his own subjec- 
tive values, the increase in error rate 
which in most cases accompanied the de- 
crease in RT suggests that the practice 
effect was due, to some degree at least, 
to a change of emphasis on the part of S, 
ie, an increasing stress on speed accom- 
panied by a relaxation of the demand for 
accuracy. The distribution of false nega- 
tives over the parameter b suggests a 
search strategy which might be described 
by the rule: “Search until either (a) a 
matching pair is found, or (b) a crite- 
rion is reached for deciding that there 
arè no matching pairs.” Relaxation of 
the criterion for saying “no” would not 
only result in a decrease in both positive 
and negative RTs but would lead to dif- 
ferential increases in false negatives over 
the parameter b similar to those obtained 
(Table 2, Fig. 3). As stated the rule 
has the obvious limitation that it makes 
no allowance for the occurrence of false 
Positives; it does not, however, preclude 
the possibility that factors in addition to 
the hypothesized one also were instru- 
mental in the production of errors, It 
seems likely, for example, that interitem 
confusions of the sort described by Con- 
rad (1964) and Wickelgren (1965), and 
mistakes at the level of the motor re- 
Sponse (i.e., instances in which S decides 
Yes" and erroneously presses the "no" 
key, or vice Versa) contributed in some 
measure to the determination of the error 
rates obtained. At any rate, in toto the 
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results make it clear that any model 
which is to be reasonably descriptive of 
the processes underlying the performance 
of this type of task must explicitly take 
into account the trade-off between speed 
and errors. 
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UNIQUENESS OF PERCEIVED HUES INVESTIGATED WITH 
A CONTINUOUS JUDGMENTAL TECHNIQUE! 


CHARLES E. STERNHEIM AND ROBERT M. BOYNTON 


University of Rochester 


quantal methods, with a high degree of reliability. Red, orange, yel- 
low, and green were evaluated on the basis of criteria established for 


The Controversy over the number of 
psychologically unique hues existed in 
the beginning of the Present century 
(Boring, 1942). Those investigators 
who believed that there were only four 
unique hues (red, yellow, green, and 
blue) assumed monochromatic light to 
be perceived as having one, or a com- 
bination, of these hues, Other investi- 
gators have argued that the experience 
of hue associated with any spectral 
band is just as unique as that associ- 
ated with any other. That the prob- 
lem still exists today is clearly reflected 
in the choice of color response cate- 
gories in those experiments investigat- 
ing the relationship between mono- 
chromatic light and hue, as revealed by 
the color-naming response. Thomson 
(1954) allowed his Ss to select one 
Tesponse from among the following 
categories: red, orange, yellow, green, 
blue, and combinations of neighboring 
colors. Beare ( 1963) limited the judg- 


1This research was Supported by Grant 
NB-00624 from the National Institute of 
Neurological Diseases and Blindness, United 
States Public Health Service, 

The da in this report were 
Presented at the Spring, 1965 meeting of the 
Association, Atlantic 
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mental categories in her experiment to 
violet, blue, green, yellow, orange, and 
red. The initial Tesponse was often 
followed by an unrestricted qualifying 
response if S thought it resulted in a 
better description of the perceived hue. 
Boynton, Schafer, and Neun (1964) 
developed a Tesponse procedure in 
which information about primary and 
Secondary subjective components asso- 
ciated with monochromatic light was 
retained. They felt that the spectrum 
could be described adequately with the 
color response names of red, yellow, 
green, and blue, 

In the present experiment interest 
Was restricted to the hues associated 
with the long Wavelength part of the 
spectrum, A continuous judgmental 
technique was developed based upon 
Metfessel’s (1947) “Constant Sum” 
method. Three criteria for the unique- 
ness of a perceived hue were estab- 
lished. These criteria, discussed at the 
end of the Procedure section, were used 
to evaluate the color Tesponse orange, 
in addition to red, yellow, and green. 

he orange Category was chosen for 
investigation on the assumption that its 
frequency of usage is equal to that of 
the other categories with which it was 
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compared. Chapanis (1965) reported 
that when Ss were asked to select 
Munsell colored chips corresponding to 
different color names, they were at 
least as consistent in their selection of 
orange chips as they were in their se- 
lection of red, yellow, and green chips. 


METHOD 


Apparatus—The apparatus consisted of a 
Bausch and Lomb grating monochromator 
and a Maxwellian view optical system. The 
S’s head was held in place with the help of 
a dental bite. The test stimulus, lasting 300 
msec., was a 3° diameter flash of light con- 
centric with the fovea. The spectral range 
investigated was from 530 to 620 nm., in 10 
nm. steps. Test flashes of different wave- 
length were equated for luminance by a 
heterochromatic matching technique. All 
stimuli were presented at 1,000 trolands. The 
apparatus is described in more detail in a 
previous publication (Boynton et al., 1964). 

Subjects—A total of nine Ss were tested, 
including five male graduate students, and 
two female undergraduates at the University 
of Rochester. The remaining two Ss, one 
male and one female, were high-school 
students. 

Procedure—There were four sessions for 
each S, each of which was performed on a 
separate day. Sessions differed only in the 
color-response categories that were per- 
mitted on that day. These were as follows: 
Day 1—Red, Green, and Blue (R,G,B) ; 
Day 2—Red, Yellow, and Green (RY,G); 
Day 3—Red, Orange, and Green (R,0,G) ; 
Day 4—Red, Orange, Yellow, and Green 
(R,O,Y,G). Since it was felt that the order 
in which yellow and orange became avail- 
able (on the second and third day) might 
influence Ss’ responses on the final day, when 
both categories were available, this order 
was counterbalanced over Ss. 4 

Each session was preceded by 5 min. of 
dark adaptation. A session consisted of eight 
presentations, selected randomly, of each of 
10 wavelengths. There was a time int 
of 15 sec. between each presentation. An S 
was asked to consider the total amount of 
color in the sensation, including white, as 
being represented by a value of 100. He was 
then instructed to assign numerical values 
to the response categories that he was al- 
lowed to use, "in relation to that proportion 
of the total of 100 that seems to consist of 
the sensation appropriate to these cate- 
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gories.” It was made clear to S that he was 
not required to have the total of his responses 
add to 100, since it was not to be presumed 
that all response categories, which he felt 
were required for an adequate description of 
what was seen, would be allowed. Two Ss 
seemed to ignore saturation differences. 
This, however, did not differentially effect 
their judgments of the presence of different 
hues. The criteria used to assess whether 
a given hue is, or is not unique, were the 
following: (a) The color category should 
be used with a relatively high degree of reli- 
ability; (b) The color function (of wave- 
length) should reach a maximum in a 
region where neighboring hues are at a mini- 
mum; (c) A unique hue, X, associated with 
a particular spectral region should not be 
represented in the permitted response cate- 
gories in a session where the color category 
associated with X has been eliminated. In 
such a session it should be possible to com- 
pute the missing color function. 


RESULTS 


Color-naming functions for each of 
the four sessions are shown for repre- 
sentative Ss in Fig. 1-5. The mean 
of eight responses are plotted as a 
function of wavelength for each color 
category available in a particular ses- 
sion. Computed functions (CF) for 
prohibited categories were obtained by 
subtracting the sum of the ratings for 
each wavelength from corresponding 
sums in the four session when all cate- 
gories were available. 

In Fig. 1 data for the first session 
are plotted separately for two Ss, who 
are both representative of the total 


2 Two tables giving individual data for all 
subjects have been deposited with the Amer- 
jcan Documentation Institute. Table A con- 
tains mean response values as a function of 
wavelength. Table B contains the reliability 
of the mean of eight judgments and the re- 
liability of single judgments. Order Docu- 
ment No. 9014, from ADI Auxiliary Pub- 
lications Project, Photoduplication Service, 
Library of Congress, Washington, D. C. 
20540. Remit in advance $1.25 for micro- 
film or $1.25 for photocopies and make 
checks payable to: Chief, Photoduplication 
Service, Library of Congress. 
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Fic. 1. Color-naming functions of first session. 


group. Although a blue category was 
available in this session, it was used 
so infrequently that the function could 
not be plotted. It can be seen that the 
red, green, and the computed function 
all peak in regions where neighboring 
functions are approaching zero, 
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Ond session, 
Were red, yellow, and green.) 


(The categories permitted 
Were red, green, and blue.) 


Figure 2 shows the results of the 
second session for the same two Ss 
as in Fig. l. All three functions (red, 
yellow, and green) appear to approach 
à maximum in regions where neigh- 
boring color functions approach or are 
at zero. The computed function makes 
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Fic. 3. Color-naming functions of third session. (The categories permitted 
were red, orange, and green.) 


only slight deviations from the zero 
base line for both of these Ss, and for 
all the remaining Ss except one. 

The results from the third session 
are shown in Fig. 3 for two Ss. The 
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data for S M.N., representative of a 
group of six Ss, show all three func- 
tions (red, orange, and green) peaking 
in a region where neighboring func- 
tions are at a minimum, The results 
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were red, orange, yellow, 
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for S S.M., representative of the re- 
maining three Ss, are similar to the 
first group except that for these Ss 
the orange function reaches a maxi- 
mum where red is the predominant 
color. For both groups of 5s the com- 
puted function peaks in the region from 
570 to 580 nm, where neighboring 
functions approach the zero base line, 

Figure 4 shows the results of the 
fourth session for the same two Ss as 
in Fig. 3 The order in which the yel- 
low and orange response categories be- 
came available in the second session 
did not systematically affect the use 
of these categories in this last session. 
The red, orange, and green functions 
Show the same characteristics as cor- 
responding curves in the third session. 
The yellow function is similar to the 
computed function in the third session, 
where the use of the yellow response 
category was prohibited. 
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In Fig. 5 red and yellow color- 
naming functions are plotted for those 
sessions in which the categories were 
available. It can be seen that for S 
M.N. the red function has a greater 
positive slope in both sessions where 
the orange category was permitted. 
This is not true for S S.M., for whom 
all red functions appear to have the 
same slope. For both Ss, and for 
both of the groups they represent, the 
yellow function has a greater negative 
Slope in those sessions where the 
orange category was permitted. 

The reliability of the mean of eight 
judgments and the reliability of single 
judgments were estimated using a one- 
way analysis of variance model with 
repeated measures (Winer, 1962). 
Within- and between-session compari- 
sons of reliability coefficients were 
made with an analysis of variance of 
ranked data, Friedman test (Winer, 


Fic. 5. Red and € color-naming functions obtained in those sessions in which 
lese response categories were permitted, 
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1962), and the Wilcoxon matched-pairs 

signed-ranks test (Siegal, 1956). Sig- 
nificant differences existed between 
coefficients in the fourth session, where 
the green category had the highest 
reliability (.90), red (.80), and yellow 
(.84) the next highest, and orange 
(.63) the lowest. Coefficients for the 
mean of eight responses averaged in 
the high 90’s for green, red, and yel- 
low, and in the low 90’s for the orange 
category. 


Discussion 


The employment of the “constant sum" 
method for the description of perceived 
hues represents an improvement over 
previous methods. In weighting his own 
judgments of the predominance of com- 
ponent hues, S is giving us more infor- 
mation with a high degree of reliability. 

The uniqueness of the perceived hues 
associated with the long wavelength part 
of the spectrum can be evaluated on the 
basis of the criteria established. It was 
found, first of all, that each of the cate- 
gories was used with equally high de- 
grees of reliability, except in the last ses- 
sion, when the green category ranked 
highest, and orange lowest. An addi- 
tional comparison of the reliability co- 
efficients associated with the green cate- 
gory in all four sessions suggested that 
a learning effect might be responsible for 
the extremely high reliability associated 
with the green category the fourth time 
it was used in the experiment, Although 
the reliability associated with the use of 
the remaining three categories were all 
lower, that associated with the orange 
category was reduced the furthest. It is 
not clear, however, why these differences 
exist in the final session and not in the 
session when the orange category was 
used for the first time. p 

The color functions associated with the 
red, yellow, and green categories always 
peaked in spectral regions where neigh- 
boring hues approached the zero level. 
This was true for all Ss and in all ses- 
sions. For three of the nine Ss the 
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orange function peaked in a region where 
the dominant color was clearly red and/or 
yellow. This means that for these Ss 
orange was never seen in the absence of 
red and/or yellow. The remaining six 
Ss were willing to report orange, alone, 
when stimulated with monochromatic 
light from a spectral region of 590-600 
nm. 
Although the color name associated 
with a perceived hue may be prohibited 
in a particular session, it should be pos- 
sible to compute the color function if that 
hue is unique. This would not be pos- 
sible otherwise, since a complex hue 
would be analyzed into more fundamental 
components. When both the yellow and 
orange categories were prohibited in the 
first session, and when the yellow cate- 
gory was prohibited in the third ses- 
sion, the computed function assumed the 
same characteristics as the yellow func- 
tion would have in sessions where that 
category was available. In the second 
session, however, when the orange cate- 
gory was prohibited the hue which 
would be called orange by some Ss in 
later sessions was almost completely 
analyzed into neighboring hues. <A 
comparison of the color functions in 
all four sessions (Fig. 5) indicated that 
many responses from the red and yellow 
categories are shifted into the orange 
category when it becomes available in 
the last two sessions. 

In conclusion, the results indicate that 
on the basis of the criteria established, 
the hue associated with the red, yellow, 
and green color names are unique. On 
the other hand, we have found some evi- 
dence which indicates that the hue asso- 
ciated with the color name orange is not 
unique in all aspects. When the orange 
response category is available along with 
the response categories of red, yellow, 
and green, it is used with the lowest reli- 
ability. When the orange response cate- 
gory is prohibited, the hue associated 
with this category is completely analyzed 
into red and yellow. Finally, for a group 
of three Ss the orange color function 

in a region predominated by red 
and/or yellow. These results suggest 
% 
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that some Ss are willing to call “orange” 
a certain combination of red and yellow 
only when the latter response category 
is available. An analysis of the slopes 
of the red and yellow color functions 
indicated that responses are shifted from 
these categories into the orange response 
category when the latter is permitted. 
It is concluded, therefore, that the hues 
associated with the long wavelength part 
of the spectrum can be adequately de- 
scribed without the orange category. 
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EFFECTS OF CODING STRATEGY ON PERCEPTUAL MEMORY 


WILLIAM E. MONTAGUE anp JOSEPH S. LAPPIN 
University of Illinois 


Haber (1964) found that groups using different encoding strategies 
differed in accuracy of stimulus description. He hypothesized that en- 
coding from a decaying memory trace could produce such results 
since one strategy took longer to execute. The present study at- 
tempted to determine whether Haber's results were due to a decaying 
trace, Neither under conditions replicating his with brief stimulus 
presentation nor under conditions where no trace decay was possible 
did performance differ as a function of strategy. It does not seem 
likely that trace decay is a factor in such stimulus encoding. 


Haber (1964) instructed two groups to 
encode stimulus dimensions from briefly pre- 
sented multidimensional stimuli using differ- 
ent strategies. One group took significantly 
longer to encode the stimuli than the other 
and made more errors. On the basis of 
earlier work by Sperling (eg, Sperling, 
1960), Haber (1964) proposes "that longer 
latencies and longer durations of initial en- 
coding produce more errors in encoding be- 
cause the short-term memory on which 
encoding is based, is fading rapidly [p. 32].” 
However, the fact that the two strategies 
probably differed with regard to Ss’ pre- 
experimental verbal habits raises some ques- 
tion about this interpretation. The longer 
encoding times and higher errors observed 
for one method of stimulus description 
might be the result of unfamiliarity or diffi- 
culty of the encoding procedures rather than 
stimulus trace decay. In an attempt to 
clarify this problem the present experiment 
compares the two encoding procedures under 
conditions duplicating Haber's with brief 
stimulus presentation and longer presenta- 
tion where no memory trace decay was pos- 
sible prior to encoding. 

Method.—As closely as possible the mate- 
rials, methods, and procedures duplicated 
those of Haber (1964). The design provided 
for two coding groups each performing the 
task with first a brief (replicating Haber) 
and then with an extended stimulus exposure, 
The task was the same for both presentation 
times but, during the last half of the session, 
exposure was sufficiently long so that “en- 
coding” was nearly always completed before 
the stimulus disappeared. It was assum 
therefore, that any performance differences 
between the groups using the two coding 


strategies under this condition would not be 
due to a difference in the availability of the 
stimulus trace at the time of encoding, 

Thirteen undergraduates served as Ss in 
each of the coding groups. 

The stimuli were the same as those used 
by Haber. Prior to the experiment each i. 
was instructed to use one of the two coding 
strategies to describe the stimulus, ie, Ob- 
jects Code, and Dimensions Code. Training 
and “Emphasis instructions” were given as 
described by Haber, but no “Equal instruc- 
tions” were given since this comparison was 
irrelevant to our purposes. 

The Ss were instructed to respond and the 
responses were tape recorded as described by 
Haber. A stopclock was used to time the 
duration from stimulus onset to completion 
of the first description. The rest of the 
times were taken from the tape recordings. 

The stimuli were presented by a Gerbrand’s 
tachistoscope. Their luminance was ap- 
proximately .8 ftc. and they subtended visual 
angles of 14° horizontally and 7° vertically. 
The S pressed a switch any time he was 
ready after E said “ready.” Each S viewed 
40 stimuli with one dimension emphasized at 
each presentation. Two exposure times were 
used. For the first 20 trials the stimulus 
duration was 100 msec, and for the last 20 
trials the duration was lengthened to 5 sec. 
The latter time was chosen, on the basis of 
exploratory work with other Ss, to be suffi- 
ciently long for Ss to verbalize the stimulus 
description. Order of stimulus presentation 
was counterbalanced within each duration 
and for each coding strategy. 

Results.—The analyses of the results fol- 
low Haber (1964) as closely as possible ex- 
cept where the differences in procedure dic- 
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tated or where another comparison statistic 
was considered preferable. The major com- 
parisons of interest are between the groups 
using the different encoding strategies. 
Within-Ss comparisons of the effect of ex- 
posure time on performance are not conclu- 
sive since practice effects covary with expos- 
ure time. The between-groups comparisons, 
however, should be unaffected since in analy- 
ses of variance run on the data no interac- 
tions between groups and exposure time were 
significant. 

Table 1 presents the mean times (Columns 
1, 2, 3) and errors (Columns 5, 6, 7, 8). 
Contrary to Haber’s results no differences 
were found between groups in encoding 
latency (stimulus onset to the beginning of 
S’s utterance) for either exposure time. 
However, Objects coders are able to describe 
the stimulus (length of utterance) faster 
than Dimensions coders at both ure 
durations, 100 msec. : t (24) = 5.151, p < .01; 
5 sec.: t (24) = 3.520, p < .01. Therefore, 
Objects coders can encode the stimuli com- 
pletely considerably faster (by 1 sec. or 
more). Similarly, more repetitions of the 
description are made by Objects coders with 
the 100-msec. exposure, ¢ (24) =2.566, p 
< .05, and the difference between groups at 
the S-sec. exposure just fails to reach sig- 
nificance, £ (24) = 1.888. 

The differences between the groups in 
initial encoding errors (Column 4) for both 
exposure times were insignificant. Since six 
errors could be made on each of the 20 stim- 
uli given at each exposure duration, the error 
percentages for 100-msec. exposure are 45% 
which is about half that reported by Haber 
(9 and 11% for the encoding groups, a 
difference which was significant). 
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Tn order to examine the relationship of en- 
coding speed and accuracy the durations of 
initial encoding for trials with and without 
error were compared. Contrary to Haber's 
findings, no difference was found for either 
group in the present study. 

Another measure of accuracy, not used by 
Haber, is the number of trials on which the 
description was correct at the end of the 
rehearsal period. These data are shown in 
Column 9 in Table 1. No significant differ- 
ence between the groups was found. Within 
each exposure-time condition both groups 
were correct on about the same number of 
trials. 

No analysis was made for the effects of 
the emphasis instructions and order of en- 
coding, since the number of errors in each 
category would have been small. Cursory 
examination revealed the same tendency 
noted by Haber for Dimensions coders to 
encode the emphasized dimension first. 

Following  Haber's analysis, the 
groups were compared with respect to 
the effects of rehearsal on the retention of 
the encoding, particularly on the effects of 
interference to the encoded description by 
repeated rehearsal [Haber, 1964, p. 360]." 
Interference was measured by (a) changes in 
description from one repetition to the next, 
and (b) stumbles, hesitations, breaks in 
rhythm, etc. These data are given in Col- 
umns 5, 6, 7, and 8 in Table 1. Changes in 
description include right to wrong (R-W), 
wrong to right (W-R), and wrong to wrong 
(W-W). The total number of changes was 
considerably smaller than found by Haber. 
No significant differences were found be- 
tween the groups either in total number of 
changes, number of changes making encod- 


two 


TABLE 1 


SUMMARY OF RESULTS INCLUDING LATENCY, 


DURATION OF THE First REPETITION 


(Encopinc Time), ENCODING Errors, CHANGE Errors, 
AND NUMBER OF Correct TRIALS 


No. Errors during 


No. i 

E ir | aims. e a | Ce 

tratesy | tion | (Sec) | et Ren-| tions in | ot Rep- aca [on stumbles} (Out of 

(Msec.) * | etition | T5 R-W. | Changes 20) 

[s W-W | WR 

x a) (2) (3) (4) (5) (6) (7) (8) [2] 
ts i 

i 100 147 1.90 7.87 5.08 1.53 .85 «69 16.54 | 14.69 

s = 13)| 5000 140 | 2.38 8,78 062 | 0.92 223 69 | 1215 | 19.15 
imensions| 100 1. 

d eo. so 78 3.12 6.37 5.92 2.00 1.38 .62 33.0 13.62 

1.65 3.39 7.50 0.54 0.92 46 46 27.0 19.00 


| 
| 
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ing wrong (R-W, W-W), or changes cor- 
recting an error (W-R), using the Mann- 
Whitney U test (Siegel, 1956). Stumbles 
occurred significantly more often for Dimen- 
sions coders at both exposure durations, 100 
msec.: t (24) — 358, 5 sect d (24) 2 2.56, 
p< for both. 

Discusion.—The results of the present ex- 
periment are at odds with Haber's in several 
ways. The main area of agreement is that 
Dimensions coders take longer to verbalize 
their description of the stimuli and as a re- 
sult are able to rehearse the description fewer 
times. However, in the latency of beginning 
the description and in the various error 
measures no substantial differences were 
found. Thus, the data of this experiment do 


caying memory trace. Not only was there 
no difference between groups with the 5-sec. 
exposure, which eliminated any decay, but no 
more errors occurred as a function of encod- 
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ing strategy when the 100-msec. exposure 
was used. 

‘Although reasons for these discrepancies 
are not entirely clear, minor changes in 
stimulus presentation may be noted. For 


operation of a fading stimulus trace has been 
altered by these procedural changes, the ef- 
fect of coding strategy in perceptual memory 
may not be very important. On the other 
hand, the relationship between response 
latency and accuracy which was obtained by 
Haber may not imply a fading stimulus trace. 


REFERENCES 


Haser, R. N. Effects of coding strategy on per- 
i tof col ole 1968, "08, 


snort, S. Nonparametric statistics, New York: 
McGraw-Hill, 1956. 
G.' The inf available in brief 


SPERLING, G. 
i tations, Psychol, Monogr., 1960, 74(11, 
Whole No. 498). 


(Received September 24, 1965) 


SENSITIZED HUMAN HEART-RATE RESPONSE AS EVALUATED 
BY SIMPLE PROCEDURES * 


PAUL A. OBRIST AND DONALD M. WOOD 
School of Medicine, University of North Carolina 


The sensitized heart-rate 
ple method where the CS an 
under 4 experimental conditions. 
those obtained to the CS- using 


eralization. Evidence is al 


lated to drug administration potentiate sensi! 


response in humans as evaluated by the sim- 
S and UCS are not paired was determined 


‘A comparison of these effects to 
differential procedures from 2 pre- 


vious studies indicated that the pin was in 


itization effects. Other 


aspects of the data suggest that there are differences in both the direc- 


tion and amplitude of sensitiza! 
CSs. 


In two previous studies (Obrist, Wood, & 
Perez-Reyes, 1965; Wood 1 
in which differential procedures Were 
evaluate sensitization effects om heart rate 
during classical conditioning there was some 
indication that the sensitiz 
be influenced by stimulus gen! i 4 
the CS+. In order to evaluate this possi- 


1 This research Research Grant 
MH-07995, National. Erie by Went, United 
States Public Health Service. 


tion effects between visual and auditory 


bility sensitization effects have been deter- 
mined in four additional experimental groups 
by using “simple procedures” where the 
paired with a CS. Three of 
these groups were comparable in all other 
respects to the experimental groups of the 
two previous studies, thus providing a com- 
parison between sensitization effects as de- 

ined by each method. Of these groups 
sensitization effects were evaluated using an 
auditory CS in one case (Wood & Obrist, 
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1964) and a visual CS at two different UCS 
intensities (Obrist et al., 1965) in two cases. 
In the latter instance, Ss also received 
venipunctures and were led to believe that 
they received a drug (atropine). Therefore, 
a fourth experimental group was run in 
which the venipuncture was omitted in order 
to determine its influence on sensitization 
effects. 

Method—The Ss were 80 healthy male 
undergraduates randomly selected from the 
introductory psychology class. Twenty were 
assigned to each of four experimental groups. 
However, data were not collected on 14 Ss 
either because they refused to be in the 
experiment or because of equipment failure 
and related difficulties. The final sample 
sizes were: Group 1, N=14; Group 2, 
N=19; Group 3, N=14; Group 4, N — 19. 
The equipment, conditioning procedures, and 
method of data analysis were identical with 
that previously reported, except that the 
CS-- paired with each presentation of the 
UCS was omitted. This involved 10 pres- 
entations of each of two CSs and 16 un- 
paired shocks. The CS+ was included when 
it previously occurred as a nonreinforced 
test trial. The experimental groups and 
their counterparts in the previous two studies 
were: Group 1—comparable to Conditioning 
Day 1 in the Wood and Obrist (1964) 
study. This involves, among other things, 
an auditory CS and no experimental control 
of respiration. Group 2—comparable to the 
low intensity UCS saline group in the Obrist 
et al. (1965) study. This involves, among 
other things, a venipuncture with S believing 
he received atropine, a visual CS, and experi- 
mental control of respiratory activity. Group 
3—comparable to the high intensity UCS 
saline group in the Obrist et al. (1965) 
study. This is similar to Group 2 except for 
a more intense UCS. Group 4—comparable 
to Group 2 of the present study except for 
the omission of the venipuncture, and, as 
such, used as an evaluation of this procedure 
on sensitization. 

Results and discussion.—Second-by-second 
changes in heart rate for the 15 sec. follow- 
ing CS onset were obtained from the average 
Pre-CS base level for the 10 CS— trials 
when using a visual CS. For the auditory 
CS, all 20 high and low tone CS trials were 
used because in the previous study the CS— 
was a 1,000-cps tone for half the Ss and a 
500-cps tone for the other half, 

A consistent difference in sensitization 
effects between differential and simple meth- 
ods was observed with differential procedures 
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Fig, 1. Heartrate changes from Pre-CS base level 
to CS— using differential (solid line) and simple 
(dashed line) methods. (Asterisks indicate change 
significant, p < .05, from zero base line.) 


resulting in either a greater deceleratory or 
smaller acceleratory response. With a tone 
CS, a reliable deceleratory response was ob- 
served using both methods (see Fig. la), 
but it was both more extensive and of 
greater amplitude with differential pro- 
cedures, being reliably greater (p < .05) on 
Sec. 5, 6, and 7 ( tests of independent 
means: Sec. 5, £— 249; Sec. 6, t= 228; 
Sec, 7, 1— 223). With simple procedures, 
a reliable acceleratory response was ob- 
served immediately following CS onset. 
This appeared to be influenced by respiratory 
changes. However, in the light of previous 
results (Wood & Obrist, 1964), it is likely 
that only the initial differences between the 
two methods (that is, during the acceleratory 
phase) could be accounted for by respiratory 
effects, 

In both experimental groups with a visual 
CS, the sensitization effect using simple 
methods was a reliable acceleration of heart 
rate (see Fig. 1b and Ic). On the other 
hand, differential procedures at the low 
shock intensity revealed a reliable period of 
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deceleration which was significantly less (p 
« 05) than the response with simple pro- 
cedures on Sec. 5, 6, 8, and 9 (t tests of 
independent means, Sec. 5, £—2.55; Sec. 6, 

— 2.64; Sec. 8, #=2.07; Sec. 9, 1-211). 
With the more intense UCS, a reliable ac- 
celeration of heart rate was observed with 
differential procedures, but this response was 
both smaller and less sustained than with 
simple procedures. However, the difference 
in response between the two methods was not 
significant. There appeared to be no respira- 
tory influence on these changes in heart rate. 
Therefore, in the light of the fact that heart- 
rate changes to the CS+ during differential 
conditioning are primarily a deceleratory re- 
sponse, these data indicate that there appears 
to be a significant influence of stimulus gen- 
eralization on the response to the ESS 
Although this effect is not large, it is con- 
sistently observed under various conditions, 
and appears to justify the recommendation 
that sensitization effects be evaluated by 
simple methods. 

No reliable change was observed in heart 
rate in the fourth experimental group where 
no venipuncture was given. Compared to 
Group 2, where on 7 sec. a significant ac- 
celeration of heart rate was found, heart-rate 
changes were always smaller though the 
difference between groups at any 1 sec. was 
not quite significant (largest difference f 
=1.83, p <.10). Therefore, this suggests 
that the effect of a venipuncture is to 
potentiate sensitization effects and that under 
these conditions with a visual CS and ex- 


781 


periment controlled respiration, there is 
no sensitization effect if there is no veni- 
puncture. 

Finally, data from this as well as the 
previous two studies suggest that an auditory 
CS results in a different sensitized response, 
both in terms of direction and amplitude, 
from a visual CS. This issue cannot be 
directly evaluated by the present study be- 
cause there are not available two experimental 
groups matched in all respects except for 

of CS. However, sensitization effects 
as evaluated by an auditory CS have been 
found to be primarily a deceleration of heart 
rate in three independent samples and on twe 
different conditioning days in two of these 
samples. On the other hand, sensitization 
effects to a visual CS have now been found 
in five independent samples to be either a 
smaller deceleration, primarily an accelera- 
tion, or no change at all. Also, differences 
between visual and auditory CSs on heart 
rate in dogs have been previously noted by 
Gantt (1960). Therefore, a systematic study 
of this issue seems warranted. 
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A REPLICATION OF HOVLAND'S STUDY OF GENERALIZATION 
TO FREQUENCIES OF TONE* 


SEYMOUR EPSTEIN anp KENNETH R. BURSTEIN * 
University of Massachusetts 


48 Ss were tested for generalization with 4 tones of the same fre- 
quencies as in Hovland’s classic study. The 16 tones were presented 
in a 4X4 Latin square during adaptation and testing for generaliza- 
tion. 4 of the Ss were conditioned to a tone at 1 end and 3 to a tone 


at the other end of the dimension. 


U-shaped curves of generalization 


were found for all testing cycles. No significant difference was found 
between electrodes on and off during generalization, nor between the 
exosomatic and the endosomatic method. It was concluded that testing 
for generalization is complicated by alerting reactions and stimulus- 
disparity effects, and that the overall evidence for monotonic decre- 
mental gradients of primary stimulus generalization of the GSR in 
humans is highly questionable, apart from a consideration of the exact 


shape of the curve. 


In spite of its central role in contemporary 
learning theories, the empirical status of 
gradients of primary stimulus generalization 
(SG) in humans remains highly question- 
able. While Hovland (1937a), in an early 
study of SG of the GSR, reported that the 
generalization function was concave upward, 
attempts to replicate this oft-cited study have 
produced ogival, convex, and concave mono- 
tonic gradients, as well as gradients of “zero 
slope” (see Mednick & Freedman, 1960). 
In the face of these widely divergent find- 
ings, it is Hovland’s results which are con- 
sistently cited and his gradients which are 
reproduced in textbooks. The present study 
is the first in a series attempting to deter- 
mine the conditions necessary for the demon- 
stration of GSR gradients of primary SG 
within human Ss. Also, because Hovland 
used a skin potential (SP) measure (method 
of Tarchanoff), while almost all subsequent 
investigators have employed a skin resistance 
(SR) measure, both responses were inves- 
tigated. 

Procedure.—The Ss, 48 students in intro- 
ductory psychology courses, were randomly 
assigned to two groups of 24 each. Five 
additional Ss were discarded due to pro- 
cedural errors. 

The stimuli were tones of 153, 468, 1,000, 
and 1,967 cps delivered through both ear- 
phones of a matched headset, and judged in 
preliminary bas B be of equal loudness, 

ome pressure levels at the earphones 
63, 70, 79, and 82 db., Tid. For half 

1 This study was si 
S, Epstein fom the Warne) (rans MHL 01299 to 
F? lental 
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of the Ss the CS was the 153-cps tone, and 
for the other half the 1,967-cps tone. The 
duration of the CS was 500 msec. The UCS 
was a shock of 50 msec. delivered to the 
right wrist. The intertrial interval varied 
from 10 to 60 sec. with a mean of 35 sec. 
A programming system made available 24 
Latin squares of four cycles of the four 
stimuli. One of these Latin squares was 
used to present 16 adaptation trials. This 
was followed by 16 conditioning trials, con- 
sisting of the 153-cps tone paired with the 
UCS for half the Ss, and the 1,967-cps tone 
paired with the UCS for the other half. Im- 
mediately following the conditioning trials, 
a Latin square of 16 test trials was pre- 
sented, which was different from the original 
Latin square. Because Hovland's procedure 
was unclear, half of each CS group received 
the test trials with the shock leads on and 
half with the shock leads off. 

Results and discussion.—Identical analyses 
of variance were performed on the adaptation 
trials and on the test trials for both the SP 
and the SR measures. As the results were 
the same, only the findings on the SR meas- 
ure will be reported. Any deflection from 
the base line of 1 mm. or more with a latency 
of 0.5-3.0 sec. was scored as a response. 

, There was a significant adaptation effect 
in the Preconditioning trials, with response 
magnitude decreasing with successive cycles 
of tones, F (3, 120) = 6.29, p<.001. There 
was also a significant effect associated with 
the differences among tones, F (3, 120) 
—1071, p «.001, before conditioning. The 
mean response magnitudes for the 1,967-, 
1,000-, 468-, and 153-cps tones, in that order, 
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were .674, .515, .514, and 427 „mhos. Al- 
though the greater response strength to the 
1,967-cps tone during the adaptation series 
could present a problem in the interpretation 
of the generalization curves, the problem was 
adequately handled by the experimental de- 
sign and the analysis of the data. For one, 
the 1,967-cps tones was the CS for half the 
Ss and the 153-cps tone for the other half. 
The conditioned generalization curves for 
these groups were virtually identical when 
plotted as a function of dissimilarity from 
the CS. Secondly, difference scores obtained 
by subtracting magnitude of response to a 
particular tone in a particular adaptation 
cycle from the corresponding tone in the 
corresponding test cycle produced similar 
generalization curves to simply plotting the 
absolute scores. The same was true when 
regression equations were employed for pre- 
dicting response amplitudes, and deviations 
from these plotted. 

The results for the generalization phase 
are plotted in Fig. 1. There was a signifi- 
cant effect associated with differences among 
tones. The greatest response occurred to the 
stimulus farthest removed from the CS. 
This was true regardless of whether the CS 
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Fio. 1. Magnitude of GSR as à function of a stimulus 
di Magni Cycles treated as a parameter 


was the 153 or the 1,967-cps tone, there 
being no significant difference between the 
two groups. There was a highly significant 
decrease in response strength with increas- 
ing test cycles, F (3, 120) = 10.78, p< 001, 
which reflected the rapid extinction of the 
CR after the first test cycle. The level of 
response on the first cycle of trials subse- 
quent to conditioning was higher for all 
stimuli than the level of response on the last 
cycle of the adaptation trials, a finding which 
has frequently been interpreted as evidence 
for conditioning. 

The form of the generalization curve was 
the same when the data were analyzed for 
frequency of responses to the four trials in 
place of magnitude of response. Generaliza- 
tion was also evaluated by a between-S's 
design, using only the first response to 
the generalization stimuli. The results were 
similar to the within-Ss design, a U-shaped 
generalization curve again being obtained. 
The results for SP paralleled in every way 
those for SR. 

It is important to consider that not only is 
the shape of the gradient of primary stimulus 
generalization of the GSR in humans in 
question, but the evidence for any continuous 
decremental gradient is questionable. In 
commenting on what they interpreted as a 
positive finding in support of SG, Grant and 
Schiller (1953) noted 


Littman (1949) was also impressed with the 
widely divergent results stemming from 
replications of Hovland's study, as indicated 
by his comment on “the large variability 
which the phenomena of generalization seem 
p. 880].” A review of the total 


monotonic decremental gradients of the GSR, 
regardless of exact shape, have not been 
i established in humans. 


One explanation for inconsistencies in the 
results of studies using what appear to be 


have not been adequately taken into account. 
For example, it may be that what an S 
thinks a stimulus js is more important than 
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what the stimulus actually is. In interviews 
conducted at the end of the experimental 
sessions, some Ss failed to identify the tone 
which had been accompanied by shock, some 
reported being shocked to a tone which was 
not accompanied by shock, and some Ss re- 
ported being shocked to all tones, We are 
currently conducting experiments in which 
it will be possible to plot responses once as a 
function of the objective dimension, and 
again as a function of a subjective dimen- 
sion, as Ss are required to label the stimuli 


reported in this and another study (van Olst 
& Orlebeke, 1965) reflect the occurrence of 
two distinct Phenomena, a decremental gradi- 


Presentation of which Produces an anchoring 
effect, or “base stimulus,” for perceptual dis- 
Parity responses. It is noteworthy that 
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another study: “Quite marked in the present 
experiment, and much more Pronounced in 
an’ accessory experiment on tactual condi- 
tioned responses, is the augmented size of 
the response to the intensity farthest removed 
from the conditioned stimulus on the first 
cycle of testing [p. 237).” 

It is, of course, possible that stable gradi- 
ents of primary stimulus generalization 
simply do not exist in individual humans, 
difficult as it is to believe that such is the 
case. The apparent reasonableness of the 
phenomenon is, of course, not sufficient 
reason for accepting it, and, even if it were, 
it would still be necessary to produce it in 
the laboratory in order to investigate its 
parameters, 
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INFORMATION TRANSMISSION (7) IN RECOGNITION AND 


RECALL AS A FUNCTION OF 


ALTERNATIVES (£k)* 


WILLIAM H. FIELD AND ROY LACHMAN 
State University of New York at Buffalo 


Information transmission (I) in free recall has been reported as equal 
or superior to I in recognition where it is assumed that the number 


scanned (k) in recall is. established by de- 


during recall and to compare resulting values of I. The results sug- 
gest: (a) The number of symbols scanned can be manipulated by in- 
structions and different values of k may be appropriate for different 


recall instructions. 


(b) As S becomes more familiar with character- 


istics of the stimulus population, more response alternatives become 
available for scanning during recall and the appropriate value of k 
increases. (c) Manipulation of k through instructions has, subtle 


effects on recall raw scores. 


(d) Depending on the value of k used 


to calculate I for recall, I can be larger or smaller for recall vs. recog- 


nition, (e) The value of k use 
grossly overestimated in previous 
from a value of k that appears to 


than in free recall. 


In the past research comparing rec- 
ognition and free recall after compar- 
able training, the general conclusion 
has been that recognition is superior to 
free recall (Postman & Rau, 1957). 


1Supported by National Institutes of 
Health Grant No. MH 10764 and based on 
an MA thesis submitted by W. H. Field to 
the State University of New York at 
Buffalo. The authors gratefully acknowl- 
edge the assistance of B. R. Judd (Lawrence 
Radiation Laboratory) and N. S. Suther- 
land (University of Sussex) with certain 
aspects of the mathematical formalism. 


d to compute I in free recall, was 
research. (f) When I is calculated 
the best estimate of scanning in con- 


ventional free recall, more information is transmitted in recognition 


Davis, Sutherland, and Judd (1961) 

recently dissented from the prevailing 
view. They argued that retention er-, 
rors have been ignored or dealt with 
arbitrarily in comparing recognition 
and recall A method was: developed 
for comparing recognition and recall 
with respect to the amount of informa- 
tion transmitted (I) (Judd & Suther- 
land, 1959). The method requires that 
the number.of alternatives (k) from 
which Ss select their answers be known 
in both test conditions. In a recogni- 
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tion test k is directly determined by 
the number of test alternatives. In a 
free-recall test the value of k is un- 
known. In an experiment, Davis, 
Sutherland, and Judd (1961) com- 
pared J in recognition out of 30, 60, 
and 90 alternatives to 7 in free recall. 
Training stimuli were sampled from a 
population of 90 bigrams. Since this 
finite and restricted population was 
described to Ss during pretraining, the 
critical assumption was made that this 
population would be scanned during 
free recall, and that & — 90 for the 
recall condition. They found that an 
equal amount of information was trans- 
mitted in recognition out of 90 alterna- 
tives and in free recall, also that J for 
recall was higher than for recognition 
out of 30 and 60 alternatives. Davis 
et al. (1961) concluded that in previous 
studies “recognition is superior to re- 
call only because it usually involves 
selection from fewer alternatives [p. 
427]." 

The present experiment was designed 
(a) to investigate the validity of the 
Davis et al assumption concerning 
their value of k in recall; and (b) to 
compare the amount of information 
transmitted in recognition and recall 
with empirically determined values of 
k. Three different free-recall groups 
were employed in an attempt to esti- 
mate the number of response alterna- 
tives available to Ss in recall. One 
group, a conventional free-recall group, 
was required to estimate b after in- 
creasing familiarity with the stimulus 
population. A second recall group was 
required to write down the items 
remembered from training as well as all 
other items thought of while attempt- 
ing to remember the correct ones, The 
third recall group was instructed to 
write down the entire stimulus popula- 
tion and at the same time to indicate 
the items remembered from training, 
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METHOD 


Design—Six independent experimental 
groups consisting of 30 Ss each were de- 
fined by the type of retention test employed. 
There were three conditions of successive 
binary recognition testing (Lachman & 
Tuttle, 1965) : recognition out of 30 alterna- 
tives (Recog-30), recognition out of 60 
alternatives (Recog-60), and recognition out 
of 90 alternatives (Recog-90). The three 
recall conditions are differentiated according 
to test instructions which are described below. 
Each of the six groups received four suc- 
cessive training and testing sessions. A 
session consisted of a single exposure to an 
independent training list followed by a re- 
tention test. The retention test was of the 
same type for each S in each session. 

Subjects—The Ss were 180 introductory 
psychology students at the State University 
of New York at Buffalo ranging in age from 
17 to 23 yr. 

Verbal materials—Verbal materials con- 
sisted of 90 bigrams (Davis et al, 1961) 
constructed by beginning each bigram with 
a consonant and ending each with a vowel. 
The consonants q, x, and z were excluded. 
A total of 720 training lists consisting of 
15 different bigrams each were constructed 
by successively randomly selecting 15 bi- 
grams from the set of 90. 

Recognition testing lists were made up by 
randomly intermixing the 15 training bi- 
grams with either 15, 45, or 75 different 
randomly selected distracter bigrams. 

The bigrams were printed on blank IBM 
cards, one to a card, for presentation during 
training and for sorting during recognition 
testing. 

Procedure —The Ss were tested in groups 
of five. Each group was seated at a table 
containing the training and testing materials. 
All Ss received four different training decks 
of IBM cards for four independent training 
sessions. The testing condition assigned to 
each S was constant for the four tests. 

Immediately before the presentation of the 
first training deck general training instruc- 
tions were read to all Ss. These instructions 
consisted of a description of the stimulus 
population: each bigram begins with a con- 
sonant and ends with a vowel, excluding the 
Consonants q, x, and z; and a description 
and demonstration of the bigram presenta- 
fion procedure. The bigrams were presented 
by baving Ss flip the IBM cards in time 
with tape-recorded “clicks” spaced every 
15 sec, 
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Following the general training instruc- 
tions the appropriate test instructions were 
read to the different retention groups. The 
three recognition groups were given the same 
test instructions. They were told that im- 
mediately following training they were to 
sort the test alternatives into “yes” or “no” 
categories depending on whether or not they 
had seen the test bigrams in the training list. 
The three recall groups were given different 
test instructions. 

1. Recall 1, conventional free recall: This 
group was told to simply write down all the 
bigrams remembered from the training list 
without regard to order, and to begin the 
test immediately after training. Following 
the fourth session, each S in Recall 1 was 
required to estimate the total number of 
bigrams in the stimulus population. 

2. Recall 2, suppressed response recall: 
The Ss in this group were instructed to 
write down all the bigrams they happened 
to think of while attempting to remember 
the syllables from the training deck. The 
Ss were further instructed to place a check- 
mark only after those bigrams remembered 
from the training list and to start the test 
immediately after training. 

3. Recall 3, response scanning recall: This 
group was instructed to generate and to 
write the entire stimulus population (ie., all 
the two-letter syllables that begin with a 
consonant and end with a vowel, excluding 
the consonants q, x, and z), and at the same 
time to put a checkmark after those bigrams 
remembered from the training list. The Ss 
were told that credit would be received for 
the bigrams correctly checked from the train- 
ing list and for all other population bigrams 
written. For Recall 3, the test instructions 
were read both before list presentation and 
before testing. 

Preceding each training-testing session 
the description of the stimulus population 
and the appropriate test instructions were 
repeated to all groups. There was approxi- 
mately a 90-sec. interval between each ses- 
sion. Before each test all Ss were told that 
they could guess but not to guess randomly. 
No time limit was imposed for the recall 
tests. 


RESULTS AND DISCUSSION 


Several recall conditions were ad- 
ministered in this experiment for the 
purpose of estimating the appropriate 
value for k, the number of response 
alternatives that are available to S dur- 
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ing conventional free recall. If this 
proved impossible to achieve unequiv- 
ocally, then a secondary objective was 
to establish reasonable upper limits 
for the value of k in conventional free 
recall. Recall 1 in the present experi- 
ment was a conventional free-recall 
test of the type frequently employed 
in memory research. This type of re- 
call test was utilized by Davis, Suther- 
land, and Judd (1961). The Ss in 
Cond. Recall 1 were required to give 
a single estimate of the number of 
alternatives comprising the stimulus- 
training population. The estimate was 
made aíter the fourth testing session. 
The purpose of this estimate was to 
see if it might yield a value anywhere 
near 90. In Recalls 2 and 3, k was 
calculated directly from the response 
protocols, based on the following ra- 
tionale. In conventional free recall, 
such as Recall 1, it is assumed that Ss 
implicitly generate response alterna- 
tives, some of which are emitted as 
correct while others are suppressed. 
Recall 2 instructions require 5 to make 
explicit the suppressed response alter- 
natives which are assumed to occur 
during free recall. Such a suppressed 
response recall test provides a relatively 
direct measure of k while presumably 
leaving the retention raw scores the 
same as those obtained in conventional 
free recall. However, under the pres- 
ent conditions, differences between Re- 
call 1 and 2 in the retention raw scores 
were observed and are described be- 
low. Recall3, in contrast, may be char- 
actered as a response scanning free- 
recall task since Ss were instructed to 
indicate the recalled items while re- 
producing all the possible response 
alternatives. Recall 3 calls for mani- 
festly different behavior on the part of 
S than does Recall 1 and 2. Con- 
sequently, consistent differences were 
obtained for certain aspects of the raw 
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TABLE 1 
EMPIRICAL DETERMINATIONS OF THE MEAN NUMBER OF 
RESPONSE ALTERNATIVES (k) IN RECALL 
Training-Testing Session 
Cond, Source 
1 2 3 4 
Recall 2 21.0 27.9 29.5 30.6 Computed from protocols 
Recall 3 67.8 81.2 82.0 81.8 Computed from protocols 
Recall 1 after four sessions 48.6 Ss’ estimates 


scores. The purpose for including Re- 
call 3 was to observe the number of 
reproduced alternatives under optimal 
scanning conditions that perhaps would 
produce a maximum value of b. The 
thrust of the evidence reported below 
is that different values of k appear 
to be appropriate for different recall 
instructions while the initial objective 
of the research was to find the best 
value of k for conventional free recall. 

The mean numbers of response 
alternatives for the recall conditions are 
presented in Table 1. It is evident 
from Table 1 that Ss in Recall 1 
grossly underestimated the population 
value of 90 and that in Recall 2 
(suppressed responses) Ss consistently 
produced lower values of k than in 
Recall 1. In Recall 3, where Ss were 
instructed to write down all the popu- 
lation alternatives, the mean values of 
k are notably higher than those in Re- 
calls 1 and 2, but they do not reach the 
maximum value of 90. Even if the 
values of k given in Recall 3 had been 
equal to 90 this in no way would sug- 
gest that the appropriate value of k 
in Recall 1 is 90 since the two condi- 
tions involve widely different tasks 
(cf. error column, Table 2). Mann- 
Whitney U tests indicate that at the 
fourth session 5 in Recall 1 is sig- 
nificantly lower than b in Recall 3: 
2 — 98, p< 001, and that, similarly, 
k in Recall 2 is significantly lower than 
k in Recall 1; 2237, 5 — Q0], Tt 


will be recalled that Recall 2 provides a 
relatively direct measure of response 
alternative scanning. The values of k 
obtained in Recall 2, therefore, may 
represent the best estimates of k for the 
conventional free-recall condition, Re- 
call 1. Thus, the assignment of the 
value 90 to k by Davis, Sutherland, 
and Judd (1961) is unsupported by 
the present findings and appears to 
involve an overestimation. 

A major purpose of the successive 
training and testing sessions was to 
assure that the three empirical deter- 
minations of the value of & were based 
on reasonable familiarity by S with the 
stimulus material. To evaluate the 
effect of repeated exposures to the 
stimulus material on the values of k 
given in Recalls 2 and 3 two Friedman 
two-way analyses of variance by ranks 
were performed. The analyses showed 
that the number of response alternatives 
in Recalls 2 and 3 increased signifi- 
cantly over sessions; xr = 31.67, p< 
001, and 4%, = 24.09, p < .001, re- 
spectively. These results throw doubt 
on the assumption that P in recall re- 
mains constant throughout repeated 
exposures to the stimulus material. 
As Ss become more familiar with the 
characteristics of the stimulus popula- 
tion, it appears that a greater number 
of response alternatives become avail- 
able for scanning. 

The mean number of correct re- 
sponses and the mean number of errors 


| 
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TABLE 2 
MEAN CORRECT Scores AND MEAN ERRORS 


Session 1 Session 2 Session 3 Session 4 
Cond. 

Cor. Err. Cor. Err. Cor. Err. Cor. Err, 
Recog-30 10.8 2.5 11.4 3.9 11.3 3.6 10.8 3.1 
Recog-60 10.3 9.2 10.0 9.9 10.1 9.7 10.0 9.6 
Recog-90 8.8 15.6 8.6 14.7 8.3 12.3 8.5 11.8 
Recall 1 712 3.5 TS 3.8 1.2 3.5 7A 3.7 
Recall 2 6.2 24 7.3 3.5 7.5 3.5 8.1 3.4 
Recall 3 6.5 4.1 6.7 54 1.2 6.0 LEY 4.8, 


are presented in Table 2. One pertinent 
aspect of the raw data is that in the 
recognition conditions an increase in the 
number of distracters resulted in a con- 
siderable increase in the number of er- 
rors with a somewhat less noticeable 
but consistent decrease in the number 
correct. This is completely consistent 
with the results of Lachman and Tuttle 
(1965) that were obtained with differ- 
ent stimulus materials, high-frequency 
words. The trends of increasing errors 
and decreasing correct responses as the 
number of distracters in recognition in- 
creased are also consistent with the 
Davis, Sutherland, and Judd (1961) 
findings. 

Variations in the number of re- 
sponse alternatives produced in the 
recall conditions did not result in the 
obvious trends regarding correct re- 
sponses and errors observed in the 
recognition conditions. Manipulations 
of k through test instructions in Recall 
2 had subtle effects on the raw scores. 
Although it appears that there are no 
consistent differences among the raw 
scores in Recalls 1 and 2, closer 
Scrutiny reveals that the scores re- 
main virtually constant across sessions 
in Recall 1 whereas in Recall 2 the 
number correct increase progressively 
with sessions. The interaction between 
training sessions and Recall Cond. 1 
and 2 for the number of correct re- 
sponses is significant at p < Ol, F 


(3, 174) = 3.94. It is possible that 
instructions to emit suppressed re- 
sponses become increasingly effective 
with training and that some of the sup- 
pressed responses are in fact correct 
responses. This might account for the 
interaction. Whatever the basis of the 
interaction, the data on Recall 2 still 
suggest that 90 is much too high an 
estimate of k in conventional free recall. 

Although the value of k obtained 
in Recall 3 is substantially greater than 
that obtained in Recall 2, it is still con- 
siderably less than 90. The high value 
of & obtained indicates that Recall 3 
is not a conventional free-recall task 
and therefore requires a different k 
value for estimating J. This is born 
out by the errors in Table 2. Sig- 
nificantly more errors were made in 
Recall 3 than in Recall 1 or 2, F (1, 
58)— 6.40, p < .025, and F (1, 58) 
= 9.84, p < .01, respectively. 

In order to compare the recognition 
and recall conditions with respect to 
amount of information transmitted, Ss’ 
retention raw scores were converted to 
information measures (J scores) using 
the formulas derived by Judd and 
Sutherland (1959). A computer pro- 
gram (Lachman & Field, 1966) ex- 
ecuted the conversion. For Recall 1 
two sets of J scores were computed for 
each S; one set based on each S’s 
estimated value of k (Recall 1—S-Est) 
and another based on k = 90 (Recall 1 
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TABLE 3 
MEDIAN INFORMATION SCORES 
Session 
Cond. k 
1 2 3 4 
Recog-30 30 | &1| 6.0| 64] 6.0 
Recog-60 60 | 8&0| 84] &8| 6.1 
Recog-90 90 | 7.1] 4.9] 61] 8.7 
Recall 1(S-Est) | —]| 2.3] 3.5] 2.5] 3.5 
Recall 1(90) —>) 10.6 | 14.7 | 11.7 | 11.7 
Recall 3 —| 64| 82| 89] 92 


^k based on Ss’ estimates, 
bk ERA assumed to be 90. 
* k computed from retention protocols (see Table 1). 


—90). The median J scores are pre- 
sented in Table 3, 

A requirement of the technique for 
computing J scores is that k be greater 
than the sum of the number of symbols 
in the training list, g, and the total 
number of responses, g, minus the 
number correct, r; ie, b > g + q-— 
(cf. Lachman & Field, 1966). About 
one-half of the performances for each 
session in Recall 2 did not satisfy the 
condition k g-Fq-—r. Therefore 
I scores for Recall 2 could not be com- 
puted. For the same reason, J scores 
for three Ss in Recall 1 were not com- 
puted. Thus, the J scores in Table 3 
for Recall 1 (S-Est) and Recall 1 
(90) are based on 27 Ss. 

The data in Table 3 show that de- 
pending on the value of k employed 
in calculating I for free recall, the 
amount of information transmitted can 
be made considerably greater or less 
in free recall than in recognition. Ses- 
sion 1 I scores in the recognition con- 
ditions, Recall 3, and Recall 1 (S-Est) 
were compared by a Kruskal-Wallis 
one-way analysis of variance by ranks 
and a significant H value was obtained ; 
H —961, p< 05. However, there 
were no significant differences among 
the recognition conditions and Recall 
3; H — 261, p.05. Thus it ap- 
pears that significantly less informa- 
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tion is transmitted in free recall, Re- 
call 1 (S-Est), than in recognition 
when Ss’ estimates of k are employed 
in calculating 7 for free recall In 
sharp contrast to the lower J in Recall 1 
(S-Est) than in the recognition con- 
ditions is the finding that Recall 1 
(90) produced higher J values than 
recognition, as shown in Table 3. 
That is, when the spuriously high k 
= 90 is employed to calculate J in free 
recall, more information appears to be 
transmitted in free recall than in recog- 
nition. A Kruskal-Wallis test compar- 
ing Recall 1 (90), Recall 3, and the 
recognition conditions for Session 1 
yielded an H of 12.45 which is signifi- 
cant beyond the .02 level. Table 3 
shows that J in Recall 1 (90) is higher 
than J in Recog-30 and Recog-60. This 
finding is consistent with the results 
obtained by Davis, Sutherland, and 
Judd (1961). However, the data con- 
cerning the number of response alter- 
natives produced in Recall 2 strongly 
suggest that the value k = 90 is inor- 
dinately high for computing / in con- 
ventional free recall, the type of recall 
test used by Davis et al. Their con- 
clusion regarding the superiority of 
free recall over recognition, therefore, 
is erroneous. 

It is noteworthy that Ss in Recall 2, 
unlike those in Recalls 1 and 3, did not 
produce sufficiently high values of & 
to calculate 7. Since lower values of 
I in Recall 1 (S-Est) compared to Re- 
Call 1 (90) were due to lower values 
of k, the implication is that the actual 
amount of information transmitted in 
Recall 2 and for that matter in conven- 
tional free recall is extremely low. 
Also, the higher 7 scores in Recall 3 
than in Recall 1 (S-Est) appears to 
be the result of increased values of k. 
However, in the recognition conditions 
marked variations in I scores as a func- 
tion of & were not obtained. Among 
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the present recognition conditions I 
is apparently equal. These findings in- 
dicate that other factors besides the 
number of response alternatives operate 
to determine the amount of information 
transmitted in recognition and recall. 
Thus, in comparing recognition and 
recall in the amount of information 
transmitted as a function of numbers 
of response alternatives it is well to 
consider possible sources of response 
interference which can bias the informa- 
tion scores. For example, in recogni- 
tion an increase in the number of 
response alternatives produces a con- 
comitant increase in the retention in- 
terval and the amount of interpolated 
activity. Such interference may par- 
tially account for the observed raw 
score differences among the recogni- 
tion conditions and may correspond- 
ingly bias the information measures or 
more likely simply reduce the amount 
of information transmitted. Similar 
difficulties exist in the recall conditions 
where Ss were required to reproduce 
response alternatives during testing. 
Increases in the number of alternatives 
given in recall increases both the reten- 
tion interval and recall interference by 
the reproduction of nonrecalled items. 
The amount of information transmitted 
in recall is clearly influenced by test in- 
structions. Variations in instructions 
which influence Ss’ recall behavior also 
produced complex variations in the raw 
scores obtained. Thus the general issue 
regarding the superiority of recognition 
over free recall is considerably more 
complicated than a restricted use of in- 
formation measures might imply. Fur- 
thermore, there are factors other than 
the number of response alternatives 


791 


which require consideration in formu- 
lating general conclusions about recog- 
nition and free recall. In this connec- 
tion, Lachman, Laughery, and Field 
(1966) have shown that the superiority 
of recognition over free recall is con- 
tingent upon the degree of training. 
The issue of whether recognition is 
superior to recall is by itself meaning- 
less. The appropriate question is that 
concerning the conditions which lead to 
superior recognition performances con- 
trasted with recall performances. Under 
the conditions of the present experi- 
ment, if one is inclined to accept the 
value of k obtained in Recall 2 as a 
reasonable estimate for k in conven- 
tional free recall, then it would follow 
that more information is transmitted 
in recognition than in recall. 
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VARIED AND CONSTANT INTERSIGNAL INTERVALS 


IN PSYCHOLOGICAL REFRACTORINESS? 


ROBERT GOTTSDANKER ax» THOMAS C. WAY? 
University of California, Santa Barbara 


Choice responses to 2 successive signals were made by 8 young men 
with both randomly varied and contant intersignal intervals, ranging 
from 50 to 800 msec. “Delay” interpretations of psychological refrac- 
toriness were disconfirmed by quantitative tests, and evidence was 
against grouping explanations of this failure. The extent to which 
refractoriness is related to time uncertainty was not established defini- 
tively. Although RT: was lengthened with short constant intervals, and 
less degree than with varied intervals, the values of RTs for the 
constant condition are suspect because of the marked variation of RT: 
with interval. The “organization-persistence” prediction of higher 
RTs when opposite responses were required to the 2 signals was not 
realized, possibly because of expectation of reversals, 


Psychological refractoriness is ex- 
hibited when the RT to a signal is 
lengthened by an immediately preced- 
ing signal. (Under some conditions 
the effect is obtained only with a re- 
sponse to the first signal.) In the 
typical experiment, the intersignal in- 
terval on a given trial is selected ran- 
domly from a range of values. This 
procedure confounds length of interval 
and momentary probability, which has 
a value of 1 for the longest interval, 
As a consequence, two rival modes of 
interpreting psychological refractori- 
ness have arisen. The first, which 
takes length of interval to be the deter- 
mining factor, is that there is a delay 
in starting or resuming some phase of 
the processing of the second signal 
(eg. Welford, 1952). The second is 
that there is a retardation in the rate 
of processing a second signal whose 
momentary probability is low (eg., 


1 This research was Supported by research 
grants (MH 10317-01 and 10447-01) from 
the National Institute of Mental Health. 
Findings were reported, in part, at the 
1965 meeting of the Western Psychological 
Association, “Variable and fixed time in- 
tervals in psychological refractoriness.” 

? Now at Human Resources Research Of- 
fice, Fort Bliss, Texas, 


Elithorn & Lawrence, 1955). An ex- 
periment which employs constant inter- 
signal intervals as well as randomly 
varied ones would untie the suggested 
determinants. More generally it would 
assess the contribution of objective 
time uncertainty and thus provide in- 
formation which must be encompassed 
in any explanation of refractoriness. 
In addition, it would permit especially 
broad quantitative testing of the delay 
interpretation. The less precise re- 
fardation interpretation is not subject 
to such direct testing. 

Any monotonically decreasing func- 
tion of RT, with interval is equally 
reasonably assignable to delay (per- 
haps not on all trials) or to retarda- 
tion. Positive behavioral support for 
retardation can come only from a 
demonstration that the same factors 
that produce the monotonic relation 
in the typical experiment may be ma- 
nipulated to produce a nonmonotonic 
relation. The difficulty in accom- 
Dlishing this goal is underlined by 
the problems encountered in the in- 
triguing approach of Elithorn and 
Lawrence, who used a choice between 
two responses followed at a varied in- 
terval by the signal for the remaining 
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response. It seems probable that the 
low values of RT, at the shortest 
intervals reflected simultaneous re- 
sponses, and it is likely that other com- 
plications occurred. However, delay 
hypotheses that state rules of operation 
which hold for every trial can be given 
definite proof or disproof. 

In general terms, present delay inter- 
pretations hold that some stage in the 
processing of the second signal (from 
its appearance as change of energy at 
a receptor to just before its actualiza- 
tion as a response) cannot be started 
or is interrupted during a period which 
begins with—or a fairly well fixed 
time after—the first signal. According 
to the busy-channel hypothesis of Wel- 
ford (1952, 1959) this storage period 
ends with the start of the first re- 
sponse. Following Davis’ (1957) 
modifications a constant could be sub- 
tracted or added which should obtain 
for each trial. Broadbent’s (1958) 
“sampling-interval” hypothesis has the 
period ending a fairly well fixed time 
after it began.? 

The busy-channel prediction is that 
RT, = K + RT, — Interval, for all 
second signals whose potential delay 
stage starts during the storage period. 
It would thus apply to average RT, 
for all intervals well below minimum 
RT,. The sampling-interval predic- 
tion is that RT, = C — Interval. Ac- 
cording to the equations, different short 
intervals under the same condition 
should yield identical values of K and 
of C. Since mean RT, is not expected 
to vary between intervals, the differ- 
ence in predictions for short intervals 
is trivial. However, because the busy- 


3 Unless it is stipulated that the interval 
between the first signal and the start of the 
Storage period is shorter than that between 
the second signal and the end of the vul- 
nerable stage, a normal RT is predicted for 
sufficiently brief intervals. This has never 
been found. 


channel hypothesis holds that some as- 
pect of processing the first signal must 
be completed before the processing of 
the second can start or resume, a posi- 
tive correlation is implied between RT, 
and RT,, so that for statistical testing 
of this hypothesis a trial-by-trial appli- 
cation of the first equation is preferable. 

Earlier work, while extensive, has 
failed to answer satisfactorily the ques- 
tions to which this study is addressed. 
Three experiments have compared re- 
fractoriness with varied and with con- 
stant (or nearly constant) intervals 
under the condition of complete cer- 
tainty as to the second event (Adams, 
1962; Kay & Weiss, 1961; Reynolds, 
1966). Findings regarding the occur- 
rence of refractoriness with constant 
intervals and the question of whether 
there was more with varied intervals, 
are far from consistent. Unfortu- 
nately, in each of the studies there was 
either clear evidence or reasonable sus- 
picion of second responses that were 
not made to the second signal, but 
which were somewhat synchronized 
with it or even frankly anticipatory. A 
more subtle attack on the problem was 
made by Nickerson (1965), who em- 
ployed varied-interval ranges having 
different minimum intervals. Both 
briefness of interval and early position 
within the range increased RT to the 
second signal. A complication is that 
the ranges with lower minimum inter- 
vals had higher relative variability. 
Furthermore, the longest intervals, 
which were of quite different absolute 
values, tended to yield very similar 
values of RT.. 

Four studies have employed choice 
responses (Borger, 1963; Butler, 1959; 
Creamer, 1963; Davis, 1962). Both 
Davis and Borger found that with con- 
stant intervals there was no refractori- 
ness if the first signal did not require 
a response. In each of the studies 
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where there was a response to the first 
signal, refractoriness was reported for 
constant intervals. In the comparisons 
between constant and variable intervals 
both Butler and Creamer found more 
refractoriness for the latter, but only 
Butler's statistical test was significant. 
The studies by Butler, Borger, and 
Creamer all have questionable aspects. 
Butler did not have adequate counter- 
balancing of conditions and located the 
signals for the two responses so far 
apart as to introduce problems of eye 
movements. One of Borger’s three Ss 
made simultaneous responses to the 
two signals and another produced re- 
markably constant interresponse inter- 
vals at the short intersignal intervals, 
as evidenced by correlations between 
RT, and RT, as high as .96. Crea- 
mer’s claims that Ss placed in different 
constant-interval groups were well 
equated and that there was no real 
difference in RT, between constant 
and variable intervals, in both cases on 
the basis of lack of statistical signifi- 
cance, are debatable. The differences 
certainly appear striking. Perhaps 
part of the difficulties stemmed from 
his crude measure of RT, accurate only 
to 33 msec. 

The outcome of making separate 
tests at different intervals of specific 
models which predict RT, has been 
regarded by Davis (1956, 1957) as 
demonstration that the busy-charinel 
interpretation holds with varied inter- 
vals. Yet it has been found in nearly 
every such study, including those by 
Davis, that at the briefest intervals the 
obtained values are lowest relative to 
the predicted ones. Here, RT, is com- 
pared for different intervals, thus test- 
ing a general model in a general way. 

The present study is an improved 
version of that by Butler (1959), 
which was performed under the direc- 
tion of one of the writers. In addition 


to the objectives already stated, a final 
one was to test the specific hypothesis 
that there would be greater refractori- 
ness in making a second response in 
the reverse direction to the first than 
in making one in the same direction. 
This derives from a more general idea 
that the underlying cause of psycho- 
logical refractoriness is the necessity 
of relinquishing a previous organiza- 
tion of response in order to initiate a 
new organization. There should be 
more interference for a response which 
requires greater reorganization. 

Cognizance was taken of the fact 
that the human being is an imperfect 
time keeper even with complete objec- 
tive time certainty (Klemmer, 1956, 
1957). By using sufficiently long 
blocks of test trials, which were pre- 
ceded by practice trials, it was hoped 
to reduce this inaccuracy. 


METHOD 
Apparatus 


Display and controls—The S was seated 
before a table on which the top sloped up to 
the rear at a 30° angle. He was isolated 
in a sound-treated and air-conditioned room. 
The S could see, through a rectangular hole 
cut in the table, the display, which was per- 
pendicular to his line of sight and about 120 
cm. from his eyes. The essential part of 
the display consisted of two pairs of neon 
lamps mounted laterally, one pair on the 
left of the center and one pair on the right. 
Except for that part of the lamp facing S, 
each was masked with black paint to prevent 
reflections off the other lamps. The dis- 
tance between centers of each pair was 1.5 
cm., as was the distance between centers of 
pairs. In the middle of this rectangle of 
lamps was a white fixation cross each arm 
of which was 0.5 cm. in length. Lamps were 
placed far from S and close together to 
eliminate any need for eye movements or 
any tendency to make them. Along the 
plane of sight midway between the two 
lamps on the left, and just at the edge of 
the viewing aperture, a short Plexiglas lever 
was mounted to the table. It was moved 
with thumb and index finger in the direction 
of whichever of the two left-hand lamps 
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went on, away from S (upward) or toward 
him (downward). A similar lever was 
placed to the right of the right-hand pair of 
lamps. There were three additional lamps 
in the display. One was a neon bulb which 
was used as a warning light, at the hori- 
zontal center of the display and 2 cm. above 
the centers of the far (or top) lamps. Be- 
low the signal lamps were a red and a green 
lamp which informed S, respectively, that a 
block of trials was about to begin or that 
it had ended. The arrangement is shown 
in Fig. 1. 

Programming devices.—A. perforated-tape 
reader, activated by pulses from a 0.1% 
accuracy multivibrator, in conjunction with 
digital logic units, was used for all aspects 
of programming: selection of signals, dura- 
tion of signal, intersignal interval, warning 
interval, etc. The neon lamps were driven 
by alternating current with a frequency of 
about 2,000 cps thus lighting both elements. 
Photometric tests indicated that full illumi- 
nation was reached within 1 msec. 

Recording devices.—The programming sys- 
tem also functioned in relation to RT meas- 
urement. Presentation of a signal simul- 
taneously started a 1,000/sec train of pulses 
Írom the multivibrator to an electronic coun- 
ter. When S moved the appropriate lever 
0.3 mm. in the correct direction, a light- 
pressure microswitch was closed which 
stopped the train of pulses by closing an 
and gate. One counter was used to indicate 
the RT to the first signal and another for 
the RT to the second signal. There was 
also a set of lights which indicated incorrect 
responses. 


Procedure 


Conditions —There were two experimental 
conditions tested, Variable and Constant, 
with a control for each condition. 

Variable condition.—The S was instructed 
that on each trial there would be two choices, 
one on the left-hand side and one on the 
right-hand side. He was told which of the 
two sides would be first. For half of the 
Ss the left-hand side was always first; for 
the other half the right-hand side was first. 
One second after the warning lamp flashed, 
one of the two lamps on the first side 
flashed. Then, after an interval of either 
50, 100, 200, 400, or 800 msec. following 
onset of the signal on the first side, one of 
the two lamps on the second side flashed. A 
restricted random order was used to deter- 
mine sequence of intervals within each block 
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Fic. 1. Experimental arrangement. 


of 25 trials. Each light flash had a duration 
of 50 msec. The S's task was to move the 
appropriate lever in the correct direction as 
quickly as possible upon receipt of the sig- 
nal There were 10 sec. between successive 
warning signals. 

Constant condition.—The same procedure 
was followed except that all 25 trials of a 
block had the same intersignal interval. 
Each S had the same sequence, left first 
or right first, as he had in the variable 
condition. 

Control conditions.—Blocks of trials were 
run just as in the experimental conditions, 
except that the first signal was omitted, and 
S made only one response following the 
warning. If on a trial the 100-msec. interval 
was used, this simply meant that there was 
1 sec. plus 100 msec. between warning and 
signal to respond. 

Conduct of the experiment.—Each S par- 
ticipated in eight daily sessions of about 1 
hr. in length, over a period of 2 wk. The 
first two sessions provided equal practice 
for the various conditions. Sessions 3 and 
8 consisted of trials under the control condi- 
tions; four Ss were given the variable con- 
trol condition and the other four the con- 
stant control condition. Sessions 4 through 
7 were each devoted to one of the two ex- 
perimental conditions: variable or constant 
intervals. For each S the sequence of vari- 
able and constant conditions was counter- 
balanced, and four Ss started with each 
condition. Also counterbalanced for the 
constant condition was the sequence of inter- 
vals; both increasing and decreasing orders 
were used. The daily test session consisted 
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of five blocks of 25 trials, the first five trials 
of a block being considered practice. Thus, 
for each combination of condition and in- 
terval, S had 40 trials which entered the 
computations. 

Subjects.—Eight undergraduate men were 
tested. All were between 18 and 22 yr. of 
age and all were right handed. The session 
for a given S was always held at the same 
time of day. 
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Overall indications, Ss combined.— 
In Fig. 2 are compared the mean RTs 
for the varied and constant conditions, 
along with the control RTs, i.e., where 
the first signal was omitted. Anoma- 
lous trials which occurred were re- 
moved from the data and not replaced. 
In the following analyses, the result- 
ant inequities in N were handled 
by weighting all Ss and conditions 
equally. However, the anomalies 
themselves hold considerable interest 
and will be considered later. It is seen 
that there was a marked continuous 
drop in RT, under the varied condi- 
tion. ‘For the constant condition there 
was no difference between the 50- and 
100-msec. intervals but a moderate 
continuous drop thereafter. For all 
intervals except that at 800 msec, 
there was a higher value for the varied 
condition. 
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Comparison with the control condi-" 
tion indicates that this effect was not 
merely due to the time interval be- 
tween the warning and the second sig- 
nal. For the varied condition, control 
means were lower at all intervals ex- 
cept 800 msec. For the constant con- 
dition, the control means were lower 
for the two shortest intervals but 
gradually rose so that there was a con- 
siderably higher value at the 800-msec. 
interval. 

Not much variation is seen among 
the different intervals for RT, under 
the varied condition, with the possible 
exception of a slight rise with interval. 
In contrast, a marked variation with 
interval occurs under the constant con- 
dition, with highest values at 100 and 
200 msec. and a low value at the 800- 
msec. interval. It may further be 
noted that RT, is lower than RT, for 
the short intervals under the varied 
condition, and higher for the long in- 
tervals. Under the constant condition 
RT, is lower than RT, at all intervals 
except 800 msec. 

Table 1 compares the RT, when re- 
sponses were in the same direction and 
when they were in opposite directions. 
Contrary to predictions based on per- 
sistence of organization, it appears 
that there might even be somewhat 
higher values for the same direction. 
Whether responses were made in the 
direction away from S or toward him 
appears to have made little difference 
but with possibly slightly higher values 
for the direction away. Other aspects 
of the table will be considered in rela- 
tion to problems of interpretation of 
RT, difference with interval. 

Statistical tests.—For each of the 
eight Ss, there were four separate 
analyses of variance, one for each com- 
bination of condition (varied or con- 
stant) and response (first or second). 
The summary of number of tests sig- 
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& TABLE 1 
OVERALL MEAN VALUES ‘ 
Varied Intervals Constant Intervals * . 
Measure I LLÉÁL————————————— 
100 | 200 | 400 | 800 
RT. 
Mean Same 265-| 240 | 245 | 240 
Mean Opposite 268 | 255 | 241 | 239 
Mean Toward 260 | 243. | 242 | 237 
Mean Away 272 | 252 | 243 | 243 
Mean of Means 266 | 247 | 243 | 240 
Mean SD 39 44 36 34 
Control 
Mean of Means 244 | 249 | 247 | 258 
Mean SD 44 50 37 53 
RT: 
Mean Same 291 | 281 | 271 | 243 
Mean Opposite 284 | 293 | 261 | 238 
Mean Toward 285 | 285 | 265 | 238 
Mean Away 290 | 289 | 267 | 242 
Mean of Means 287 | 287 | 266 | 240 
Mean SD 40 48 44 36 
Correlation (r) between 
RT; & RT; 51 38 23 AS 


Note.—In milliseconds except for coefficients of correlation. N = 8 for all conditions except control condi- 


tions, where N = 4. 


nificant at the .01 level is shown in 
Table 2. It is seen that for the varied 
condition, RT, did show significant 
variation according to interval for 
seven of the eight Ss, with six of these 


Ss showing a peak mean at 50 msec. 
For the constant condition, the situ- 
ation is not so clear. Half of the Ss 
showed significant variation with in- 
terval, and the location of the peak is 


TABLE 2 


No. or SIGNIFICANT MAIN EFFECTS For Erour Ss (p < .01), SUMMARY OF 32 ANALYSES 


Or VARIANCE, EACH FOR ONE S, FOR RT; on RT», FOR VARIED OR 
Constant CONDITION 


RT: RT: 
Effect df Treatment 
Varied Constant Varied Constant 
Interval 4 50 6 2 1 1 
100 1 1 4 
200 1 2 
400 
‘ 800 
Relation 1 Same 3 
eh Opposite 1 
Direction 1 Toward 
Away 4 
Error 132-166 à 


Note.— Frequencies are shown for the treatment giving the higher or highest mean for each effect. Interac- 


tions are not shown since there were only three significan! 


t instances of a possible 128. 
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"TABLE 3 


CoMPARISON OF VARIED AND 
Constant CONDITIONS 


Effect af Treatment | RT: | RT: 
Condition 1 Varied T 
Constant 4 
1 Short 5 
XVaried| 
Condition Short 5 
Interval XCon- 
stant 
Error 271-325 


Note.—Number of Significant differences for the 
eight Ss (p «.01). Frequency is shown for the 
treatment giving the higher mean for each effect. 
Duplications of the information in Table 2 are not 
shown. 


less stable. As for the comparison of 
same-direction and opposite-direction 
responses, the results are inconsistent. 
For half of the Ss relation. between 
first and second responses made no 
significant effect; where it did, it 
tended to indicate higher RT, for the 
same relationship. Toward and away 
provided no significant differences. 
The variation of RT, with interval 
under the constant condition is seen 
to be reliable, with seven of the eight 
Ss showing significant variation, A 
strong peak appears at the 100-msec, 
interval. Only one S showed signifi- 
cance for interval under the varied con- 
dition. "Under both conditions about 
half of the Ss showed a significantly 
higher RT, for the away direction. 


The higher RT, for the varied con- 
dition is seen to be reliable in Table 
3, which summarizes 16 further analy- 
ses of variance, those in which condi- 
tion was made a variable. Seven of 
the eight Ss show a significantly higher 
mean for the varied condition. Fur- 
ther, the missing S is among those who 
showed a significant interaction of con- 
dition with interval, all having a rela- 
tively larger mean for short intervals. 
Somewhat the reverse situation is seen 
for RT,, where five of the Ss had 
higher mean values at the short inter- 
vals under the constant condition. 

For Ss with the varied control con- 
dition, it is clear from Table 4 that 
RT, was higher than control RT, espe- 
cially at the short intervals, all of the 
four Ss showing the significant inter- 
action. The outcome is less decisive 
for Ss with the constant control condi- 
tion, where a significant interaction 
was found for two of the four Ss. A 
somewhat different analysis of the con- 
trol conditions was made by combining 
all Ss under each of the conditions. 
These analyses are presented in Table 
5. Most important, it is demonstrated 
that there was no reliable interval ef- 
fect for the varied condition while 
there was for the constant condition, 
with its rise in mean RT for the longer 
intervals. In any event, for both con- 
ditions the effect of interval on control 


TABLE 4 
Comparison or RT; AND CoxrRoL RT (Errzcr or Previous SIGNAL RESPONSE) 


Cond. 
Effect df Higher M = 
Varied Constant 
Previous Signal Response 1 RT: 3 1 
5 3 C 
Previous Signal Response X Interval 4 RT. Short 4 2 
Error Histuso Control, Short 


Note.—l 


Ni ignificant 3 
Ge LAE of si t differences for the four Ss with the varied control and the four with the constant 


— th a — —— 
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TABLE 5 
ANALYSIS OF VARIANCE OF CoxTROL RT 


Effect 


& 


Interval (1) 


Direction (D) 
IXS 


CAD 
SXD 
IXSXD 
Within Group 


- 
[Ur 


30.37*** 
1.34 
1.20 


4,80** 
132 


Note.— Error term for effects involving S was within-group MS; for other effects, error term was interaction 


with S MS. 
^p 


RT was far different than its effect on 
RT, That Ss varied significantly is 
neither surprising nor informative, 
similarly the significant interaction be- 
tween S and direction. 

Only the two shortest intervals, 50 
and 100 msec. were used for testing 
the delay predictions, RT, = K + RT, 
—Iand RT,=C — I. If the first hy- 
pothesis is correct, the value of RT, + 
I-— RT, should be the same for the 
two intervals; if the second hypothesis 
is correct, the value of RT, + J should 
be the same. It can be seen in Table 
6, that a higher value was found for 
the 100-msec. interval for both condi- 
tions for each hypothesis. Treating all 
Ss in one analysis for each of these 
four differences yielded F values sig- 
nificant at the .01 level. 

Possible contaminating factors — 
Anomalous trials of several kinds were 
examined: those on which response 
was in the direction opposite that of 
the signal, those on which the response 
occurred before the signal, and those 
on which the response to the first sig- 
nal did not occur before the response 
to the second. The findings are shown 
in Table 7 in terms of the percentage 
of all responses under the indicated 


conditions. In the interest of simplic- 
ity, trials are not included in the analy- 
sis where both responses were in the 
wrong direction. First, they were 
rare; second, an interaction is intro- 
duced which makes interpretation most 
difficult. Also the few trials on which 
there was both the wrong direction and 
wrong order of response were elimi- 
nated. Omitted responses were too 
infrequent to merit study, and are not 
shown. For premature responses, no 
distinction is made as to whether they 
were in the correct or wrong direction. 

Looking first at RT,, it is seen that 
wrong-direction responses tend to peak 


TABLE 6 
Tests or DELAY HYPOTHESES 


Obtained Value 


Hypothesis 


RT.+1-RTi 
RT:+1 


Varied 
Constant 
Varied 

Constant 


Note.—For each row the hypoth: 
same mean for the 50- and se cree, Buenos: ^ 
row differences were statistically significant (p < .01). 
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TABLE 7 
PERCENTAGE OF TRIALS ON WHICH THE INDICATED ERRORS WERE MADE 


— 


Interval 


Type of Error and Cond. 50 100 


n 
o 
a 
o 


Wrong Direction 


RT», Varied 1 
RT», Constant 7. 
RT;, Varied KH 
RT, Constant 3 


Premature 
RT», Varied 0 
RT», Constant 0. 
RT;, Varied 0 
RT;, Constant 0 


i 
oooo 


oooo 


Wrong Order R, R; 


Varied 
Constant 


eid 
Mi 


co 


Note.—Same or opposite direction separated for wong-direction errors, 


at the 100- and 200-msec. intervals. 
Moreover, there appears to be the 
smallest number of these responses 
under the varied condition at the 50- 
msec. interval. For the constant inter- 
val condition, it appears that fewest of 
these responses occur for the longest 
intervals. It is appropriate to raise 
the question, assuming these differ- 
ences to be reliable, of their effect on 
the conclusions which are thus far sug- 
gested from the RT, means. It was 
noted that RT, tended to be slightly 
shorter for wrong-direction than for 
right-direction responses. Tt might be 
argued that these were “impulsive” 
responses which were detected and that 
there were the same number which 
happened to be correct. Following this 
line of reasoning, adjustments which 
would result from eliminating these 
undetected "twins" were made to the 
mean RT, However, the overall ef- 
fect was slight, and did not lead to any 


different outcome in the tests of the 
delay hypotheses. 

The larger number of wrong-direc- 
tion responses when the two signals 
were in the same direction is evident. 
It is possible that S tended to guess 
that the two signals would be in oppo- 
site directions. Whatever the reason, 
it is evident that this factor may have 
disturbed the prediction of a higher 
RT, when an opposite response was 
required, 

Premature responses occurred (with 
one exception) only under the varied 
condition and only at the 400- and 
800-msec. intervals. This appears 
fairly reasonable although the absolute 
discipline of Ss in this respect under 
the other conditions is remarkable. A 
similar finding has been made by Nick- 
erson (1965), who pointed out that the 
mean RT, at long intervals is very 
likely artifactually low due to inclusion 
of responses that were really premature 
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in that they were inaugurated before 
the second signal but occurred after it. 

Now, considering the first response, 
it is seen that there were fewer wrong- 
direction responses, and apparently 
little difference between the same- 
direction and opposite-direction pairs 
or between the varied-interval and 
constant-interval conditions. There 
were somewhat more of these re- 
sponses for the 50-msec. interval than 
for the other intervals. Premature re- 
sponses simply did not occur to the 
first signal. : 

Under the constant condition there 
were quite a few instances of wrong- 
order responses at the 50-msec. inter- 
val and some at the 100-msec. interval. 
Under the varied condition the rare 
instances that occurred at the 50-msec. 
interval would seem attributable to 
the variability of the RT, and RT, 
distributions. The question is whether 
these variabilities were unusual will 
now be considered. 

Mean values of standard deviations 
for both RT, and RT, may be found 
in Table 1. It is noteworthy that the 
values for RT, do not increase at the 
short intervals, which is an expecta- 
tion (as will be discussed later) of 
grouping. The other point of interest 
is the remarkable stability of values 
for RT, under the varied condition. 

Mean values of correlation coeffi- 
cients between RT, and RT, are also 
shown in Table 2. They were ob- 
tained by averaging the Z transforms 
(Fisher, 1941). Various explanations 
are possible for the generally higher 
values at the short intervals. No at- 
tempt was made to explain. However, 
when considered in conjunction with 
variation of RT, with interval, they 
do contribute to the interpretation of 
RT, differences among intervals. 

The mean correlation coefficients 
and mean standard deviations were 
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50 usec. 
100 MSEC, 


— 200 usec. 
400 usec. 


200 
180-100 220  :40 160 180 300 3:0 340 340 
MT fuste) 


Fic. 3. Variable condition: Mean regres- 
sion, RTs on RT; out to + 1.34 SD in RT.. 
(Intercept of vertical bars show location of 
mean values of RT; and RT») 


used to obtain mean linear regressions 
of RT, on RT, for each interval under 
the two conditions. The two sets are 
shown in Fig. 3 and Fig. 4. The short 
vertical bars across each line indicate 
the mean value in both RT, and RT.. 
The lines are drawn to a distatnce of 
1.34 SD from the mean in RT,. This 
includes 8096 of the cases in a normal 
distribution. 

It is seen that the lines for the varied 
condition are regular; there is gener- 
ally less slope (indicative of lower cor- 
relation coefficients) as interval in- 


E 
OL thats rip Tio n 300 325 40 560 
A iuste 


Fic. 4. Constant condition: Mean regres- 
sion, RTs on RT; out to + 1.34 SD in RT.. 
(Intercept of vertical bars show location of 
mean values of RT: and RT:.) 
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creases. The similarity of RT, means 
and the clear differences among RT, 
means seen also in Fig. 2 are note- 
worthy. For any particular RT,, the 
same order holds in most probable 
value of RT,. 

For the constant condition, the situ- 
ation is quite different. The slopes are 
generally steeper at the shorter inter- 
vals. Moreover, the mean RT; value 
for 800 msec, is as low as the lowest 
value shown for the 50- and 100-msec. 
intervals. At these short intervals, 
where S was able to make his RT, 
short, apparently he was at no dis- 
advantage in making a short RT,, 
compared with values for the other in- 
tervals. It is curious that S seems to 
have adopted a long RT, as a tech- 
nique at the 100- and 200-msec. inter- 
vals, when it is quite clear that this 
does not lead to a short RT, The 
combined effect of interval on RT, 
means and RT,-RT, correlation makes 
it evident that the differences in RT, 
values among intervals cannot be taken 
at face value for the constant condition. 


Discussion AND CONCLUSIONS 


Since there were fewer "side effects" 
under the varied condition, it will be con- 
sidered first. The existence of refractori- 
ness was shown beyond doubt ; its dura- 
tion remains uncertain. A general drop 
in RT, was found as interval increased 
Írom 50 to 800 msec. However, the oc- 
currence of frank anticipatory responses 
at the 800-msec, interval and even at the 
400-msec. interval throws doubt upon the 
genuiness of the mean RT, obtained at 
these intervals. There doubtless were 
additional responses which were initiated 
before the signal at these intervals but 
which were not brought to culmination 
before the signal. It is not safe to con- 
clude that Psychological refractoriness 
persisted beyond 200 msec. 

Tests of the delay hypotheses were 
negative. Values of RT, at the 50-msec. 
interval were too close to those found at 


100 msec. Any effect of expectancy 
would be to increase the difference, so it 
can not provide an explanation. Welford 
(1952, 1959) has emphasized the possi- 
bility of a grouped processing of the two 
signals as an explanation for some of 
the low RTs at brief intervals. How- 
ever, the mere existence of these low 
values does not constitute evidence of 
grouping. The problem is to demon- 
strate grouping by means of independent 
indexes. Davis (1959) reasoned that 
in a grouped response there would be a 
high value of RT, coupled with a low 
value of RT, giving rise to a negative 
correlation between RT, and RT,. How- 
ever, there are reasons for expecting a 
positive correlation between RT, and 
RT, at short intervals. First of all, there 
should be little fluctuation of alertness 
between the instants of the first and sec- 
ond signals. Second, as was stated in 
the introduction, the busy-channel hy- 
pothesis itself predicts a positive corre- 
lation. The only effect of grouped trials 
would be to reduce the value of the posi- 
tive correlation, 

More defensible indexes of grouping 
are high values of mean RT, and SD of 
RT,, both of which could result from the 
occasional grouped response, and the oc- 
currence of trials on which the response 
to the first signal did not take place be- 
fore the response to the second. These 
effects were not found, so the grouping 
explanation of the failure of the tests to 
Support the delay interpretations was not 
upheld. Admittedly, grouping may be a 
more subtle phenomenon than was here 
conceived and might be detected by other 
indexes, 

There are other delay possibilities to 
consider. Should delay take place on 
only part of the trials, the difference be- 
tween RT, at the 50- and 100-msec. in- 

*It may appear from the values in Table 6 
that while the differences were significant, 
they were of such low magnitude as actually 
to lend support to the hypotheses. However, 
the obtained differences in RTs between the 
50- and 100-msec. intervals was 22 msec., 
which is less than half that predicted by 
either hypothesis, one prediction being 53 
msec. and the other 50 msec. 


‘ 
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terval, would diminish. However, this 
outcome in turn leads to an expectation 
of greater variability of RT, for the short 
intervals, where delay should occur, than 
for the longer ones. As is seen in Table 
1, there was exceptional stability among 
intervals for RT, under the varied condi- 
tion. In this connection, the busy-channel 
hypothesis itself very definitely implies 
greater variability for RT, at the shorter 
intervals since the normal variability 
should be augmented by the variability 
of the end of the delay period, which 
would be comparable to that of RT,. It 
is difficult to conceive of any revised 
delay hypothesis which could cope with 
these counterindications. 

Turning now to the constant condi- 
tion, it may be asked whether psychologi- 
cal refractoriness was demonstrated. Os- 
tensibly it was, as RT, in general de- 
creased with interval, although there was 
no difference between the 50- and 100- 
msec. intervals. However, the influence 
of interval on RT,, coupled with differ- 
ences in correlation between RT, and 
RT, for the different intervals, makes it 
questionable to take RT, differences at 
face value. That RT, should be espe- 
cially low at 800 msec. is not too sur- 
prising. It was found by Gottsdanker, 
Broadbent, and Van Sant (1963) that 
RT to a single choice was shorter than 
to a choice which was consistently fol- 
lowed by a second choice after ł sec. 
Apparently the 800-msec. interval pro- 
vided somewhat the same condition in 
respect to the first response as that of no 
Second choice. However, this is only 
part of the picture, as seen in the dip in 
RT, at the 50-msec. interval, now com- 
bined with a fair number of responses 
made in the wrong order. 

Even though the negative tests under 
the constant condition of the delay inter- 
pretations cannot be given much weight 
for the foregoing reasons, this particular 
disproof is not vital because of the dis- 
proof under the varied condition which 
has already been discussed. What cannot 
be substantiated is that there was demon- 
stration of psychological refractoriness 
under the constant condition. Conse- 


803 


quently, there is no firm basis for accept- 
ing a causal factor independent of time 
uncertainty. The same considerations 
would hold for Borger's (1963) and Crea- 
mer's (1963) evidence for psychological 
refractoriness under the constant condi- 
tion, since they too obtained great RT, 
differences with interval. 

Thus, the existence of a factor related 
to time uncertainty was not established. 
To be sure, for all intervals except that 
of 800 msec. there was a higher value of 
RT, for the varied condition. However, 
the problem of interpreting RTT, under 
the constant condition remains. More- 
over, the varied condition gave lower 
values of RT,. 

It should be noted that all of the in- 
terval effects depended upon the occur- 
rence of the first signal (and perhaps on 
the response to it, as both Borger and 
Davis have shown). Interval effects 
were minimal and of a different nature 
when the first signal was omitted under 
each of the conditions. 

The prediction, stemming from the 
assumption of persistence of organiza- 
tion that there would be a longer RT, 
when the second signal demanded the 
opposite response from the first, was not 
realized. Perhaps the explanation may 
be found in the greater occurrence of 
incorrect responses when the second sig- 
nal called for the same response. This 
points to an expectation of reversal, for 
reasons unknown. 
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USE OF A DELAYED SIGNAL TO STOP A VISUAL 
REACTION-TIME RESPONSE' 


JOSEPH S. LAPPIN? Au» CHARLES W. ERIKSEN 


University of Illinois 


In a visual RT experiment, 5 male Ss were each confronted with 2 
lights and instructed to respond to 1 light but not respond when 
both lights occurred. The onset of the 2 lights was made asynchronous 
by 0, 12, 33, or 63 msec. Probability of inhibiting the response 
declined markedly for each delay and increased with RT. It was con- 
cluded that choice RT is correlated with the extent of the stimulus 
information processing and the duration of effective stimulation which 
has preceded initiation of the motor event. 


This experiment was motivated by 
an earlier attempt (Lappin & Eriksen, 
1964) to replicate a finding that a sim- 
ple visual reaction time can be inhib- 
ited by a second stimulus (Helson & 
Steger, 1962). No effect from the 
second stimulus was obtained in the 
replication. However, extension of 
the original experiment suggested an- 
other approach to the phenomenon 
found by Helson and Steger. If S is 
instructed to respond when one of two 
lights occurs but not to respond if both 
lights occur, how asynchronous can the 
onset of the two lights become before 
S’s ability to inhibit is impaired? 

Tn light of the demonstrations (Raab 
& Fehrer, 1962; Raab, Fehrer, & 
Hershenson, 1961) that the simple RT 
response is sensitive to only the first 
millisecond or so of the visual stimu- 
lus, perhaps a retroactive inhibitory 
effect from a second stimulus should 
not have been expected in the previous 
experiment. However, some question 
Still remains as to S's ability to make 
use of a delayed signal when the ex- 
perimental task requires the response 
to be modified when the signal appears. 


1This research was supported in part by 
Public Health Service Research Grant MH- 
1206 and a Public Health Service Research 
Career Program Award K6-MH-22014. 

2 Now at Carnegie Institute of Technology. 


Thus, when S must decide on each 
trial whether or not to respond, the 
time span of the effective visual stimu- 
lus may be greatly increased; the “psy- 
chological moment” (cf. Stroud, 1956) 
may be longer for choice RT than for 
simple RT. Indeed this is one way of 
viewing the results of Fehrer and Bie- 
derman (1962) and Fehrer and Raab 
(1962) who found no difference in the 
reaction times to visual stimuli phe- 
nomenally masked by metacontrast 
when compared with those which were 
not masked. A somewhat similar sug- 
gestion also appears in some recent 
data of Kristofferson’s (1965). The 
decrement which he expected to result 
from modal uncertainty in a bisensory 
RT task was not obtained with simple 
RT, but only when some discrimina- 
tion was required of the signals in each 
modality. 

The present experiment is relevant 
also to observations of what is called 
the “psychological refractory period.” 
In general it is found that the response 
to the second of two successive stimuli 
is less rapid than to the first (cf. Ad- 
ams, 1961; Reynolds, 1964). How- 
ever, numerous experiments have indi- 
cated variously that the source of this 
effect lies in (a) the operation of a 
central decision process (e.g., Davis, 
1959; Nickerson, 1964; Welford, 
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1952), (b) a multichanneled stimulus 
but single-channeled stimulus detector 
(Adams & Creamer, 1962; Creamer, 
1963 ; Kristofferson, 1965), (c) an in- 
adequate “set” to respond to the sec- 
ond stimulus (Adams, 1962; Adams 
& Chambers, 1962; Elithorn & Law- 
rence, 1955), or (d) response compe- 
tition (Reynolds, 1964). 

The apparent discrepancies and di- 
versities in these studies may result 
from the fact that there are a number 
of conceptually distinct stages in the 
stimulus-response sequence where se- 
lection of one sort or another may 
occur. “Channels” might impose re- 
strictions on load at perceptual, deci- 
sional, or motor levels, and so also 
“expectancy” might rear its head at 
several points. It remains to be shown 
that the decision necessitated in the 
choice RT experiment does or does 
not modify the temporal organization 
of visual signals from that operating 
in simple RT. 

The present experiment can be de- 
scribed as follows: Two stimuli are 
presented in what is presumably the 
same sensory channel. One of these, 
the “go signal,” appears on every trial, 
and the other, the “stop signal,” has 
event uncertainty. Associated with 
the two stimuli are perfectly competi- 
tive responses—response vs. no re- 
sponse—which might also be assumed 
to occupy the same “channel.” From 
some points of view the existence of 
event uncertainty and response compe- 
tition are likely to place the delayed 
stop signal at a disadvantage in affect- 
ing the response. On the other hand, 
the fact that both members of the 
stimulus and response pairs are han- 
dled by the same channel might imply 
that only the operation time of a deci- 
sional stimulus-response mediator ex- 
ists to modify their asynchrony from 
input to output. ! 
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In short, this experiment attempts to 
examine the relation between psycho- 
logical and real time in effecting a dis- 
junctive response to a visual stimulus. 


METHOD 


Apparatus—The apparatus was basically 
similar to that of a previous experiment 
(Lappin & Eriksen, 1964). The stimulus 
lamps, however, were slightly brighter 
NESIH bulbs, operated with dc power in 
order to eliminate variation in onset 
asynchrony. Both remained on until S 
responded. The two lamps, mounted 1 in. 
apart, subtended about 1.5* of visual angle. 
Separate Hunter interval timers controlled 
the three foreperiods and an automatic tape 
reader and relay device was used to select 
the foreperiod and stimulus events according 
to a preset random sequence. In addition, a 
second Hunter klockounter was placed in 
front of S to provide feedback on every 
trial. 

Subjects.—Two graduate and three under- 
graduate male students in psychology at the 


University of Illinois served as paid 
volunteers. 
Procedure—Each experimental session 


consisted of 300 trials and was approximately 
lhr.long. The foreperiod was 1.5, 2, or 2.5 
sec. and there were 5 sec. between the onset 
of the stimulus for one trial and the warn- 
ing signal (a buzzer) of the next trial. Ten 
practice trials preceded the experiment 
proper. A break was taken after each 75 
trials. Two conditions were run within a 
session, each consisting of 150 trials. The 
stop signal (a designated one of the two 
lamps, unchanged for a given S) was pre- 
sented on a randomly selected 50 of these 
trials, and on the remaining 100 trials S 
responded to only the go signal (the other 
lamp). For a given condition, the stop 
signal was simultaneous with the go signal 
on 25 trials and delayed by a fixed amount 
on the other 25 trials. Four experimental 
conditions are distinguished by the delay of 
the stop signal: 0, 12, 33, and 63 msec. 

S was run three times in each condition 
and six sessions were run using each possible 
combination of two conditions. Prior to the 
six experimental sessions each S' served in 
two practice sessions, using all four experi- 
mental conditions. 

The S was instructed to respond as fast as 
possible to the go signal and still be able to 
inhibit the response on about 75% of the 
stop-signal trials. Thus, were it possible, 
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TABLE 1 


Mean RT To Go SiGNaL (RTa), PROBA- 
BILITY OF INHIBITING TO SIMULTANEOUS 
CP(1s)] AND DzLAvED [P(Ip)] Stop 
SIGNALS, AND SD or RTa FOR 
EacH S AND Eacu Conn. 


Cond. RTa P(Is) P(I») SD 
S1 

0 226 -68 77 38 

12 229 .19 65 31 

33 243 .89 53 29 

63 265 96 39 37 
S2 

0 233 .69 60 47 

12 232 75 55 39 

33 274 99 48 48 

63 301 96 41 57 
S3 

0 229 72 75 36 

12 240 85 65 43 

266 OL 43 44 

63 293 99 43 48 
S4 

0 236 77 83 53 

12 229 87 60 60 

33 257 .89 32 65 

63 252 .95 31 50 
S5 

0 272 oT. 64 45 

12 277 43 57 42 

33 316 93 59 67 

63 342 95 56 71 

Average 

0 240 73 72 44 

12 241 .80 60 44 

33 271 92 47 52 

63 291 .96 42 54 


he was to inhibit to 10096 of the simultaneous 
stop signals and 50% of the delayed signals. 


RESULTS 


In Table 1 appear the mean RT to 
the go signal (RT), probability of 
inhibiting the response to the simul- 
taneous stop signal [P(Is)] and to the 
delayed stop signal [P(Ip)], and SD 
(square root of average variance) of 
the RTgs for each S and condition. 
The pooled P(Is) and P(Ip) for each 
condition are plotted in Fig. l as a 
function of mean RTg. (For the 0 
delay condition P(Is) and P(Ip) de- 
rive from identical conditions but dif- 
ferent random sequences.) It is read- 
ily apparent from these data that de- 
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Fic. 1. Average probability of inhibiting 
to simultaneous and delayed stop signals as 
a function of RT for each condition. 


laying the stop signal by only a few 
milliseconds is sufficient to produce a 
marked change in S’s ability to inhibit 
his response. That is, while the Ss 
maintained a P(I) of about 75%, the 
inhibitions were less likely to occur to 
the delayed stop signal as its asyn- 
chrony increased. 

From the upper curve of Fig. 1, 
which was obtained from those trials 
where the stop signal was simultaneous 
with the stimulus to respond, it may be 
seen that the P(Is) increases rapidly 
as the RTg lengthens. In order to 
obtain further information on this as- 
pect of the data, product-moment cor- 
relations were computed for the 12 
mean RTg and P (Is) obtained for each 
S. The five r’s for these data from 
S1 to S5, respectively, are as follows: 
94 (p< .001), 81 (p «.005), 76 
(p«.005), .33 (p «.15), and .69 
(p< .01). It seems clear that for a 
given S, P(I) is quite dependent upon 
RT. 

Another aspect of the data which is 
related to the correlation of P(I) with 
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Fic. 2, Relative frequency distributions for RTe and RTs 


zero delay condition, 


RT is the distribution of the RTs ob- 
tained on those trials when the stop 
signal was presented but the S failed 
to inhibit the response (RTs). Rela- 
tive frequency distributions of RTg and 
RTs were made from the data of the 
0- and 12-msec. delay conditions, Fre- 
quencies were tallied for each 10-msec. 
interval above and below the mean 
RTs, rounded to the nearest .01 sec., 
of each session. The resulting RTg 
distributions based on 1,500 trials and 
RTs distributions based on 325 trials 
from each of the two stop signals, are 
shown in Fig. 2 and 3. (It should be 
realized that these distributions are af- 
fected to some degree by between- 
session variability and that the number 
of RTs obtained from each session is 
not constant.) It may be seen that up 
to a point the two distributions are 
quite similar. Of particular interest is 
the overlap Occurring at the lower tail 


from the pooled data of the 


standardized about the mean RT of each session. 


of the distributions, where the response 
appears to have been made too rapidly 
to permit the simultaneous stop signal 
to have any effect. As RT increases 
it becomes less likely that a response 
will occur in the presence of a stop 
signal. 

An important question raised in 
the present experiment concerns the 
amount of time by which RT must be 
increased in order that S may inhibit 
his response when the stop signal is 
delayed. In Fig. 1, the increases in 
mean RTg which result from stop- 
signal delays are also accompanied by 
a decrease in ability to inhibit [a de- 
crease in P(Ip)]. It is necessary to 
correct the RT measure for the change 
in P(Ip). Perhaps the most meaning- 
ful statistic for this purpose is the per- 
centile point in the RT, distribution 
corresponding to 1 — P(I). This sta- 
tistic might be taken to represent a 
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Fic. 3. Relative frequency distributions for RTo and RTs from the pooled data of the 
12-msec. delay condition, standardized about the mean RTs of each session. 


critical RT such that for faster RTs 
inhibition is unlikely and for longer 
RTs the response may usually be in- 
hibited. Thus, the relationship be- 
tween the increase in this RT statistic 
and in the stop-signal delay provides 
some notion as to the changes in RT 
necessary for S to make use of the 
delayed information. This data is pro- 
vided in Table 2. It may be seen that 


TABLE 2 


PERCENTILE Pornt IN RTe DISTRIBUTION 
CORRESPONDING TO 1 — P(Ip) FOR 
EacH S AND COND. 


Cond. | Si S2 S3 S4 S5 | Average 
0 | 200] 222 | 204 | 192 | 253 | 214 
12 218 | 230 | 225 | 216 | 265 | 231 
33 | 236 | 271 | 267 | 271 | 293 | 267 
63 |269| 306 | 294 | 264 | 322 | 291 


when the stop signal was delayed by 
12, 33, and 63 msec, the RT statistic 
showed an average increase of 17; 53; 
and 77 msec., a rather close corre- 
spondence. It seems clear that when 
the stop signal is delayed, a very 
slightly greater amount of time than 
this delay must be added to the RTc 
in order for S to inhibit his response 


with equal success. 
DISCUSSION 


The above results portray three phe- 
nomena: (a) The effective visual stimu- 
lus for the disjunctive RT response may 
include only the first 12 msec. or less 
of the physical stimulus. (b) The abil- 
ity to inhibit the response improves as 
a function of RT. In fact, a response 
may be made to the mere onset of stimu- 
lation which is faster than the choice 
response capable of utilizing the informa- 
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tion in a simultaneously presented stop 
signal. (c) If the signal to stop is de- 
layed by a few milliseconds from the 
signal to respond, then § must slow his 
RT by a slightly greater amount of time 
if he is to maintain the same ability to 
stop the response. 

These results offer strong support for 
the conclusion from the earlier experi- 
ment (Lappin & Eriksen, 1964) that de- 
layed stimulation should not be expected 
to interfere with a simple RT response. 
This conclusion is also reminiscent of 
previous work with the simple RT (Feh- 
rer & Biederman, 1962; Fehrer & Raab, 
1962; Raab & Fehrer, 1962 ; Raab, Feh- 
rer, & Hershenson, 1961). 

It seems clear that the longer RTs 
Which are commonly observed with the 
disjunction procedure have not resulted 
merely from a marked lengthening of 
the "critical duration" of the stimulus for 
simple RT (Raab & Feher, 1962 ; Raab, 
Fehrer, & Hershenson, 1961). It is 
striking that the physical energy for in- 
stigation of this choice response is inte- 
grated over such a brief portion of time, 

The results of this study point to two 
specific components of the variance in 
RT for a voluntary response to a visual 
stimulation. First, since RT is shown 
to correlate with the ability to modify 
the response in accord with stimulus in- 
formation, RT reflects the degree of 
stimulus Processing which underlies the 
motor output. In Hyman’s (1953) 
words, RT varies with information 
transmitted. Secondly, from the fact that 
RT must be lengthened in order for the 
Tesponse to be inhibited when the stop 
signal is delayed, it is evident that when 
RT increases so also does the duration 
which has preceded 
Suggestion of a 
in Hershenson's 
imal intersensory 
T occurs with an 
equal to the mean 


r Thus, the Motor portion 
of both simple and disjunctive RT seems 


relatively constant 
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duration, which is unmodifiable once initi- 
ated. Taylor and Birmingham (1948) 
several years ago concluded that control 
of the responses made in tracking be- 
havior is complete once the motor event 
is underway. 

An interesting question arising from 
the above response model concerns the 
voluntary control of RT. Certainly, the 
fact that Ss were able to adjust RT to 
yield a P(I) of approximately 75% for 
each session implies that some aspect of 
the stimulus processing time may be 
regulated. One may wonder how such 
voluntary control interacts with stimulus 
intensity and temporal uncertainty effects 
on RT. The question is; What makes 
RT so variable? 

With respect to the psychological re- 
fractory period phenomenon, this study 
indicates that such an effect probably 
does not reside with the operation of 
signal reception or processing portions of 
the stimulus-response sequence. That is, 
utilization of a second signal required 
only that RT be increased by approxi- 
mately the same time as the delay of 
the second signal. The fact that this 
response delay is actually slightly greater 
than the delay of the second stimulus 
might be expected since the delayed stop 
Signal is accompanied by a slightly longer 
go signal which may thus be effectively 
more intense. This interpretation of the 
Psychological refractory period conflicts 
with Nickerson's (1964) report which 
indicates a refractory period from de- 
laying part of the stimulus information 
for the response. The reason for this 
discrepancy is not clear, Although the 
Present study agrees with Reynolds’ 
(1964) conclusion that the refractory 
period is not sensory, his response com- 
petition hypothesis would seem to have 
Predicted a greater increase in RT for 
the delayed signal conditions than was 
observed. Support from this study leans 
toward a response expectancy interpre- 
tation of some of the earlier findings. 
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RETROACTIVE INHIBITION AND THE SENSITIVITY 
OF DICHOTOMOUS INDICANTS? 


HARRY P. BAHRICK Ax» NANCY REYNOLDS 
Ohio Wesleyan University 


Absolute and relative RI are com 


pared for an easy and a difficult rec- 


ognition task and a recall task, after 2 degrees of OL. Contrary to 
previous conclusions, it is shown that RI can increase or decrease with 
increased OL, and that recognition tasks may show larger or smaller 


amounts of RI than recall tasks. 
function of task difficulty and the 


determine sensitivity of measurem 


group’s distribution of associative 


RI is accounted for as a combined 
degree of training. These variables 
ent by their effect upon the control 
strength in relation to the threshold 


of recall or recognition at the end of the retention interval. Measur- 
ing RI in ebbs is shown to be useful in minimizing these artifacts, but 


may introduce other artifacts, 


Indicants are dichotomous if they 
yield only two categories for the classi- 
fication of phenomena, Ordinarily 
measures of recall and recognition have 
been used as dichotomous indicants, 
le. responses are classified as correct 
or incorrect. Most of our information 
regarding the nature of retention is 
based upon comparisons of the number 
or percentage of correct answers on 
recall or recognition tests, as a func- 
tion of various independent variables. 
In two previous articles (Bahrick, 
1964, 1965) it was shown that such 
comparisons are greatly influenced by 
variation of the sensitivity of the 
dichotomous indicant, but are often 
erroneously interpreted as intrinsic ef- 
fects of the independent variable. This 
has resulted in the questioning of some 
traditional generalizations, two of 
which are to be reexamined here: (a) 
recognition memory is more resistant 
to retroactive inhibition (RI) than re- 
call memory, and (b) the amount of 
RI decreases as a function of increased 
degree of original learning (OL). 
Both of the above generalizations have 

1 This research was su ort i 
Grant HD00926-05 from fete ed 


tute of Health, United States Public Health 
Service. 


extensive support. For many years 
it was believed that recognition mem- 
ory is not at all subject to RI effects 
(Heine, 1914). Later it was shown 
that recognition performance can be 
affected by RI (Dahl, 1928; Lund, 
1926), but the reports of all investiga- 
tors agree that RT effects are substan- 
tially greater for recall than for recog- 
nition performance (Hollingworth, 
1913; McMullin, 1942; Postman, 
1952; Postman & Rau, 1957). 

The inverse relation between RI and 
the amount of OL was confirmed by 
Briggs (1957), Postman and Riley 
(1959), and others, These data have 
recently been summarized in the im- 
portant article by Slamecka and Ceraso 
(1960) who concluded: “all reports 
agree that the susceptibility of the 
original material to RI was inversely 
related to its level of acquisition.” 

The most general method of deter- 
mining RI is to compare retention per- 
formance for a control group and an 
experimental group, where the experi- 
mental group practices some specific 
task during the retention interval, but 
the control group does not. If the ex- 
perimental group suffers a greater 
decrement in retention, the excess is 
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attributed to the RI effect of the inter- 
polated task. It is customary to use 
dichotomous indicants (recall or recog- 
nition) because RI effects are often 
transitory, and if the comparison is 
based upon saving scores (the only 
commonly used nondichotomous indi- 
cant of retention) the measurement is 
so insensitive that it is usually not pos- 
sible to demonstrate any RI (Slamecka 
& Ceraso, 1960). 

Although the dichotomous indicants 
are more sensitive, their sensitivity is 
greatly affected by characteristics of 
the distribution of associative strength 
and by the relation of the mean of this 
distribution to the threshold of the in- 
dicant being used. Distributions of 
associative strength are created when 
several Ss learn a list of several asso- 
ciations. At the end of training the 
associations will usually vary in 
strength and those associations corre- 
sponding to the portion of the distribu- 
tion exceeding a threshold level will be 
correctly recalled. Tf a smaller percent 
is recalled at a later time, a further 
portion of the distribution of associa- 
tive strength has dropped below the 
threshold level during the interim. 
This change not only reflects the 
amount of weakening of the various 
associations, but is also a function of 
the variance and shape of the distribu- 
tion, as well as the relation of the 
threshold to the mean of the distribu- 
tion at the beginning of the retention 
interval (Bahrick, 1965). 

In previous investigations comparing 
RI effects for recall and recognition, 
highly discriminable alternatives were 
used on the recognition tasks. As a 
result, the recognition tasks were much 
easier than the recall tasks and a 
constant number of training trials dur- 
ing OL caused a great deal of over- 
learning with respect to the threshold 
of recognition but not with respect to 
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the threshold of recall ( Bahrick, 1964). 
The interpolated learning (IL) would 
bring a large portion of the distribution 
of associative strength below the recall 
threshold (yielding large RI effects 
upon recall performance), but would 
not weaken the associations sufficiently 
to bring a large portion of the distribu- 
tion below the lower recognition 
threshold. This leads to the generali- 
zation regarding greater RI effects 
upon recall By using recognition 
tasks of a difficulty level both above 
and below that of the recall task it can 
be determined whether the difference in 
RI effects upon recall and recognition 
memory is intrinsic to the type of indi- 
cant, or whether the effect results from 
the unilateral confounding of the type 
of indicant used with the degree of OL 
in relation to the threshold of the indi- 
cant. The present study is designed to 
answer this question. 

Another previously neglected vari- 
able concerns the relation between indi- 
cants used for the interpolated and 
the original task. Comparisons of RI 
effects upon recognition and recall tasks 
are more meaningful if the effects of 
interpolating both recognition and recall 
tasks can be compared. Previous com- 
parisons have made use only of inter- 
polated recall tasks, and have thus 
failed to control this important variable. 

In regard to the general relation be- 
tween RI and OL it would follow from 
the earlier rationale that dichotomous 
indicants are most sensitive to RI ef- 
fects when a large portion of the asso- 
ciations learned by the control group 
are barely above threshold strength at 
the end of the retention interval. 
Slight interference effects caused by the 
IL should then cause large amounts of 
RI since they will be sufficient to bring 
many of the associations below thresh- 
old, causing large absolute and relative 
differences between the performance of 
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TABLE 1 

SUMMARY OF EXPERIMENTAL DESIGN 

G Trials | Type of m of 
Designation | OL. | RE Retest 
8-RL-RL 8 Recall | Recall 
8-Rec-RL 8 Recog. | Recall 
8-C-RL 8 None | Recall 
8-RL-ER 8 Recall | Easy Recog. 
8-Rec-ER 8 Recog. | Easy Recog. 
8-C-ER 8 None | Easy Recog. 
8-RL-DR 8 Recall | Diff. Recog. 
8-Rec-DR 8 |Recog. | Diff. Recog. 
8-C-DR 8 None | Diff. Recog. 
16-RL-RL 16 Recall | Recall 
16-Rec-RL, 16 Recog. | Recall 
16-C-RL 16 None | Recall 
16-RL-ER 16 Recall | Easy Recog. 
16-Rec-ER 16 Recog. | Easy Recog. 
16-C-ER 16 None p Recog. 
16-RL-DR 16 | Recall | Diff. Recog. 
16-Rec-DR| 16 Recog. | Diff. Recog. 
16-C-DR 16 None | Diff. Recog. 


fhe control and experimental groups. 
If Ss of the control group have greatly 
overlearned the associations, so that 
most of the associations are still far 
above threshold at the end of the reten- 
tion interval, or if the associations were 
originally so weak that most of them 
are already below threshold at the time 
of recall, the possibility of revealing RT 
effects for an experimental group is 
clearly diminished. This leads to the 
prediction that some intermediate level 
of OL should strengthen the associa- 
tions to the level at which dichotomous 
measures are most sensitive, i.e., will 
show maximum percent changes in RI 
for a certain average amount of loss 


of associative strength, sustained as the 
result of the IL, 


METHOD 


The design was arranged to test the effects 
of two types of interfering tasks (a recog- 


DR; and a recall indi RL) aft 
degrees of OL. x Bises 
groups were trained under th 12 (2 
X2) conditions. In su os 
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degrees of OL) and tested after the same 
intervals. The control groups experienced 
no IL. Three of the control groups were 
available from a previous study (Bahrick & 
Bahrick, 1964). Each control and experi- 
mental group consisted of 10 male and female 
Ss (180 Ss altogether). The Ss were 
student volunteers from Ohio Wesleyan Uni- 
versity and were assigned to the various 
groups by systematic alternation. The Ss 
of the three available control groups were 
chosen 6 mo. earlier by the same method 
from the same population. Table 1 sum- 
marizes the treatment of the 18 groups. 

Original learning.—The training material 
consisted of seven paired associates. The 
stimulus member of each pair was an 8 X 8 
matrix with one of the 64 cells filled in, the 
remainder left blank. The response member 
of each pair was a randomly chosen 3-digit 
number. The material and procedure were 
identical to that described by Bahrick and 
Bahrick. 

The material was projected on a screen 
and one to four Ss serving simultaneously 
had to write the response numbers into the 
exposed row of an answer sheet during the 
4-sec. interval when only the stimulus mem- 
ber was exposed on the screen. At the ex- 
piration of that interval a shutter closed over 
the appropriate position of the answer sheet 
while the response number was shown on the 
screen for 4 sec. together with the stimulus. 
Each trial took 56 sec. (8 sec, for each of 
seven S-R pairs); and a 130-sec. intertrial 
interval was used. The sequence of the 
Paired associates was randomized in suc- 
cessive trials. 

As shown in Table 1, one-half of all the 
experimental and control Ss received 8 trials 
of OL, the other half 16 trials, regardless of 
their performance. 

Interpolated learning —The interpolated 
training material consisted of seven paired 
associates of the same type used in original 
learning. The stimulus position in the 
matrices was one cell removed in a random 
direction from the comparable stimulus used 
in OL, and the response numbers shared 1 
digit with the response numbers used dur- 
ing OL, the other 2 digits being chosen 
randomly. The assignment of the new re- 
sponses to the new stimuli was randomly 
varied from the pairings used in the OL 
material. 

The same training material was used in 
connection with both types of interpolated 
tasks. The Ss experiencing recall-type in- 
terference received eight trials of anticipation 
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training with the new material under condi- 
tions identical to those described for OL. 
The same procedure and material were used 
for Ss receiving recognition training, but 
these Ss also were given recognition tests 
during each intertrial interval. The recog- 
nition tests consisted of a chart with 7 rows 
and 10 columns. Each row of the chart 
corresponded to one of the paired associates. 
Within each row the correct stimulus was 
randomly positioned among four incorrect 
ones. (The incorrect stimuli used positions 
one cell removed in a random direction from 
the correct one.) The correct response num- 
ber was randomly positioned among four 
wrong numbers which shared 2 of the 3 
digits with the correct number. The third 
digit was randomly chosen. A sample row 
from the recognition test is shown in Fig. 1. 
The recognition test was prepared with three 
different sequences of rows representing the 
seven paired associates; these three versions 
also randomly varied the position of the 
correct stimuli and responses within each 
row. During successive intertrial tests the 
three versions of the recognition test were 
alternated. The recognition tests were ad- 
ministered one row at a time, by an exposure 
device which allowed S 8 sec. to mark the 
recognition responses for both stimulus and 
response of each paired associate, A shut- 
ter closed over each row of the test blank 
at the end of 8 sec., exposing the next row 
for an equal duration. Thus, the administra- 
tion of the test took 56 sec. 

Retesting—All retesting occurred 2 hr. 
after original training terminated. The Ss 
were instructed to report for further training, 
and no attempt was made to control their 
activity during the unused time. When Ss 
returned they were given easy recognition, 
difficult recognition, or recall tests over the 
original material according to their group 
assignment. The recall test simply consisted 
of an additional anticipation trial adminis- 
tered under conditions identical to those used 
during OL. 

The recognition tests were administered by 
a procedure identical to that described under 
IL above. Two types of recognition tests 
were used. On the easy test (ER) the cor- 
rect matrix was placed among four in- 
correct ones in which the stimulus position 
was three cells removed in a random direc- 
tion from the correct one. The four incorrect 
response numbers shared none of their digits 
with the correct one. On the difficult test 
(DR) the incorrect stimulus positions were 
only one cell removed from the correct one, 


815 


305-34-204-305-304 


E 


Fıc. 1. One row from the difficult 
recognition test. 


and the incorrect response numbers shared 
two of their three digits with the correct one. 


RESULTS AND DISCUSSION 


In order to establish whether per- 
formance during OL was comparable 
for all groups, anticipation perform- 
ance on the eighth trial of OL was 
compared by means of an analysis of 
variance. The variation among the 18 
group means was found not significant 
(p > 25). 

Anticipation performance on the 
eighth trial of IL was also compared 
for the 12 experimental groups by an 
analysis of variance. The variation 
among the group means was found not 
significant (p > .10). This confirmed 
an earlier finding (Bahrick & Bahrick, 
1964) that interpolation of recognition 
tests during the intertrial intervals had 
no significant net effect upon anticipa- 
tion performance in this task. 

All the scores for recognition per- 


TABLE 2 


ANALYSIS OF VARIANCE FOR 
RETENTION PERFORMANCE 


Source af MS F 
Degree of Trainin: 

OL) f 1| 20.20| 6.43* 
Type of Measure (M)| 2 72.40 | 23.05** 
Type of Interference 

Task (IL) 2| 95.14 | 30.20** 

Experimental vs. 
ntrol Groups 1 | 189.37 | 60.31** 
Recognition vs. 

Recall Task 1 .92 29 
OLXM 2 1.72 E 
OL X IL 2 1.71 54 
MXIL 4 6.17 | 1.97 
OL XM XIL 4 3.90 | 1.24 

Within 162 3.14 
* 05. 
2$ Sor 
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TABLE 3 
ABSOLUTE RI, RrLATIVE RI, AND Epp Losses OF ALL EXPERIMENTAL GROUPS 
Absolute | Relative RI 
Dene naire ay urea d RI x "RI In Ebbs 
Experimental 
Groups 
M E M E 
8-RL-RL 2.40 1.68 8-C-RL 4.40 1.80 2.00 45% .15 
8-Rec-RL 3.00 1.61 1.40 32i o 5 
8-RL-ER 2.92 2.04 8-C-ER 6.00 1.10 3.08 B o 1.28 
8-Rec-ER 2:33 2.56 3.67 61 do 1.56 
8-RL-DR 1.66 1.07 8-C-DR 2.00 1.10 Tn HA Ha 
-Rec-DR 1.19 1.10 5 Zo E 
IRI RI 2.40 143 16-C-RL 5.60 1.85 3.20 5795 1.25 
16-Rec-RL 2.70 249 2.90 52% 1,12 
16-RL-ER 3.76 2.31 16-C-ER 6.20 .87 2.44 39% 1.13 
16-Rec-ER 3.96 2.00 2.24 36% 1.56 
16-RL-DR 2.31 1.38 16-C-DR 3.78 1.51 1.47 39% 54 
16-Rec-DR 1.22 98 2.56 68% 1.05 


formance were corrected for chance 
success by subtracting .04 of the num- 
ber of wrong responses from the 
number of correct responses. This 
correction reflects the joint probability 
of a correct guess on both stimulus and 
response members of a pair. 

The retention data were subjected to 
an analysis of variance, the results of 
which are shown in Table 2. The 
analysis shows significant effects as a 
function of the degree of OL [07 
< .025), the type of retention measure 
($ < .005), and the type of IL (p 
X.005).  Orthogonal comparisons 
show that the variance produced by the 
IL is almost entirely due to the differ- 
ences between the combined experi- 
mental groups contrasted with the con- 
trol groups ($ < .005), and not to the 
difference between the two types of IL 
used with the experimental groups. 
This analysis established that highly 
Significant absolute RI effects oc- 
curred ; however, the absence of signifi- 
cant interaction effects limits the con- 
clusions which can be drawn in regard 
to relative amounts of RI under the 
various treatment conditions. 

Inspection of the absolute and rela- 
tive amounts of RI shown in Table 3 


however does permit several important 
conclusions relevant to the present 
inquiry. (The absolute amount of RI 
is obtained by subtracting the mean 
performance of each experimental 
group from the corresponding control 
group mean. Relative RI is computed 
by dividing absolute RI by the corre- 
sponding control group mean and mul- 
tiplying the quotients by 100.) The 
old generalization regarding greater 
RI effects upon recall than upon recog- 
nition no longer holds, With 8 trials 
9f OL the ER groups sustain the 
largest relative and absolute RI losses, 
the DR groups sustain the smallest 
losses, with the Recall groups occupy- 
ing an intermediate position. With 16 
trials of OL the relative and absolute 
RI losses have increased for the DR 
groups, but decreased for the ER 
groups, and RI is now largest for the 
groups tested with the Recall measure. 

It would thus appear that the abso- 
lute as well as the relative amount of 
RI is a combined function of task diffi- 
culty and the degree of training. Abso- 
lute RI is small when these two vari- 
ables combine to produce a distribution 
of associative strength for a control 
group either too far above or below the 
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threshold of the retention task used. 
The low RI scores for the DR groups 
illustrate the latter case. The task 
difficulty was so great that the control 
group receiving 8 trials of OL scored 
only a mean of two correct responses 
on the retention test. Thus five- 
sevenths or 71% of the distribution 
was below threshold, and as a result 
the DR measure was no longer sensi- 
tive to the additional interference ef- 
fects sustained by the experimental 
groups. With 16 trials of OL the DR 
control group performed substantially 
better ; about 54% of the distribution 
was above the DR threshold and thus 
the sensitivity of the measure to addi- 
tional interference effects was in- 
creased. The opposite phenomenon 
was observed for the ER task. Here 
the RI effect was greatest with 8 trials 
of OL, because the control group per- 
formance was 86% correct and quite 
sensitive to the effects of additional in- 
terference. With 16 trials of OL many 
of the associations apparently were 
greatly overlearned, so that they re- 
mained above the ER threshold, even 
after the IL; and thus the additional 
practice diminished rather than in- 
creased the absolute amount of RI for 
this comparatively easy task. 
Previous findings (Briggs, 1957; 
Postman & Riley, 1959; etc.) have 
shown that increased OL may lead to 
increased absolute RI, but that rela- 
tive RI is always diminished. Sla- 
mecka and Ceraso (1960) have ac- 
counted for this by pointing out that: 


When the degree of OL is low, the control 
group’s retention is low and even slight 
departures from this base-line on the part of 
the experimental group will represent a sub- 
stantial percentage difference; whereas when 
the control’s recall is high, the same absolute 
difference will reflect a lesser percentage 
change, and relative RI will have decreased, 
while absolute RI will have remained the 
same [p. 450]. 
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The present results suggest that abso- 
lute RI scores are subject to analogous 
changes of sensitivity, and that in- 
creased sensitivity of the absolute RI 
measure with increased OL can bring 
about increases in relative RI. This is 
illustrated by the increase in relative 
RI for the DR task when OL is in- 
creased from 8 to 16 trials. 

Heine (1914) recognized that the 
level of performance of the control 
group is an important determinant of 
sensitivity of the RI measurement, and 
she controlled this variable for various 
levels of OL by compensatory varia- 
tions of the retention interval. As 
Postman and Riley (1959) have 
pointed out, however, this introduces 
another source of variation, and per- 
haps for this reason this important con- 
trol has been neglected in more recent 
investigations of RI. The identical 
variable plays an even greater role 
when RI effects are compared for re- 
call and recognition performance, but 
Heine and all subsequent investigators 
have failed to consider this. 

The nature of the interfering task 
produced no significant individual or 
interaction effects, and this is not sur- 
prising in the light of the comparable 
performance of all experimental groups 
on the eighth trial of IL. 

Ebb analysis —The preceding analy- 
sis suggests that traditional measure- 
ment of RI confounds the desired esti- 
mates of losses of associative strength 
due to IL with artifacts due to scoring 
sensitivity. These artifacts arise when 
a distribution of associative strength 
passes below a dichotomous threshold, 
because the distribution is not of uni- 
form density. Asa result a small aver- 
age loss of associative strength of the 
distribution may yield large amounts 
of absolute and relative RI if the di- 
chotomous threshold is passing through 
a dense area of the distribution, and 
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vice versa. One means of coping with 
this general problem was suggested in 
an earlier paper (Bahrick, 1965). It 
involves plotting estimates of the 
change of the mean of the distribution 
of associative strength in units of vari- 
ance of the distribution, which are 
called ebbs. 

To apply the procedure to the pres- 
ent situation, the estimate of the posi- 
tion of the mean is obtained first for 
the control group by converting the 
percent correct score into a z score, 
using a table of integral values of the 
normal curve. Thus the control group 
for recall performance with eight trials 
OL scored a mean of 4.40 out of 7.00 
possible correct responses, i.e., 6395 
Correct. The corresponding z value is 
+.33, and this is the best estimate for 
the position of the mean of the distri- 
bution of this control group in relation 
to the recall threshold. The 8-RL-RL 
experimental group scored a mean of 
2.0 correct responses out of 7.00, i.e. 
28.5% correct. This corresponds to a 
# score of —.57, the best estimate of 
the mean of this distribution in rela- 
tion to the recall threshold, Thus, the 
best estimate of the amount by which 
the mean of the distribution has 
dropped as the result of the IL is from 
+.33 to —.57, for a total distance of 
90 ebbs. 

The last column of Table 2 shows 
the loss in ebbs computed on this 
basis for each of the experimental 
groups. If this method were completely 
successful, the differences in ebb losses 
among groups would reflect different 
average losses in associative strength, 
due to the experimental conditions in- 
dependent of the above mentioned 
scoring artifacts, Comparison of the 
ebb losses sustained by various groups 
showed the following: (a) the losses 
are larger for the ER and recall meas- 
ures than for the DR measure; (b) 
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the losses are larger after 16 trials of 
OL than after 8 trials; (c) the losses 
on recognition performance are greater 
with recognition-type interference than 
with recall-type interference, and vice 
versa; and (d) the increased degree 
of OL now has a more uniform effect 
for all types of measures. 

No analysis of the significance of 
these effects is possible because esti- 
mates of ebb losses for the distributions 
of individual Ss are unobtainable in 
many instances. This is true for all Ss 
who scored either 100% or 0% on 
their retention tasks. 

The greater uniformity of the OL 
effect upon various measures is in ac- 
cordance with expectation and suggests 
that the ebb transformation is useful 
in obtaining a measure of RI less sub- 
ject to artifacts of scoring sensitivity. 
However, the remaining differences as 
a function of the type of indicant used 
and of the degree of OL suggest the 
persistence of scoring artifacts. Thus 
the ebb transformation does not solve 
all problems. 

Ideally, the RI loss specified in ebbs 
should give an estimate of the average 
amount of associative strength lost by 
a distribution of scores as a result of 
the IL. However, an unbiased esti- 
mate can be obtained only if all the 
distributions of associative strength 
involved in the comparison have ap- 
proximately equal variance and are 
approximately normal. If either of 
these assumptions is not met, the z- 
Score transformation introduces new 
artifacts. The present results could 
be reasonably well accounted for if the 
variance of the distributions diminishes 
with additional training from 8 to 16 
trials. No data are available for a 
comparison of variance and shape of 
distributions of associative strength be- 
fore and after IL, or after different 
degrees of OL. Examination of such 
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distributions at different stages of 
training and after various retention in- 
tervals must become an important goal 
in the quest for better understanding 
of the phenomena of learning and 
retention, 
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The Ist experiment varied the anticipation interval and the pair- 
presentation interval during the learning of the A-C list in a B-C, 
A-B, A-C mediation paradigm. The Russell and Storms (1955) 
material showed significant amounts of mediated transfer but there 


Was no suggestion of an interaction between mediation and either of 
the intervals. The 2nd experiment varied stimulus meaningfulness 
and anticipation interval on the A-D list after learning an A-C, A-B, 
A-X, or E-X list constructed from the Russell and Storms material. 
Stimulus meaningfulness, anticipation interval, and lists were effective 
variables and there was an interaction between lists and stimulus 
meaningfulness but no interaction between lists and anticipation inter- 
val The results were discussed in terms of an associative probability 


theory. 


At the time this series of studies was 
initiated, it was commonly agreed that 
verbal-mediating responses require time 
to occur (Barnes & Underwood, 1959; 
Cofer, 1961, p. 194; Goss, 1961) but 
there was no relevant evidence about 
the effect of presentation time on 
transfer in paired-associate learning. 
It seemed plausible that both the an- 
ticipation interval (AI) and the inter- 
val permitted for study of the stimulus- 
response pair (SR I) could affect the 
use of mediating responses. Bugelski 
(1962) and Runquist (1965) found 
the number of reported mediators di- 
rectly related to the SR I but Williams 
(1962) failed to confirm this relation- 
Ship. If the SR I is too brief to per- 
mit the discovery and rehearsal of me- 
diating responses, the learning during 
the SR I and the recall during the AT 
cannot be facilitated by mediating re- 


1This research was Supported by National 
Science Foundation Grants G-14726 and GB- 
1666. Experiment I was part of an honors 
thesis submitted by the junior author. Eileen 
M. Bodie, Katherine J. Kosinski, Ayn Fox, 
Robert Kaplitz, and Marie Gavazzi assisted 
in the collection of the data for Exp. II. 


sponses. On the other hand, if medi- 
ating responses are available, they can- 
not facilitate recall unless the AI is 
long enough to permit the chain of re- 
Sponses to occur. It is also possible 
that a short AI, which does not permit 
the use of mediating responses to aid 
recall, will affect the learning strategy 
of Ss during the SR I and thus have 
Some effect on performance. 

The first experiment in this study 
varied the AT and the SR I in a B-C, 
A-B, A-C paradigm. The second ex- 
periment used different procedures and 
varied stimulus meaningfulness in an 
attempt to increase the amount of 
transfer. The AI was varied in the 
Second experiment while SR I was 
held constant. 


EXPERIMENT I 
Method 


Lists—Two of the B-C-X sequences from 
the Russell and Storms (1955) material 
were eliminated (MEMORY-MIND-AFRAID, and 
TROUBLE-BAD-MUSIC) and the remaining eight 
B-C-X sequences used to construct three 
lists. The A-C or test list was constructed 
by using the C items from Russell and Storms 
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as responses and the numbers 1 through 8 as 
stimuli. Two forms of the first list were 
constructed in order to control for the diffi- 
culty of the A-C pairs. One form consisted 
of four A-B and four A-X pairs and the 
other of the remaining four A-B and four 
A-X pairs. Thus in learning two lists all Ss 
learned four A-X and four A-B pairs in the 
first list and the eight A-C pairs in the 
second list. The C items were the primary 
associations to the corresponding B items 
and the X items were not among the 10 
most frequent responses to the B or C items. 

Design.—There were two forms of the 
first list and they were learned to a cri- 
terion of either one perfect trial or three 
successive perfect trials. The second list was 
presented at either a 1:1-, 1:3-, 3:1-, or a 
3:3-sec. rate. Thus with two first-list forms, 
two degrees of first-list learning, two Als, 
and two SR Is there were 16 independent 
groups of Ss. Mediation (A-B, A-C or 
A-X, A-C pairs) varied within the lists so 
that each S was his own control for the 
amount of transfer due to mediation. The 
eight conditions within the low degree of 
first-list learning were listed in counter- 
balanced order and Ss assigned in order of 
their appearance at the laboratory. The 
same eight conditions were later replicated 
with the higher degree of first-list learning 
and Ss were assigned in the same fashion. 

The Ss were students from introductory 
college courses and eight Ss were assigned to 
each of the 16 independent conditions. 
Twenty-one Ss were dropped because of 
equipment or E malfunctions but none were 
dropped for failure to learn. 

Procedure—All lists were presented by a 
16-mm. stripfilm projector and each list was 
presented in six different random orders to 
prevent serial learning. All learning was 
by the anticipation method and S was not 
required to pronounce the stimuli as they 
were presented. The paired-associate in- 
structions were read to S and the first list 
presented at a 2:2-sec. rate with a 4-sec. 
intertrial interval. After attaining the cri- 
terion of first-list learning, all Ss learned 
the same second list for five trials. The 
presentation rate of the second list was 
either 1:1, 1:3, 3:1, or 3:3 sec., according 
to the condition, and the intertrial interval 
was 4 sec. 


Results 


First-list learning—An analysis of 
variance on the number of trials re- 
quired to attain the criterion of one 
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Fic. 1. Exp. I: Mean number of correct 
responses on the A-C list as a function of the 
anticipation interval and the pair-presenta- 
tion interval (n= 32). 


perfect trial showed there were no sig- 
nificant sources of variance. The Ss 
who learned the first list to a criterion 
of one perfect trial required a mean of 
9.12 trials to attain the criterion while 
Ss who learned the first list to a crite- 
rion of three successive perfect trials 
required a mean of 8.89 trials to attain 
a criterion of one perfect trial. There 
was no indication that the 16 groups 
of Ss differed in learning ability. 
Second-list learning—An analysis 
of variance was computed on the total 
number of correct responses during 
the five trials of second-list learning. 
The only significant Fs were Degree 
of First-list Learning (5.18), AI 
(213.21), SR I (31.39), Mediation 
(17.54), and Mediation x Form (4.01). 
The degrees of freedom were 1 and 112 
in each case. Although prior learning 
of A-B and A-X pairs produced a dif- 
ference in A-C learning there was no 
indication that mediation interacted 
with degree of first-list learning, AI, 
or SRI. The Fs were less than 1 for 
each of these three interactions. The 
data were examined trial by trial and 
in terms of the first correct response 
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on the test list but there was no indi- 
cation of an interaction between medi- 
ation and AI or SR I. On the five 
learning trials the 128 Ss gave 8.63 
mean correct responses for the medi- 
ated pairs and 7.55 mean correct re- 
sponses for the nonmediated pairs, 

Degree of first-list learning, form, 
and type of pairs were combined and 
the mean correct responses on the five 
second-list learning trials are presented 
in Fig. 1 as a function of AI and SR 16 
Performance was a direct function of 
both AI and SR I with no suggestion 
of an interaction. These results agree 
with a study by Nodine (1963) which 
varied AI and SR I of single-list 
learning, 


Experiment II 


Although there was no indication of 
an interaction between mediation and 
Al or SR I in Exp. I the small amount 
of mediated transfer and the rapid 
second-list learning made another at- 
tempt seem worthwhile. Several 
changes were made in an attempt to 
produce more mediated transfer. Ho- 
mogeneous lists were used so that 
more pairs could be used within each 
level of mediation and additional con- 
ditions were added so the direction of 
transfer could be assessed. The mean- 
ingfulness of the stimuli were varied 
since, in addition to any direct effect 
on learning, 


able than SR I so A 


Exp. II while SR T was held constant 
at 2 sec. 


Method 


Lists.—The Stimuli were two groups of 10 
CVC trigrams from Archer (1960). The 
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mean association value of the low-meaning- 
fulness stimuli was 21.4 while that of the 
high-meaningfulness stimuli was 96.5, The 
formal similarity of the two groups of 
stimuli was the Same ; there were 18 different 
letters with 8 appearing once, 8 appearing 
twice, and 2 appearing three times within 
the list. The responses for the lists were 
the 10 B-C-D-x Sequences from Russell and 
Storms (1955). The stimuli and responses 
were paired so that a stimulus item was not 
obviously related to any of the responses 
paired with it, the first letter of a stimulus 
was not the first letter of any of its re- 
Sponses, and the last letter of a stimulus was 
not the last letter of any of its responses. 

Four lists were constructed within each 
level of meaningfulness, The trigrams were 
paired with the D responses and then with 
the corresponding C, B, and X responses. 
The lists were A-D, A-C, A-B, A-X, and 
E-X with the E-X list consisting of the 
A-X list from the other level of meaningful- 
ness. Thus when Ss learned the A-D list, 
they had Previously learned a list with the 
Same stimuli and responses related to the 
D responses by a direct Primary association 
(A-C), by a two-step chain (A-B), or with 
responses unrelated (A-X). Both the 
stimuli and responses in the E-X list were 
not related to the A-D list, 

Procedure.—All Ss learned two lists by 
the anticipation method with a 4.sec. inter- 
trial interval for the first list and an 8-sec. 
intertrial interval for the second list. 
Standard paired-associate instructions were 
used for both lists with no indication that the 


arranged in five 
random orders to prevent serial 


A-B, A-X, 
criterion of two successive perfect trials. 

hree minutes after learning the first list 
the instructions for second-list learning were 
read and the A-D list learned to a criterion 
of one perfect trial with a minimum of five 
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TABLE 1 


MEAN NuMBER OF TRIALS REQUIRED TO LEARN THE FIRST 
List AND THE SECOND LIST TO CRITERION 


First List Second List (A-D) 
rr Anticipation Interval Anticipation Interval 
1 Sec. 2 Sec. 4 Sec. 
H/A-C 13.67 11.17 12.25 
H/A-B 15.00 13.83 10.58 
H/A-X 10.83 10.50 15.08 
H/E-X 12.50 15.50 11.92 
Combined 
H Lists | 13.00 12.75 1246 
L/A-C 14.42 16.92 14.58 
L/A-B 16.67 10.75 13.58 
L/A-X 15.50 21.58 13.92 
L/E-X 9,83 12:50 10.83 
Combined 
LLists | 14.10 15.44 13.23 


presented and S had no information concern- 
ing the correctness of the responses. 
Design and subjects.—Two levels of stimu- 
lus meaningfulness (H or L), four types of 
first lists (A-C, A-B, A-X, or E-X), and 
three second-list AIs (1, 2, or 4 sec.) re- 
sulted in 24 conditions and independent 
groups of 12 Ss were assigned to each of 
the 24 conditions. The Ss were run individ- 
ually but in two groups. The 18 conditions 
consisting of the two levels of meaningful- 
ness, three second-list AIs, and the A-B, 
A-C, and A-X lists were listed in six differ- 
ent random orders and Ss were assigned to 
the conditions in sequence as they appeared 
at the laboratory. After the 108 Ss were 
run, another schedule was constructed con- 
sisting of the same 18 conditions and the 
additional 6 conditions required by the ad- 
dition of an E-X list. These conditions with 
the 6 E-X conditions appearing twice as 
often as the others were listed in six dif- 
ferent random sequences. The Ss were 
assigned to the sequence as they appeared at 
the laboratory and the result was 12 Ss in 
each of 24 conditions. After the data was 
collected, the 18 conditions which were 
replicated were subjected to an analysis of 
variance on the trials to criterion on the 
first list and on the trials to criterion on 
the second list. Neither replication nor 
any of the interactions with replication were 
significant sources of variance so the results 


will be treated as if all conditions were run 
at the same time. 

The Ss were college students from intro- 
ductory courses and none of them had pre- 
viously served in a verbal-learning experi- 
ment. In addition to the 288 Ss who com- 
pleted the experiment, 32 Ss were dropped 
due to malfunctions of Es and equipment and 
1 S failed to learn to criterion. 


Results 


The analyses of variance presented 
will refer to a 2 (stimulus meaning- 
fulness) X 4 (lists) X 3 (AI) analy- 
sis with independent groups and 264 
df for the error term. The Fs will be 
presented in parentheses after the 
source of the variance. The .05 level 
of significance will be used but all Fs 
with p < .10 will be reported. 

First-list learning.—The mean num- 
ber of trials required for each group to 
learn the first list to the criterion of 
two successive perfect trials are pre- 
sented in Table 1. The Fs were sig- 
nificant for Meaningfulness (4.55), 
Meaningfulnes X Lists (4.66), and 
Meaningfulness X Lists X AI (2.80). 
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TABLE 2 
MEAN TIME TO CRITERION ON THE SECOND List AND THE MEAN NuMBER 
or Tora Correct RESPONSES ON THE First FIVE TRIALS 
Time to Criterion Total Correct Responses 
ARA Anticipation Interval ah Anticipation Interval Mrs 
Intervals Intervals 
1 Sec. 2 Sec. 4 Sec, 1 Sec. 2 Sec. 4 Sec. 

H/A-C 23.25 18.33 24.50 22.03 24.92 35.67 33.67 31.42 
HAS 33.25 24.00 27.50 28.25 21.50 27.33 33.83 27.56 
H/A-X 33.25 28.33 37.50 33.03 19.83 26.83 27.25 24.64 
H/E-X 17.25 19.67 20.00 18.97 2747 31.83 36.08 31.69 
Combined 

H Lists | 26.75 22.58 27.38 25.57 23.35 30.42 32.71 28.82 
L/A-C 29.75 30.00 23.00 27.58 27.50 31.33 37.00 31.94 
L/A-B 45.50 22.00 34.50 34.00 14.33 29.17 28.42 23.97 
L/A-X 40.50 44.33 37.00 40.61 12.67 18.92 26.67 19.42 
L/E-X 37.50 34,33 37.50 36.44 16.33 23.33 25.17 21.61 
Combined 

L Lists 38.31 32.67 33.00 34.66 17.70 25.69 29,31 24.24 


The E-X list stimuli were opposite in 
meaningfulness to the way they are 
grouped in Table 1 so another analysis 
of variance was computed with the 
E-X lists interchanged, i.e., all lists in 
the high- and low-meaningfulness 
groups actually had the degree of 
stimulus meaningfulness specified. The 
mean trials to criterion for the high- 
and low-meaningfulness groups was 
12.17 and 14.82, respectively, Mean- 
ingfulness (13.78) and Meaningfulness 
x Lists x AI (2.57) were significant 
but Meaningfulness x Lists (1.58) 
was not. Examination of Table 1 
shows that lists with high-meaningful 
stimuli were less difficult than those 
with low-meaningful stimuli and there 
was no systematic effect of AI. The 
significant Meaningfulness x List x 
AT interaction was Produced by a ten- 
dency for the 2-sec, AT groups, within 
lists, to have Opposite relative Positions 
in the two levels of meaningfulness, 
Second - list learning. — The mean 
number of trials required to learn the 
A-D list to the criterion of one perfect 
trial are presented in Table 1. Mean- 


ingfulness (24.96), Lists (6.96), and 
AI (49.30) were significant sources of 
variance and Meaningfulness x AI 
(2.99) and Lists x AI (2.07) ap- 
proached significance. Trials to cri- 
terion decreased as AI increased and 
the list with high-meaningful stimuli 
was less difficult than the one with low- 
meaningful stimuli. The A-C, A-B, 
A-X conditions within the levels of 
meaningfulness increased in difficulty 
in that order. However, the E-X con- 
ditions differed in their difficulty, rela- 
tive to the other conditions, in the two 
levels of meaningfulness. The H/E-X 
condition required fewer trials to cri- 
terion than any of the other high- 
meaningful conditions. The L/E-X 
condition was approximately the same 
as the L/A-B in difficulty. 

Each S's trials to criterion score was 
Converted to a time measure by multi- 
plying the number of trials to criterion 
by the sum of the AT and SR I. The 
mean time to criterion is presented in 
Table 2 for each group. Meaningful- 
ness (26.33) and Lists (849) were 
significant sources of variance while AI 
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MEAN CORRECT RESPONSES 


Low HIGH 
STIMULUS MEANINGFULNESS 


Fic. 2. Mean total number of correct 
responses on the first five trials of A-D 
learning as a function of stimulus meaning- 
fulness and of the first list (n = 36). 


(2.56) and Meaningfulness X Lists 
(2.55) approached significance. The 
F for Lists x AI was 1.64. Table 2 
shows that the list with low-meaning- 
ful stimuli required longer to learn 
than the one with high-meaningful 
stimuli. The A-C, A-B, A-X lists 
were again in that order of difficulty 
within both levels of meaningfulness ; 
H/E-X required less time to learn than 
other high-meaningful conditions, and 
L/E-X required approximately the 
same amount of time as the L/A-B 
condition. There was a tendency, al- 
though it did not reach an acceptable 
level of significance, for the 2-sec. Al 
condition to require less time to cri- 
terion than either the 1-sec. or 4-sec. 
condition. The lack of effect of AT 
on learning time apparently conflicts 
with the results of a study by Nodine 
(1965) who found that time to learn 
increased as AI increased. However, 
Nodine varied the AI in first-list learn- 
ing, used a more difficult list, and used 
a different criterion of learning. 

The mean total correct responses on 
the first five trials are presented in 
Table 2. — Meaningfulness (45.10), 
Lists (33.82), AI (83.30), Meaning- 


fulness X Lists (10.31), and Meaning- 
fulness X Lists X AI (3.19) were all 
significant. Lists X AI (1.03) did not 
approach significance. The significant 
interaction of Meaningfulness, Lists, 
and AI was produced by the tendency 
for the relationship between the 2-sec. 
and 4-sec. AI to be reversed for the 
lists in the two levels of meaningful- 
ness. The results are not systematic 
and agree with the results on the first- 
list learning so they may be primarily 
due to original differences in the 
groups. The three AIs were com- 
bined and the mean total correct re- 
sponses are presented in Fig. 2. Al- 
though there tends to be more differ- 
ence between the A-C, A-B, A-X con- 
ditions with the low-meaningful stimuli 
the major difference is the relative 
positions of the E-X groups. There 
is very little difference between the 
H/A-C, H/E-X, and L/A-C groups. 
The effects of lists with low-meaningful 
stimuli are similar to the results of 
McGehee and Schulz (1961) who also 
used the Russell and Storms material. 

None of the analyses showed a sig- 
nificant Lists x Al interaction. The 
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MEAN CORRECT RESPONSES 


Fic. 3. Mean number of correct responses 
on the first five trials of A-D learning as a 
function of anticipation interval in the A-C 
and E-X condition with low-meaningful 
stimuli (n= 12). 
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MEAN CORRECT RESPONSES 


Fic. 4. Mean number of correct responses on 
the recall trials of the first lists (n= 36). 


L/A-C and L/E-X conditions were 
selected as those most apt to interact 
with AI and the mean correct re- 
Sponses on the first five trials are pre- 
sented in Fig. 3 as a function of list 
and AI. Neither the analysis of the 
first five trials nor of the first three trials 
produced a List x AI interaction 
Which approached significance. The 
AI had no differential effect on the 
condition which produced the most 
transfer and the condition which should 
be least likely to produce mediating 
responses. 

Recall.—This experiment was de- 
Signed to assess the transfer to the 
second list so there are a number of 
possible problems in considering the 
recall of the first list. The first lists 
were different and there were no con- 
trol groups without interpolated learn- 
ing; the second list was learned a 
minimum of five trials and some Ss at- 
tained the criterion before the five trials 
were completed; the time required for 
Second-list learning varied significantly. 
Tn spite of these Problems the recall 
trials show relationships which do not 
seem to be related to any of them so 
the data will be presented. 

The five recall trials of the first list 
were presented at a 2:2-sec. rate and 
no information was given to S concern- 


JACK RICHARDSON AND BRUCE L. BROWN 


ing the correctness of the responses, 
The analysis of variance included trials 
as a within-S variable and Lists 
(26.57), Meaningfulness x Lists 
(2.74), Trials (107.31), and Trials 
X Lists (7.87) were significant. The 
F for AI (1.03) did not approach sig- 
nificance so the three AIs were com- 
bined and the mean number of correct 
responses on each trial are presented 
in Fig. 4. The increase in the number 
of correct responses with trials agrees 
with other results (eg. Butler & 
Peterson, 1965; Richardson & Grop- 
per, 1964) but the main concern here 
is the difference between lists and the 
Meaningfulness x Lists interaction. 
The recall in the L/A-C, L/E-X, and 
H/E-X conditions was very high and 
approximately the same for the three 
conditions. The other conditions 
showed poorer recall and tended to 
cluster after the first trial with the ex- 
ception of H/A-C which showed slightly 
better performance. Figure 4 shows 
that the recall is apparently not related 
to either the difficulty of the second list 
nor to the time required to learn the 
second list. The important point here 
is that the E-X condition, which had 
different stimuli and different responses 
in the two lists, showed superior recall 
and, of the two A-C mediation lists, the 
L/A-C lists showed recall equal to the 
E-X lists while the H/A-C recall was 
inferior to the E-X. In contrast to the 
second-list learning, where the Mean- 
ingfulness X Lists interaction was pro- 
duced by the E-X lists, the Meaning- 
fulness X Lists interaction in recall of 
the first list was produced primarily by 
the difference in the recall of the A-C 
lists. The A-X lists tended to show 
poorer recall on Trial 1 than the A-B 
but this trend disappeared after the 
first trial. Tt is tempting to relate the 
Trials X Lists effect to recovery from 
interference but it may be simply the 
Tesult of the level of performance at 
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the beginning of recall (Butler & 
Peterson, 1965). 


Discussion 


In a discussion of an associative- 
probability theory Underwood and Schulz 
(1960) state: 


In a paired-associate task the subject must 
learn to associate two items; he must learn 
to say B when A is presented. The basic 
postulate of the theory could state that the 
greater the M of these items—the greater 
the number of associates which they elicit— 
the greater the probability that associates 
from the two items will link up in some 
manner so that B is connected to A via 
already existent associations [p. 45]. 


If it is assumed that the effectiveness of 
the association selected as a mediator also 
increases as a function of meaningfulness 
then the results of Exp. II seem to con- 
form to an associative-probability theory. 
Since the responses were the same for 
the two levels of stimulus meaningfulness 
the results will be discussed in terms of 
stimulus meaningfulness and the effect 
of adding another association, in first- 
list learning, on the learning of the A-D 
lists and on the recall of the first list. 
The amount of transfer from the first- 
list learning to A-D should depend upon 
the effectiveness of the association learned 
in the first list relative to the already ex- 
istent associations. Transfer from E-X 
to A-D would not be specific to the par- 
ticular items but the larger number of 
potential mediators for the A-D list 
should produce faster learning with high- 
meaningful stimuli (H/E-X) than with 
low-meaningful stimuli (L/E-X) as 
Shown in Fig. 2. Training an effective 
mediating association to the low-mean- 
ingful stimuli (L/A-C) should facilitate 
A-D learning more than training the 
Same associations to high-meaningful 
stimuli (H/A-C) because there are 
fewer effective associations, other than 
the C association, available in the low- 
meaningful A-D list. If associations 
which are less effective as mediators are 
learned in the first list, the transfer re- 
sults should also depend upon the associ- 
ations available to link A and D. Asso- 


ciations trained to the high-meaningful 
stimuli could interfere with A-D learning 
because they were less efficient than the 
previous associations (H/A-B). On the 
other hand, the same associations might 
produce positive transfer with low- 
meaningful stimuli because they were 
more efficient than the other available 
associations (L/A-B). The same argu- 
ments hold for learning an even less effi- 
cient association in the first list (A-X). 

A stimulus-predifferentiation hypothe- 
sis could possibly account for the differ- 
ences in transfer as a function of mean- 
ingfulness. However, the recall data also 
tend to conform to an associative-proba- 
bility theory if it is assumed that first-list 
responses, used as mediators in the A-D 
learning, should be recalled better than 
the responses which were not used as 
mediating responses. There was no con- 
trol for recall without interpolated learn- 
ing but the E-X lists had different stimuli 
and different responses from the A-D 
lists and should be subject to a minimum 
of direct interference from A-D learning. 
Figure 4 shows that the E-X lists are 
recalled equally well in spite of differ- 
ences in difficulty of learning. The 
L/A-C list is recalled as well as the E-X 
lists and this is consistent with the hy- 
pothesis that the C items were commonly 
used as mediators during the A-D learn- 
ing and thus subject to little interference. 
On the other hand the H/A-C list recall 
is inferior to the L/A-C and E-X lists 
and this is consistent with the hypothesis 
that fewer C associations were used as 
mediators in the learning of the high- 
meaningful A-D list. The L/A-B and 
L/A-X lists failed to show recall supe- 
rior to that of the H/A-B and H/A-X 
lists as would be required by the hypothe- 
sis that B and X associations tend to be 
used as mediators more often in the low- 
meaningful lists. However, the possi- 
bility remains that a better controlled ex- 
periment would show these differences. 

Increasing the AI facilitated perform- 
ance on the A-D list but there was no 
evidence for an interaction between AI 
and the amount of transfer. A mediating 
association should require time to occur 
but if the lists differ only in the number 
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and effectiveness of the potential medi- 
ators, instead of whether there are or are 
not associations used to aid A-D learn- 
ing, then increased AI would not have 
a differential effect on transfer. The 
longer B-C-D chain in the A-B condition 
should require more time than the shorter 
C-D chain in the A-C condition but 
Martin and Dean’s (1964) results make 
it doubtful that any facilitation in the 
A-B condition was due to the use of the 
B-C-D chain, 

The results of this experiment suggest 
that the control groups may be the deter- 
mining factor in many conclusions about 
the amount of mediated transfer in verbal 
paired-associate lists because most verbal 
materials do not permit conclusions about 
"nonmediated" learning. The approach 
presented here suggests that meaningful- 
ness should interact with the SR I at very 
brief intervals which do not permit dis- 
covery or use of mediators. A mediated 
response should require longer to occur 
than a direct response but not longer 
than the use of a less efficient mediator 
Or of a very weak direct response. De- 
termination of the effect of AI on medi- 
ation may require comparison of medi- 
ated and nonmediated recall which has 
the same probability of occurring if suf- 
ficient time is permitted for mediation. 

Previous experiments which suggest 
an interaction between transfer and AI 
have compared different experiments 
(Richardson, 1965b; Schulz & Lovelace, 
1964) or have not been Statistically sig- 
nificant (Richardson, 1965a; Runquist & 
Marshall, 1963). Katz (1963) has pre- 
sented the only clear evidence that verbal- 
mediating responses require time to 
occur. 
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FORGETTING AND NATURAL LANGUAGE MEDIATION* 
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The experiment attempted to demonstrate the effectiveness of natural 
language mediators (NLMs) for the retention of paired associates. 
An NLM was any word, phrase, or sentence association reported by S 
for a pair. After a pretest familiarization run, 4 groups of 30 Ss 
learned 96 CVC pairs either high or low in meaningfulness (m’), and 
presented only once for either 15 or 30 sec. Upon presentation of each 
pair S wrote his NLM, if any, on a card. At recall, 24 hr. later, S 
attempted to give in writing the response member of the pair and the 
NLM that he had used in acquisition. Proportion correct at recall 
was highest for items to which NLMs had been originally formed and 
later remembered. Retention was negligible for rote pairs where no 
NLMs were formed, and for pairs where forgetting caused the NLM 
to be dropped completely from S’s repertoire or seriously changed. 
Presentation time was positively related to number of NLMs formed 
in acquisition but had little effect on the proportion of items cor- 
rectly recalled. The effectiveness of NLMs for retention of paired 
associates was discussed in terms of the transfer of associative strength 


from well-learned language habits. 


A natural language mediator (NLM) 
for paired associates is any association 
of any type or complexity that S brings 
to bear on his verbal-learning task. 
The NLM may be a one-word asso- 
ciation between the stimulus and re- 
sponse terms of a pair, one or more 
sentences, an association based on 
similarity of sounds, or an association 
between selected segments of the terms 
such as single letters or subgroups of 
letters. Knowledge of NLMs is 
through interview or questionnaire that 
probes how S went about learning the 
pairs. Woodworth and Schlosberg 
(1954, p. 708) have commented on the 
avoidance of NLMs by experimental 
psychologists, and more recently Run- 
quist and Farley (1964) have said: 
Although it is usually considered dangerous 


to treat these utterances as having any scien- 
tific importance, the ubiquity of the reports 


1This research was supported by Grant 
802-65 from the Air Force Office of Scien- 
tific Research. Susan Baum assisted in run- 
ning Ss and Carl Webber, Marcia Feuerzeig, 
and Janice Wilson assisted in the analysis of 
the data. 


of mediators seems to warrant their investi- 
gation [p. 280]. 

We concur, and we believe that NLMs 
must eventually enter the laws of 
verbal behavior as a variable for both 
acquisition and retention. The use- 
fulness of NLMs for laws is, of course, 
dependent upon their value for predict- 
ing criterion behavior. 

Underwood and Schulz (1960, pp. 
296-300), Clark, Lansford, and Dal- 
lenbach (1960), and Bugelski (1962), 
report on the generous presence of 
NLMs in verbal acquisition. More- 
over, the topic of retention has several 
studies which document a positive role 
of NLMs for remembering. For ex- 
ample, Reed (1918), and Runquist and 
Farley (1964) have shown that pairs 
with NLMs are recalled better than 
nonmediated pairs learned by rote. 
More recently the senior author, in two 
as yet unpublished studies, found that 
retention of paired associates over a 
3-day interval was benefited by NLMs. 

The experiment reported here is an 
investigation of NLMs and the reten- 
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tion of paired CVCs. The two main 
variables of the study were chosen be- 
cause of their likelihood of influencing 
the production of NLMs: (a) mean- 
ingfulness, and (b) presentation time 
for a pair on the assumption that more 
study time would induce more NLMs 
(Bugelski, 1962; Clark et al, 1960). 
Because NLMs are themselves re- 
Sponses and stand a chance of being 
altered or lost from S’s behavioral 
repetoire over the retention interval, 
the data were analyzed in terms of 
retention as a function of change in 
NLMs from acquisition to recall. 


METHOD 


Experimental design—The experiment was 
conceived as a strong test of the assumption 
that NLMs benefit retention, Each CVC 
pair was shown for study only once, and if 
NLMs are a strong variable for retention 
their potency should emerge with only a 
restricted opportunity to form and use them. 

All Ss received the same pretest familiari- 
zation exercise of learning and short-term 
recall of 25 pairs, The 25 pairs were de- 
vised by pairing randomly 50 CVCs (Noble, 
1961) of intermediate m value (X= 3.0). 
Presentation time was 23 sec. in both acqui- 
sition and recall, 
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Twenty-three seconds was chosen because it 
was intermediate between 15 and 30 sec. 

Basic procedures for all groups.—Initial 
instructions. demonstrated CVC pairs, indi- 
cated that Ss should memorize each pair for 
later recall, and discussed N LMs and urged 
Ss to use them with each pair if possible. 
They were told that at recall only the 
stimulus term would be presented and they 
would be asked to give both the response 
term and the NLM if they could, and the 
more they recalled the better their pay would 

The Ss received standard pay of $3.50 
for the two sessions and, in addition, received 
a bonus of 1 for each response and for each 
NLM correctly recalled, 

All materials were presented with a slide 
projector to 15 Ss at a time. Two Es were 
used, one to operate the projector and one to 
monitor the group. The first session lasted 
1-2 hr. depending on the presentation time 
allowed for a pair of the criterion list. The 
Pretest followed the initial instructions im- 
mediately. As each of the pretest pairs was 
Presented S took from a stack on his desk 
a card that had a number corresponding with 
a small-size number in the upper corner of 
the projection Screen and wrote his NLM, 
if any, on it. The number corresponded to 
the position of the pair in the list, and was 
a safeguard to insure that S would stay in 
Proper sequence and also for us to identify 
the pair to which the NLM belonged. After 
S had finished writing he placed the card 
in a box. A 5-min. rest was given after 
pretest acquisition, followed by brief recall 
instructions and recall of pretest pairs. For 
recall, only the stimulus member of a pair, 
along with a sequence number, appeared on 
the screen. The $ took from his stack a 
numbered card, printed the stimulus CVC on 
it and then attempted to recall in writing the 
CVC response and NLM for the pair. The 
order of slides for recall Was a new random 
order, but the same order for all Ss. Five 
minutes rest followed pretest recall. 

Up to this point ail groups had the same 
treatment, but in acquisition of the criterion 
list they were differentiated by presentation 
time and meaningfulness, 
Instructions were given and they were told 
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versed sequences of the 96 pairs to counter- 
balance for possible order effects. Sequence 
effects, if any, were ignored in the analysis 
by pooling the two subgroups. 

The recall session for the criterion list 
was 1 hr. in duration. Criterion recall was 
preceded by brief reminder instructions, and 
procedures were the same as for pretest 
recall. The random order of slides in recall 
was different from the acquisition series but 
was the same order for the two low m’ 
groups, as it was for the two high m’ groups. 

Subjects—The 120 Ss were university 
undergraduates who were randomly assigned 
to groups. 


RESULTS 


Evaluation of NLMs.—Two judges 
were used to rate the change in NLMs 
over the retention interval so that recall 
of response terms could be related to 
change or forgetting of the NLMs. 
Working independently, each judge 
compared the acquisition and recall 
cards for each pair. A judge tabulated 
whether or not an NLM was used in 
acquisition, whether the response term 
of a pair was correct in recall, and then 
categorized the NLM and its change 
from acquisition to recall. If no NLM 
was formed in acquisition the pair was 
categorized Rote. When there was an 
NLM in acquisition the judge com- 
pared it with the NLM at recall and 
assigned the pair to one of three cate- 
gories: (a) Same, (b) Similar, or (c) 
Not the Same if it was distinctly dif- 
ferent or if there was an NLM and it 
had been completely forgotten at recall. 
The judges agreed in their classifica- 
tion of 93.9% of the pairs. For the 
remaining 6.1% the judges consulted 
together and agreed on a category be- 
cause each pair had to be assigned to 
a definite category. For analysis the 
Same and Similar categories were com- 
bined. For all groups only 40 items 
fell into the Similar category. 

Mediators in acquisition.—Table 1 
gives the mean number of NLMs 
formed in the acquisition of the 96 


TABLE 1 


Mean NUMBER OF NATURAL LANGUAGE 
Meprators (NLMs) USED IN 


ACQUISITION 
p i i 
reas X: 
Low High 
15 31.1 59.4 45.2 
30 50.5 74.7 62.6 
x 40.8 67.1 


pairs, and it can be seen that Ss had 
a substantial number of NLMs even 
when m was low and the pairs seem- 
ingly nonsense. Number of NLMs 
increase both with presentation time 
and m. With number of NLMs in 
acquisition as the measure for S, a 
2 x 2 analysis of variance for the four 
groups was performed. The effects of 
both presentation time and m were 
significant beyond the .001 level, Fs 
(1, 116) — 31.5, and 71.9, respectively, 
but their interaction was not significant, 
F«1. 

Recall.—Table 2 gives the propor- 
tions correct at recall for each group 
for each NLM category as well as the 
frequencies of pairs on which a pro- 
portion is based. The table shows 
that when the NLM category is the 
Same or Similar the frequency of cor- 
rect recall is directly related to m’ and 
presentation time. The longer presen- 
tation time allows Ss to generate more 
NLMs in acquisition and thereby more 
items are recalled. It is interesting to 
note, however, that while large differ- 
ences exist in the numbers recalled at 
each level of m’, for both time intervals 
the number recalled is close to being a 
constant proportion of the number of 
NLMs generated. 

Examination of the data summarized 
in Table 2 reveals a high correlation 
between the retention of NLMs and 
correct response at recall, Correct re- 
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TABLE 2 
Proportion Correct RECALLS iN Each NLM CaTEGORY ror EACH Group 


NLM Category 
Group Group Total 
Same or Similar Not the Same Rote 
L-15 83/182 6/752 57/1946 146/2880 
456 .008 .029 .051 
L-30 154/337 5/1178 72/1365 231/2880 
457 .004 .053 -080 
H-15 339/421 32/1361 93/1098 464/2880 
.805 .024 .085 .161 
H-30 666/770 41/1472 73/638 780/2880 
.865 .028 114 271 
Category 1242/1710 84/4763 295/5047 1621/11520» 
Total for 726 -018 058 41 
All Groups 


Note.—The proportion is based on the ratio given, which is number correct over number of items in a category 


for a group. 
* All items, all groups. 


call was low with rote learning or when 
the NLM was Not the Same, but when 
the NLM was available in substantially 
its original form the proportion correct 
was a robust 726 overall ^ Groups 
H-15 and H-30 correctly recalled over 
80% of their items when the NLM was 
recalled. The data on which Table 2 
is based were analyzed statistically by 
analysis of variance in which m’ and 
Presentation time were between-Ss 
variables and the three NLM cate- 
gories determined by the presence or 
absence of NLMs in learning and their 
recall (Same and Similar vs. Not Same 
vs. Rote) was a within-Ss variable. 
The cell score for an S was the propor- 
tion of responses correctly recalled. 
The differences between NLM cate- 
gories, F (2, 232) = 579.58, the effect 
of m’, F (1, 116) = 89.55, and their 
interaction, F (2, 232) = 63.28, were 
all significant beyond the .001 level. 
All other F tests were insignificant 
(p > .05). 


Discussion 


These findings confirm the infrequent 
experiments which have shown that 
NLMs enhance retention, Although the 


total proportion of items recalled by our 
Ss was small, items for which the NLMs 
used in acquisition were retained were 
recalled more than 70% of the time, On 
this basis, we conclude that NLMs are a 
very positive factor in the retention of 
paired associates, 

Our study was empirical exploration 
of the effects of NLMs on retention and 
it leaves unanswered the more funda- 
mental reasons why NLMs function for 
retention as they do, One possibility is 
that NLMs represent the application of 
well-learned language habits to new verbal 
material, and being well-learned, are com- 
paratively resistant to forgetting processes. 
This point of view says that verbal learn- 
ing is often a case of positive transfer 
where past language skills are brought to 
bear on the learning of new ones. The 
view does not deny rote learning, because 
clearly there are situations where positive 
transfer is absent and a pair must develop 
associative strength from a zero or very 
low level by practice repetitions in the 
experiment. Nor is the viewpoint new. 
Positive transfer is a phenomenon implic- 
itly present in the use of a quantity like 
m which is related to number of single- 
word associations from past learning, but 
NLMs would seem to tap more fully the 
richness of S's associative ability and the 
transfer potential from his past learning 
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in a way that other approaches do not. 
Our study has given no cause to disavow 
m as a useful variable for retention, but 
it does suggest that a turn to more com- 
plex associative processes may be profit- 
able. The scaling of word pairs for 
probability of an NLM is an orderly de- 

` parture from Noble's m and, on the basis 
of our experiment here, would seem to 
be a useful research venture. 

The transfer of training view coupled 
with possible covert rehearsal has value 
for explaining why presentation time 
produces reliable differences between 
groups in the frequency of NLMs and 
items recalled, but not in the proportions 
of correct recalls. The íact that more 
NLMs are formed during the longer time 
interval (see Table 1) is consistent with 
the findings reported by Bugelski (1962) 
and Clark et al. (1960). Some of the 
NLMs generated in the longer interval 
could be expected to be formed relatively 
late and have long response latencies. 
Given the accepted view that associative 
strength and latency are inversely re- 
lated, these NLMs have less benefit from 
transfer of associative strength. Some- 
what paradoxically, as shown in Table 2, 
proportionately as many of these items are 
retained as are NLMs which occur in the 
Shorter interval and are likely to have 
shorter latencies. This result may be 
due to the covert rehearsal of pairs with 
weaker NLMs occurring in the longer 
interval. The rehearsal strengthens some 
of them well enough so that they are re- 
tained as well as pairs with stronger 
NLMs generated in the shorter interval. 

The pairs with forgotten NLMs in the 
category of Not the Same had only .018 
of their responses correctly recalled, 
which is virtually nil. This level is much 
poorer than for rote learning, which was 
itself low. When an S learned a pair by 
rote he probably used the presentation 
time for covert repetition and this gave 
it some associative strength (Bugelski, 
1962). But, when he formed an NLM, 
he may have covertly repeated the whole 
Stimulus-NLM-response complex if he 
had time remaining. Thus, when the 
NLM was forgotten the response could 
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not be recalled because the stimulus had 
an associative tie with the response via 
the NLM and had no capability for 
arousing the response directly. 

Our study has established conditions 
where retention is a direct function of 
meaningfulness. Either in the total num- 
ber of items retained or in the propor- 
tions retained in the various categories, 
the high m items show superiority. It 
should not be construed from these re- 
sults that meaningfulness affects the re- 
tention process per se. Rather, it is 
our contention that meaningfulness oper- 
ates through the transfer of associative 
strength from priorlearning. Since high 
m. items have more past associations than 
low m’ items, more NLMs of considerable 
associative strength should be formed and 
retention benefited accordingly. Mean- 
ingfulness and retention become corre- 
lated because NLMs and their associative 
strengths are a determining underlying 
factor. 


REFERENCES 


Bucetski, B. R. Presentation time, total 
time, and mediation in paired-associate 
learning. J. exp. Psychol., 1962, 63, 409- 
412. 

Crank, L. L., Lansrorp, T. G., & DALLEN- 
BACH, K. M. Repetition and associative 
learning. Amer. J. Psychol, 1960, 73, 
22-40. 

Noste, C. E. Measurements of association 
value (a), rated associations (a'), and 
scaled meaningfulness (m') for the 2100 
CVC combinations of the English alphabet. 
Psychol. Rep., 1961, 8, 487-521. 

Reep, H. B. Associative aids: I. Their 
relation to learning, retention, and other 
associations. Psychol. Rev. 1918, 25, 
128-155. 

Runguist, W. N., & Farley, F. H. The 
use of mediators in the learning of verbal 
paired associates. J. verbal Learn. verbal 
Behav., 1964, 3, 280-285. 

Unverwoon, B. J., & Scaurz, R. W. Mean- 
ingfulness and verbal learning. New 
York: Lippincott, 1960. 

WoopwortH, R. S., & ScHLosBERG, H. Ex- 
perimental psychology. New York: Holt, 
1954. 


(Received October 6, 1965) 


Journal of Experimental Psychology 
1966, Vol. 72, No. 6, 834-840 


GENERALIZATION OF POSITIVE AND NEGATIVE 


RESPONSE BIASES 


MICHEL HERSEN : 
State University of New York at Buffalo 2 


This investigation was designed to determine whether experimentally 
induced response biases can generalize from 1 concept topic to an- 
other, and whether the “conditioning” effects are a function of time, 
task format, or the presence of E. Secondly, these studies were con- 


ducted to ascertain whether S's ability to verbalize the response- 
reinforcement contingency is related to the specificity of the postexperi- 


mental questionnaire. 


The results warrant the following conclusions : 


(a) "conditioning" effects generalize across concept topics; (b) the 
experimental treatment is not restricted by time, format, nor is it re- 
lated to E who originally administered reinforcement; (c) S's ability 
to specify the experimental arrangements is primarily a function of 
the sensitivity of the postexperimental questionnaire employed. 


Recent studies (Bugelski & Hersen, 
1966; Carriero, 1964) have shown that 
undergraduate Ss could be influenced 
to endorse or reject statements of in- 
formation which they were unable to 
evaluate, as the information involved 
was new to them. In both investiga- 
tions the experimental task was pre- 
sented in a forced-choice true-false 
format, as the validation of a teaching- 
machine program, The Ss were re- 
quired to respond “true” or “false” to 
a set of training items presented one at 
atime. After each response Ss were 
verbally informed by E whether the 
item was true or false, The training 
Series consisted of Obviously true, ob- 
viously false, and "ambiguous" state- 
ments. “Ambiguous” meant that in 
a standardizing test about 50% of the 
group responded true and the other 
50% responded false. In the Bugelski 
and Hersen study all ambiguous state- 


1 Now at West Haven Veterans Adminis- 


m Hospital, West Haven, Connecticut. 
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submitted to the Psychology Department of 


ments (items on old age) were labeled 
true by E, whereas in the Carriero 
study all ambiguous statements (items 
about Alaska) were labeled false by E. 
Immediately following the training ses- 
Sion Ss took a test which contained 
more of the same three types of state- 
ments, However, this time instead of 
answering the ambiguous items about 
half true and half false, the experi- 
mental Ss, consistent with their rein- 
forcement history, responded true or 
false to such statements significantly 
more often than their respective control 
groups. Thus, Bugelski and Hersen 
induced a positive response bias, and 
Carriero induced a negative response 
bias. 

The purpose of the present series of 
experiments was to further establish 
the effectiveness of this technique by 
demonstrating that the "conditioning" 
effects can transfer across concept 
topics (eg. train Ss with a series of 
items about disease ; test Ss with a 
similar series of statements about old 
age). Secondly, these experiments 
were conducted to determine whether 
the experimental treatment was specific 
to the moment, the status, or the pres- 
ence of E. Of further interest is 
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TABLE 1 
EXPERIMENTAL DESIGN 


Experimental RT| ^ Control ee 
Training Series® 
16 Ambig. RT | 24 True RT | 16 Ambig. RF 
16True RT| 24False RF | 16True RT 
16 False RF 16 False RF 
Test Series™ 
16 Ambig. NR | 16 Ambig. NR | 16 Ambig. NR 
16 Tre NR | 16 True NR | 16 True NR 
16 False NR | 16False NR | 16False NR 


Note.—RT = Reinforced as true; RF = Reinforced 
as False; NR = no Reinforcement. 

* In random order for all three groups; not more than 
three of any kind of statement in sequence. 


whether S’s ability to identify the re- 
sponse-reinforcement contingency is re- 
lated to the sensitivity of the post- 
experimental questionnaire. On the 
basis of these questionnaires Ss able to 
specify the response-reinforcement con- 
tingency were labeled “Verbalizers.” 


EXPERIMENT I 
Method 


Subjects.—In both the pretest and experi- 
mental study proper, males and females from 
introductory psychology classes in group 
settings were employed as Ss. 

Materials.—A. pool of obviously false (e.g., 
Medicine has derived little benefit from elec- 
tronics), obviously true (e.g, Symptomatic 
of a cold is a congested nasal passage), and 
“ambiguous” (e.g., Caput succedaneum is a 
traumatic alteration of the head) statements 
about disease were constructed and ad- 
ministered in group setting to 330 Ss. The 
operational definition for true and false 
statements was at least 90% group agree- 
ment. The criterion for ambiguous state- 
ments was 40-60% group agreement. From 
those items best meeting the stated criteria, 
appropriate training and test series were 
constructed as indicated in Table 1. 

Procedure.—In all experiments to be re- 
ported, Experimental Group RT is the group 
in which all ambiguous items of the training 
series are labeled true by E. Experimental 
Group RF is the group in which the same 
ambiguous items of the training series are 
labeled false by E. 

Table 1 presents the experimental design, 
indicating the reinforcement pattern in train- 
ing for the two Experimental groups and the 


Control group. The statements were mimeo- 
graphed in the form of a true-false test and 
were presented to Ss as the validation of 
a teaching-machine program. The Ss were 
seated in a large lecture hall in three groups. 
There were 37 Ss in Experimental Group 
RT, 42 Ss in the Control group, and 39 Ss 
in Experimental Group RF. Written in- 
structions were given to the effect that the 
test might not follow the usual pattern of 
true and false tests, and that some of the 
statements would be more difficult than 
others. This was done in order to provide 
Ss with a basis for responding to the post- 
experimental inquiry, designed to identify Ss 
able to verbalize ("Verbalizers") the re- 
sponse-reinforcement contingency. 

The Ss were instructed to circle what they 
thought to be the correct response to the first 
item in the training series. This action was 
immediately followed by E’s verbal rein- 
forcement of True or False. This procedure 
was maintained at a constant pace (15 sec. 
per item) for all three groups throughout 
the 48-item training series. Upon completion 
of the training series all three groups were 
verbally instructed to continue on their own, 
answering all items (49-96) of the test 
series within a 10-min. time period. After 
completing the test series Ss were required 
to answer the following: How did you 
answer when the item was difficult? And 
why did you answer in such a manner? 
(“specific” verbalization questionnaire). 


Results 


For the sake of uniformity all analy- 
ses were carried out in terms of the 
number of true responses made by the 
Experimentals and Controls to ambigu- 
ous statements. Differences between 
the Experimentals and Controls were 
highly significant in both the positive 
and negative directions. Further, Ss 
able to verbalize the response-reinforce- 
ment contingency continue to show the 
influence of training more than the 
Nonverbalizers. However, both Ex- 
perimental RT and Experimental RF 
nonverbalizers are significantly differ- 
ent from the Control group. 


EXPERIMENT II 


This experiment is essentially a repe- 
tition of Exp. I except that this study 


836 


MICHEL HERSEN 


` TABLE 2 


COMPARISON or MEAN NUMBER oF “TRUE” 


RESPONSES TO AMBIGUOUS ITEMS IN 


THE Test SERIES MADE BY THE CONTROL Ss AND VERBALIZERS AND. NON- 


VERBALIZERS OF THE Two 


EXPERIMENTAL GROUPS 


Group N M SD $, r 
Exp. RT Verbalizers vs. 15 11.33 3.67 2.62** 
Hrs RT Nonverbalizers 8 9.00 1.87 
Exp. RT Verbalizers 15 11.33 3.67 4.79*** 
vs. Control 25 6.68 3.31 
Exp. RT Nonverbalizers 8 9.00 1.87 1.85* 
vs. Control 25 6.68 3.31 
Exp. RF Verbalizers vs, 7 3.43 214 328** 
Exp. RF Nonverbalizers 14 7.93 2.96 
Exp. RF Verbalizers 7 3.43 2:17 2.43* 
vs. Control 25 6.68 3.31 
Exp. RF Nonverbalizers 14 7.93 2.96 1.39 
vs. Control 25 6.68 3.31 


Note.—Train items on disease; Test items on old dge. 
*p <05. 

bad AE do 
wp < 1001. 


was conducted to determine whether 
the treatment effect carries over from 
one concept topic to another. 


Method 


Subjects—There were 23 Ss in Experi- 
mental Group RT, 25 Ss in the Control 
group, and 21 Ss in Experimental Group RF, 

Materials—The Ss were trained with 
items on disease and tested with items on 
old age. Test series statements on old 
age were the same as those used in the 
Bugelski and Hersen study. 

Procedure.—With the exception of the 
different Concept topic in the test series, all 
Procedures were the same as described in 
Exp. I. Here too the “specific”? verbalization 
questionnaire was used to detect Verbalizers, 


Results 


Analysis of data presented in Table 
2 confirms the hypothesis that the ef. 
fects of training on one concept topic 
can generalize to another. Both RT 
and RF verbalizers are significantly 
different from the Control group. 
However, RF nonverbalizers are not 
significantly different from the Control 
group, and only when RF verbalizers 
are separated from RF nonverbalizers 
is the difference Significant. "This sug- 


gests that knowledge of the response- 
reinforcement contingency enhances 
performance, 


ExrermeNrT III 


This experiment is basically a repeti- 
tion of Exp. I except that training and 
test series items were about “Cuba,” 
and the postexperimental inquiry was 
“open-ended.” 


Method 


Subjects—There were 90 Ss in Experi- 
mental Group RT, 54 Ss in the Control 
group, and 56 Ss in Experimental Group RF. 

Materials. —A pool of obviously true, am- 
biguous, and obviously false statements about 
“Cuba” was constructed and administered in 
group setting to 179 Ss in the form of a 
forced-choice true and false test. From 
those statements best meeting the afore- 
mentioned criteria, training and test series 
were constructed as indicated in Table 1. * 

Procedure.—Aside from the “open-ended 
questionnaire, all procedures were the same 
as described in Exp. I. The following is 
the open-ended questionnaire: We want to 
make sure you understand the purpose of 
the experiment and that you have complete 
understanding of the instructions, Tell us 
how you operated during the experiment— 
for example, what was the procedure? 


r 
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TABLE 3 


COMPARISON OF MEAN NUMBER OF “TRUE” 
RESPONSES TO AMBIGUOUS ITEMS IN THE 
Test SERIES MADE BY THE, Two 
EXPERIMENTAL AND THE 
CONTROL GROUPS 


Group | N | M | SD | Comparison t 


Exp. RT | 90 | 9:56 | 3.01 | Exp. RT > C |6.96*6* 
Control |284 | 617 | 244 | oR ES 
Exp. RF | 6 | 5:37] 278, | PRE «C 1.59 


Note.— Train items on Cuba; Test items on Cuba. 
**p < 001. 


What were you expected to do? What did 


you do? 


Results 


On the basis of this questionnaire 
method one S from the RT group and 
two Ss from the RF Group were 
judged to be verbalizers. Percentage 
of agreement between judges was 
9995.35 The data of Verbalizers and 
Nonverbalizers were combined in the 
statistical analysis. Table 3 presents 
tests series performance collapsed over 
trials. The difference between Experi- 
mental Group RT and the Control 
group is highly significant; however, 
the difference between the Control 
group and Experimental Group RF 
only approaches significance. The lack 
of significance is attributed to the de- 
pressed score of the Control group. 
On the basis of chance, one would ex- 
pect control Ss to respond about 8 true 
and 8 false, but the mean score of 6.17 
true responses is considerably below 
the expected. Inan a posteriori specu- 
lation one might surmise that control 
Ss, not exposed to ambiguous items in 
training, are reluctant to endorse am- 
biguous statements about Cuba in the 
test series. This is possibly due to the 
negative feelings connoted by the 
concept topic, Cuba. 


3 The author thanks Karen G. Wisotzgy 
for rating Ss' responses. 
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TABLE 4 ' 


Comparison OF MEAN NUMBER OF “TRUE” 
RESPONSES TO AMBIGUOUS ITEMS IN THE 
Test SERIES MADE BY THE EXPERI- 
MENTAL AND THE CONTROL GROUPS 


Group | N | M | SD | Comparison t 


Exp. RT | 66 | 9.02 | 248 | xp, RT >C | 2.60%" 

Control | 61 | 7.82 | 2.67 

Exp. RF | 97 | 6.88 | 2.87 | EXP RF < C | 2.06% 
Note.—Train items on Cuba; Test items on old age. 
* p <.05. 
Dp c0. 


ExPERIMENT IV 
This experiment was also conducted 


-to determine whether the treatment ef- 


fect generalizes to a different concept 
topic. It is essentially a repetition of 
Exp. III except that test-series items 
were on Old Age. 


Method 


Subjects.—There were 66 Ss in Experi- 
mental Group RT, 61 Ss in the Control 
group, and 97 Ss in Experimental Group RF. 

Materials—The Ss were trained with 
items on Cuba and tested with the old age 
statements. 

Procedure.—With the exception of the 
different concept topic in testing, all pro- 
cedures were the same as described in 
Exp. III. 


Results 


Six Ss from Experimental Group 
RT were considered to be Verbalizers 
and none from Experimental Group 
RF was judged to be a Verbalizer. 
Percentage of agreement between 
judges was 94% and 99%, respectively. 
As the mean score of the six RT 
verbalizers did not differ from the rest 
of the group, the data were combined 
in the analysis. Test-series data pre- 
sented in Table 4 indicate that both 
experimental groups were significantly 
different from the Control group. 
Once again the generalization phenom- 
enon was confirmed. 
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EXPERIMENT V 


This experiment was conducted to 
assess whether “Demand Characteris- 
tics” play a role in this paradigm 
(Orne, 1962). Orne contends that Ss 
seek to identify the “real” nature of 
the experimental hypothesis. There- 
fore in this experiment an effort was 
made to eliminate cues that might lead 
Ss to relate test-series items with train- 
ing-series items. 


Method 


Subjects.—To conform to the nature of 
the experimental design two intact second 
year psychology classes were used as Ss. 
All Ss had successfully completed intro- 
ductory psychology the previous year. 

Materials—A pool of obviously true, am- 
biguous, and obviously false statements about 
psychology was constructed and administered 
in group setting to 376 Ss in the form of a 
forced-choice true and false test. From 
those items best meeting the stated criteria, 
the appropriate test series for Experimental 
Group RT and the Control group was con- 
structed as indicated in Table 1. 

Procedure—Each of the two psychology 
classes was divided into approximately two 
equal groups (Experimental RT and Con- 
trol). Reinforcement pattern and instruc- 
tions to Ss were the same as described in 
Exp. I for Experimental RT and the con- 
trols.  Training-series items (Cuba) for 
Experimentals and Controls of both classes 
were dittoed in the form of a true-false exam. 
In both classes E was introduced by the 
instructor as a graduate student working 
towards his PhD. The E administered re- 
inforcement and at the conclusion of the 
training sessions he collected the test book- 
t Nothing was 
said about the experiment just completed 
nor was anything said about E's intended 
return 96 hr. later, Exactly 96 hr. later E. 
(“Demand 
was there 
to complete the experiment begun 4 days 
earlier. He presented the experimental and 
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TABLE 5 


MEANS AND SDs or "TRUE" RESPONSES TO 
AMBIGUOUS ITEMS IN THE TEST SERIES 
MADE BY THE EXPERIMENTALS AND 
CONTROLS OF THE "DEMAND" 

AND "NONDEMAND" GROUPS 


Group N M | SD 
Demand (Experimental) 38 | 9.65 | 2.96 
Demand (Control) 31 | 7.06 | 2.66 
Nondemand (Experimental) | 22 | 9.10 | 3.12 
Nondemand (Control) 19 | 6.32 | 3.15 


their responses to the "specific" verbalization 
questionnaire. There were 38 Ss in the 
Experimental Group RT and 31 Ss in the 
Control group. 

In the other class (“Nondemand group"), 
4 days after the male E had trained the 
group on Cuba items, the instructor pre- 
sented her class with the mimeographed 
Psychology Inventory. She instructed her 
class that this represented a departmental 
Survey to assess student's recall of intro- 
ductory psychology. After completing the 
test series (Psychology Inventory in true- 
false form), Ss were required to write their 
responses to the "specific" questionnaire. 
There were 22 Ss in Experimental Group 
RT and 19 Ss in the Control group. 

The only cue linking training and test 
Series is the interrelated character of the 
true-false task. If Demand Characteristics 
are operating, “conditioning” effects should 
logically appear only in the Demand group. 


Results 


Table 5 presents the means and SDs 
of the experimental and control Ss of 
the Demand and Nondemand groups. 
The postexperimental questionnaire 
yielded only one verbalizer from the 
Experimental RT Nondemand group, 
and his score was retained in the 
analysis. 

Tn view of the disproportional num- 
ber of Ss per cell, an approximate 2 
X 2 analysis of variance was utilized 
(Walker & Lev, 1953, pp. 381-382). 
The experimentals vs. controls yielded 
a significant F (1, 106) = 20.44, p 
< .001, demonstrating the effectiveness 
of the conditioning procedures, The 
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difference between the Demand and 
Nondemand groups was not significant, 
nor was there any evidence for an 
interaction, indicating that Demand 
Characteristics do not operate in this 
situation. Further, this experiment 
confirmed the contention that the ef- 
fects of training on a concept topic 
could generalize to a different concept 
topic after a 4-day interim. 


DiscussioN AND CONCLUSIONS 


]t should first be recognized that the 
experimental induction of a positive or 
negative response bias has been clearly 
established. Secondly, there is evidence 
that generalization does take place, 
whether verbal reinforcement is admin- 
istered in the positive or negative direc- 
tion. A review of the verbal-condition- 
ing literature on this topic (Greenspoon, 
1962; Krasner, 1962) indicates that the 
results are equivocal at best. Singer 
(1963), using a reinforcement procedure, 
was able to shift Ss’ attitudes toward 
a prodemocratic position on a combined 
version of the California and Christie F 
Scale. He then administered the E Scale 
and found that Ss maintained their re- 
sponse bias. However, these Ss were 
preselected in that they were relatively 
prodemocratic. The present investiga- 
tion has shown that generalization can 
occur without a preselection factor. 

Stronger confirmation of the generali- 
zation effect was obtained in Exp. V, 
where generalization was not only dem- 
onstrated across topics, but across situ- 
ations, across Es, and over time. This 
is of special interest with respect to Sing- 
er’s study where generalization occurred 
only when E was present in the room. 

In Exp. I and II, 26% of the Ss were 
considered to be verbalizers on the basis 
of the “specific” questionnaire. In Exp. 
III and IV, 3% of the Ss were judged 
to be verbalizers on the basis of the open- 
ended inquiry. This finding indicates 
that verbalization of the response- 
reinforcement contingency is primarily a 
function of the sensitivity of the ques- 
tionnaire method employed. However, 
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the issue is further complicated when one 
considers at what point in the experi- 
mental procedure the inquiry is intro- 
duced. It will be recalled that only one S 
of 60 experimental Ss in Exp. V was 
judged to be a Verbalizer (specific ques- 
tionnaire). This suggests that imme- 
diate posttest questioning stimulates some 
Ss to a review of what took place during 
experimentation, with some Ss making 
astute ex post facto deductions. Yet, 96 
hr. later, even a postexperimental inquiry 
fails to sharpen the memory traces. 
Other than the possibility that learning 
without knowledge of the answer contin- 
gency occurred here, one might invoke 
the alternate explanation that learning 
and performance are two independent 
processes, learning requiring cognition, 
performance being automatic (Kimble, 
1961). 

It is important to recognize that the 
present paradigm is convenient and time- 
saving as very large groups of Ss can be 
run concurrently in one brief session, 
thus affording an opportunity to demon- 
strate conditioning in the mass. These 
studies firmly demonstrate how in the 
laboratory the temporary control of ver- 
bal (written) behavior can be achieved. 
If in a laboratory context, admittedly 
artificial, the semipermanent modification 
of verbal behavior can be attained in a 
10-min. training session, one can begin 
to understand why verbal habits, consist- 
ently reinforced in a particular direction 
throughout one's lifetime, can be so in- 
grained and impervious to change. 
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OPERANT CONDITIONING OF THE GSR WITH CONCOMITANT 
MEASUREMENT OF TWO SOMATIC VARIABLES* 
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AND 


An attempt was made to determine whether operant GSR condition- 
ing depends upon changes in somatic responses. The Ss were 21 pairs 
of college students who received either contingent or noncontingent re- 
inforcement during a reinforcement period. A 10-min. rest period was 
followed by 16 min. of reinforcement and then 10 min. of extinction. 
In addition to the GSR, respiration rates and EMGs were recorded. 
Ss receiving contingent reinforcement increased significantly in GSR 
rates during acquisition while a noncontingent group declined slightly. 
A significant Contingency X Minutes interaction was also found for 
GSR rates during this period. During reinforcement, there were no 
statistically significant differences between groups in respiration rate, 
forearm movements, or frequency of breathing irregularities. Sub- 
traction of all somatic associated GSRs resulted in no attenuation of 
the conditioning effect. During extinction the contingency effect 
reached significance only after subtraction of all movement associated 
GSRs. The results were interpreted as evidence that the GSR may 


be operantly conditioned in the absence of somatic mediation. 


Several recent experiments have in- 
dicated that reliable changes in the rate 
of emission of the unelicited galvanic 
skin response (GSR) can be produced 
by operant conditioning methods 
(Fowler & Kimmel, 1962; Kimmel & 
Kimmel, 1963; Shapiro, Crider, & 
Tursky, 1964). In these studies, Ss in 
a contingent-reinforcement group re- 
ceived reinforcing stimuli (dim light 
when in total darkness or tone signal- 
ing monetary reward) immediately 
after emission of a response, while Ss 
in a noncontingent-reinforcement group 
received the same number of stimuli at 
times when they were not responding. 
Noncontingent Ss tended to decline in 
response rate during reinforcement and 
increase during extinction, while re- 
sponse rates of contingent Ss were 
shown to rise, remain constant, or de- 


1Done under United States Public Health 
Service Grant MH 0-6060-3 and based in 
Dart upon the senior author’s MA thesis sub- 
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cline only slightly during reinforcement 
and to decline during extinction. 

A major question regarding such 
findings is whether the observed GSR 
changes are due to operant conditioning 
of autonomically mediated responses 
or are artifactual consequences of oper- 
ant conditioning of somatically medi- 
ated responses (eg. respiration or 
movement). Relative to this question, 
Shapiro et al. (1964) reported slight 
but nonsignificant differences between 
contingent and noncontingent groups in 
respiration rates during reinforcement. 
When GSRs associated with respira- 
tion changes were omitted from their 
data, however, the difference between 
contingent and noncontingent groups’ 
GSRs was still significant. These in- 
vestigators also found no correlation 
between GSR rate and gross body 
movements. Similarly, Rice (1964) 
reported an increase in GSRs accom- 
panied by forearm movements during 
the first 12 min. of reinforcement, but 
found a significant difference in rate 
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of emission of GSRs due to reinforce- 
ment between contingent and noncon- 
tingent groups with high initial operant 
levels when only GSRs unaccompanied 
by movements were reinforced. The 
latter effect was not found in Ss with 
low initial operant levels, 

The present experiment used an op- 
erant conditioning paradigm similar to 
that of Kimmel and Kimmel (1963) 
and, simultaneously, measured muscle 
activity from the arm containing the 
GSR electrodes and respiration rate. 
Rates of emission of unelicited GSRs 
of groups receiving contingent and non- 
contingent reinforcement were expected 
to diverge during conditioning and re- 
converge during extinction as in pre- 
vious studies, The main question was 
whether such GSR changes would oc- 
cur independently of forearm move- 
ments and respiration changes, 


METHOD 


Subjects—The Ss were 42 male students 
from introductory psychology classes at the 
University of Florida. They were alter- 
nately assigned to the contingent and non- 
contingent groups on the basis of order of 
appearance at the laboratory. 

Instructions —The Ss were asked to re- 
frain from moving any more than was 
necessary to remain comfortable and to stay 
alert during the experiment. 

Apparatus—The GSR recording system 
has been described in detail previously (Mor- 
row, 1966) as has the visual reinforce- 
ment (Kimmel & Kimmel, 1963). Electro- 
myograms (EMG) were picked up by means 
of bipolar silver electrodes placed on the 
dorsal surface of the right forearm approxi- 
mately 4 of the distance from S's wrist to 
the elbow and spaced about 3 cm. apart. The 
EMG was integrated over a 0.2 sec, time 
constant and recorded on a Grass Model 5 
Polygraph (5P3 Preamplifier). Respiration 
measures were obtained with a chest bellows 
connected to a strain gauge (Statham pres- 
sure transducer) and recorded on the Grass 
polygraph (5P1 Preamplifier). Paper speed 

the polygraph was 1 mm/sec. 

Design—The design of the present study 
was identical to that of Kimmel and Kimmel 
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(1963) with one major exception. Briefly, 
the independent variable was the contingency 
or noncontingency of reinforcement, The Ss 
in the contingent group received visual rein- 
forcements immediately after E observed a 
criterion response. Noncontingent Ss were 
matched with contingent Ss on the basis of 
number of reinforcements given per minute 
but received them at times when no response 
was occurring, The major design difference 
between this and the Kimmel and Kimmel 
(1963) study was that the reinforcement 
criterion used was 1% of the basal resist- 
ance; only responses equal to or greater than 
this criterion were reinforced and (see Re- 
sults) counted. This criterion was employed 
on the assumption that responses smaller 
than 1% of the basal resistance may not be 
mediated centrally and, thus, may not be 
susceptible to reinforcement effects (Greene, 
1964). In the earlier studies, very small 
responses were not capable of being displayed 
0n the recorder because of equipment differ- 
ences. 

Procedure.—Data were collected in an 
insulated air-conditioned (IAC) double- 
walled audiometric chamber. When he ar- 
rived at the laboratory, each S$ was prepared 
by cleaning the palm and back of his right 
hand with acetone. The GSR electrodes 
were attached and held in place with a large 
rubber band. The EMG electrodes were 
filled with conductive electrode paste (Grass 
Instruments Company) and held in place 
with two strips of adhesive tape. Paste was 
also rubbed onto the skin at the electrode 
Sites. After $ was seated in a cushioned 
chair with arm rests in the IAC chamber, 
the chest bellows was attached and the doors 
to the chamber closed. After about a 5-min. 
delay for adjustment of the equipment, a 10- 
min. rest period was begun during which 
time the GSR response criterion was deter- 
mined. Next, a 16-min. reinforcement period 
was run. Reinforcements were delivered 
manually by E, The reinforcement period 
was followed by 10 min. of extinction during 
which time no stimuli were delivered. Be- 
fore concluding the experiment, E instructed 
S via an intercom to move the fingers of his 
right hand and then to open and close the 
hand. This was done to ensure that EMG 
electrode placement was satisfactory and to 
Obtain a postextinction EMG response meas- 
ure. No systematic interview was conducted. 
At the end of the session, E simply asked the 
question, *What happened while you were in 
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the room?” to be sure that S had remained 
alert.? 


RESULTS 


GSR.—The basic dependent variable 
for assessing conditioning was the fre- 
quency of unelicited responses per min- 
ute. A response was counted only if 
its magnitude equaled or exceeded 1% 
of the S’s basal skin resistance at the 
time the response occurred. For Ss 
whose basal skin resistance changed 
over time, this meant that a changing 
magnitude criterion was used. A 3- 
sec. “grace” period was allowed follow- 
ing each light presentation, during 
which time no responses were counted. 

GSR frequencies were transformed to 
Vix +1 and expressed as percentages 
of transformed resting frequency levels. 
This was done separately for each S 
using the average transformed response 
frequency of the last 5 min. of rest. 
Figure 1 shows the relationship be- 
tween mean transformed GSR fre- 
quency and minutes of acquisition and 
extinction for the two groups of Ss, 
plotted in 2-min. blocks. As expected, 
the two curves diverged during acqui- 
sition and reconverged during extinc- 
tion. The effect of Contingency was 
statistically significant for the acquisi- 
tion data, F (1, 40) 28.58, MS. 
— 2439.95, as was the Contingency 
X Minutes interaction, F (7, 280) 
= 3.20, MS, = 322.41. These effects 
failed to reach significance in the ex- 
tinction data. 

EMG.—Frequency of muscle action 
potentials was obtained by counting 
criterion pen deflections per minute of 
time, transforming to Vx + 1, and ex- 
pressing the transformed values as 


2 One noncontingent S was dropped from 
the sample after admitting that he had fallen 
asleep and seen no lights. One contingent 
S was dropped after suggesting to E that 
moving his hands (contrary to instructions) 
had “caused the light to come on.” 
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Fic. 1. Average percentage of trans- 
formed GSR frequency of contingent and 
noncontingent groups during reinforcement 
and extinction, in 2-min. blocks. 


percentages of transformed frequencies 
during the last 5 min. of rest. A cri- 
terion movement was defined as any 
deflection which equaled or exceeded 
1096 of the average postextinction re- 
sponse magnitude (see Procedure). 
Use of this criterion resulted in scoring 
almost every visually detectable deflec- 
tion. With EMG sensitivity set as 
high as feasible, and with careful elec- 
trode placement, the probability of un- 
detected arm movements was negligible. 

Figure 2 presents the average trans- 
formed measures of forearm muscle ac- 
tivity expressed as percentages of 
resting levels for the two groups during 
acquisition and extinction, in 2-min. 
blocks. Again, the data were trans- 
formed for each S individually prior 
to averaging. Both curves declined 
slightly with time, although this effect 
was not significant. Analysis of vari- 
ance of these data revealed no signifi- 
cant differences between groups and no 
interaction, either during acquisition 
or extinction. 

Respiration.—Figure 3 shows the 
average respiration rates expressed as 
percentages of resting levels for the 
two groups, in 2-min. blocks. No 
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Fic. 2. Average percentage of trans- 
formed EMG frequency of contingent and 
noncontingent groups during reinforcement 
and extinction, in 2-min. blocks, 


transformation was used prior to ob- 
taining percentages for the respiration 
data since they do not suffer from the 
Skewness that is typical of the other 
dependent variables. 

There was a tendency for the con- 
tingent group to breathe at a slightly 
lower rate than the noncontingent 
group from the fourth through the 
eighth minutes of acquisition. How- 
ever, analysis of variance revealed no 


Contingent — e e 


1204 — Noncontingent o=o 


o4 


% Response Frequency 
(/00 Test /Rest) 


2 6 0 i4 2 6 10 
Minutes of Reinforcement Minutes of Extinction 


Fic. 3. Average Percentage of respiration 
frequency (rate) of Contingent and noncon- 
tingent groups during reinforcement and 
extinction, in 2-min, blocks, 
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Fic. 4. Average percentage of trans- 
formed respiration irregularity frequency of 
contingent and noncontingent groups during 
reinforcement and extinction, in 2-min. blocks. 


significant effects in these data during 
acquisition or extinction. 

Respiration irregularities.—Because 
of the slight difference, albeit insignifi- 
cant, between the two groups’ respira- 
tion rates during acquisition, a more 
sensitive analysis of the respiration 
data was performed, Figure 4 shows 
the average frequency of respiration 
irregularities for both groups, trans- 
formed to Vr+i and expressed as 
Percentages of resting levels. A 
Tespiration irregularity was defined as 
a deep breath whose amplitude ex- 
ceeded those of the 10 breaths before 
it by 50% or more. As Fig. 4 shows, 
there was no consistent tendency for 
the groups to differ in this measure 
during acquisition, nor did an analysis 
of variance of the acquisition data re- 
veal any significant effects, The inter- 
action of Contingency x Minutes dur- 
ing extinction was significant, F (7, 
280) — 3.18, MS, = 17349, although 
too complex to be interpreted. 

Corrected GSR analysis.—Although 
the EMG and respiration data leave 
little room for speculation regarding 
their possible influence upon the GSR, 
one additional evaluation of this possi- 
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bility was undertaken. All criterion 
GSRs which occurred concurrent with 
or within 5 sec. after a forearm move- 
ment or breathing irregularity were 
discarded from the data. This was 
done for all three periods during which 
observations were made (i.e., rest, re- 
inforcement, and extinction). Figure 
5 shows the rescored GSR frequency 
data, transformed and expressed rela- 
tivity in 2-min. blocks as before. As 
evidenced in this figure, the omission of 
“elicited” GSRs produced essentially 
no changes for either group. Analysis 
of variance showed that the contin- 
gency effect remained highly significant, 
F (1, 40) = 17.88, MS, = 203025, 
during acquisition, while the interaction 
was not significant. During extinction, 
the effect of Contingency was now sig- 
nificant, F (1, 40) =4.52, MS. 
= 2274.83, while the other effects were 
not. 


Discussion 


The purpose of this experiment was to 
examine forearm movements and respira- 
tion rates during operant GSR condi- 
tioning to determine whether these so- 
matically mediated responses might be 
responsible for apparent GSR condition- 
ing effects. The results provided little 
basis at all for assuming that the GSR 
changes depended upon concomitant 
changes in the other two measured re- 
sponses. Rather, the present results sug- 
gest that operant GSR conditioning oc- 
curred in the absence of identifiable medi- 
ation by somatic behavior. 

_ It goes without saying that these find- 
ings do not rule out completely the pos- 
sibility that other somatic mediators, 
which were not measured, may have 
played a role in the operant GSR condi- 
tioning observed. All we have done here 
is more or less eliminate what appeared, 
a Priori to be the two most obvious pos- 
sibilities, obvious because it is well-known 
that voluntary deep breaths or forearm 
movements frequently are sufficient to 
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Fic. 5. Average percentage of trans- 
formed corrected GSR frequency of contin- 
gent and noncontingent groups during rein- 
forcement and extinction, in 2-min. blocks. 


produce GSRs. While we advocate this 
“cautious” view, we cannot overempha- 
size that sufficiency is not in question in 
any of the present series of studies. 
What is in question is necessity, and ne- 
cessity has not here been supported. 

It is of some importance to note that 
the strength of GSR conditioning in the 
present study increased more or less 
continuously throughout training. This 
finding is at variance with earlier results 
(Kimmel & Kimmel, 1963) showing a 
peak in the acquisition function after the 
first 2 min. and a decline thereafter. 
There are two possible explanations of 
this difference. First, as Shapiro et al. 
(1965) have observed, the form of the 
acquisition function is determined in part 
by the prereinforcement operant rate. In 
their study high operant-rate Ss were 
shown to decline slowly in rate of re- 
sponding during conditioning while low 
operant-rate Ss increased progressively. 
The Ss in the present study were prob- 
ably more similar to their low operant 
group (1.0 responses per minute), since 
our mean resting-operant rate was 
1.16/min. Another possible explanation 
of better conditioning in the present 
study is that only responses greater than 
1% of the basal resistance were re- 
inforced. On the assumption that re- 
sponses failing to meet this criterion may 
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have been local rather than central in 
origin, it is not surprising that the pres- 
ent technique was more effective. It may 
also be possible that the use of this re- 
inforcement criterion placed increasingly 
greater demands upon S to make large 
responses, since there was some tendency 
for the basal resistance to rise gradually 
during the entire session and, thus, for 
the reinforcement criterion to rise also. 
Such gradual “shaping” of response mag- 
nitude, however, is in complete accord 
with the assumption that “true” operant 
conditioning was occurring, 

The very slight tendency for the con- 
tingent Ss to move more than the non- 
contingent Ss during the early minutes 
of acquisition parallels similar findings 
by Rice (1964). However, as indicated 
in Fig. 5, subtraction of all such move- 
ment-associated GSRs did not attenuate 
the effect as the somatic-mediation hy- 
pothesis would predict. If anything, 
the effect was greater during acquisition 
after rescoring and was significant dur- 
ing extinction. 

It is also quite clear that GSR condi- 
tioning was not dependent upon respira- 
tion changes in this study. The slight 
difference between groups in respiration 
rate shown in Fig. 3 occurred only 
through Min. 3 through 10, while the 
GSR conditioning effect was progressive 
through Min. 16. The respiration- 
irregularities data clearly revealed no 


H. B. VAN TWYVER AND H. D. KIMMEL 


group differences during acquisition and 
an essentially meaningless interaction in 
extinction (primarily cubic and quartic), 
We are, thus, strongly moved by these 
results toward acceptance of the possi- 
bility of operant conditioning of the un- 
elicited GSR, without somatic mediation, 
and toward rejection of the proposition 
that responses mediated by the autonomic 
nervous system cannot be conditioned 
instrumentally, 
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3 studies are reported investigating human preference for immediate 
vs. variably delayed shock with and without a warning signal. The 
information-preparation theoretical statements of Berlyne and Perkins 
were used for predicting that immediate shock would be preferred 
under 100% shock with no warning signal for the delayed condition 
and that this preference would be prevented or reversed when shock 
occurred only 2596 of the time and a warning signal preceded delayed 
shock. Study 1 (N —20) used an unstructured free-choice task with 
100% shock and no warning signal; Study 2 (N — 20) replicated the 
first study, but used a structured forced-choice task. The final study 
(N =40) presented shock on only 25% of the trials and a 4-sec. 
warning signal preceded delayed shock for an experimental but not 
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control group. The predictions were confirmed. 


Several studies involving human Ss 
are reported similar to the animal 
studies of Knapp, Kause, and Perkins, 
(1959) ; Perkins, Levis, and Seyman, 
(1963) ; and Lockard, (1963), dealing 
with the informational value of warn- 
ing signals preceding aversive stimuli. 

One purpose of the studies reported 
here was to investigate whether a pref- 
erence would develop in humans for 
immediate (predictable) shock vs. 
variably delayed (unpredictable) shock 
(VDS) or for shock preceded by a 
warning signal vs. shock without a 
warning signal. Investigations using 
infrahuman Ss have consistently ob- 
tained preferences for immediate shock 
or for a warning signal preceding 
shock. Only two relevant studies us- 
ing human Ss have been reported 
(Cook & Barnes, Jr., 1964; D’Amato 
& Gumenik, 1960). Both are inade- 
quate for our purposes. D’Amato and 
Gumenik confirmed the notion that un- 
predictable shock is more aversive than 
predictable shock but only 13 of their 
20 Ss preferred immediate to VDS. 
One possible explanation for such a 


weak effect is the masking task used by 
these investigators (predicting which 
of two lights would occur). In the 
study by Cook and Barnes (1964), Ss 
tended to choose a short rather than 
long delay of shock, but only six trials 
were given. Perhaps even more im- 
portant, however, fixed rather than 
varied delays were used and the long- 
est delay was only 8 sec. 

The three studies reported below 
overcome the noted limitations of the 
preceding human studies and reveal 
(Exp. I) the acquisition function of a 
preference over more trials and longer 
durations of variable delay. While the 
first two experiments use 100% im- 
mediate vs. 100% VDS, the final study 
used shock for both conditions only on 
some trials (25%). In addition, this 
latter study employed a warning signal 
preceding VDS for an experimental 
group but not for a control group. In 
the first two experiments a preference 
for the immediate condition was pre- 
dicted. However in the final study, 
the inclusion of a warning signal for 
the delayed condition led to the predic- 
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Fic. 1. Percentage of Ss choosing the im- 
mediate shock side per block of five trials, 


tion that a preference for immediate 
shock would not develop. 

The above predictions derive from 
the theoretical statements of Berlyne 
(1960) and Perkins (1955). Both 
theorists stress that Ss seek informa- 
tion. For Berlyne, information leads 
to reinforcement through uncertainty 
reduction, while for Perkins it allows 
for preparatory responses which mini- 
mize the aversiveness or maximizes the 
attractiveness of stimulus situations. 


EXPERIMENT I 


The first experiment is similar to 
that of D'Amato and Gumenik (1960), 
but with the masking task eliminated 
ie, no instructions were given Ss to 
predict which of two lights would ap- 
pear following their responses, 


Method 
"undergraduate Psychology courses Served as 


Ss. Half were males and half females. 
Apparatus and brocedure.—Two Tooms 


apparatus was a metal Panel with i 
swi tehes: Aow with two spring 
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then left the room and established detection 
and pain thresholds, The Ss were discarded 
if their pain threshold was less than 1.5 ma, 
The pain thresholds determined the intensity 
of shock each S received. The following 
instructions were given over the intercom 
system, 


Now you see on the panel before you two 
switches, Each time I give the command 
“select” I want you to depress one of these 
switches firmly and release it immediately, 
A brief shock will occur on each trial, 
The total number of trials is fixed and 
your response will in no way affect how 
many shocks you receive, Are their any 
questions ? 


Forty free-choice trials were run with an 
intertrial interval of 40 sec. and a shock 
duration of 1 sec. After 20 trials the condi- 
tions were reversed so that the switch pre- 
viously Producing immediate shock now pro- 
duced VDS (20 sec. mean delay) and vice 
versa. The response switch resulting in 
immediate shock was alternated for consecu- 
tive Ss. 


Results 


The data from only 20 Ss are in- 
cluded since 2 Ss were discarded for 
Tesponding to one switch on all trials, 
and 2 for failing to meet criterion shock 
levels, 

Mean percentage of Ss choosing im- 
mediate shock is plotted in blocks of 
five trials in Fig. l. The curve illus- 
trates the gradual acquisition of a pref- 
erence for immediate shock. After 
the first 20 trials the conditions were 
reversed so that the choice which ini- 
tially resulted in immediate shock now 
delivered VDS. 

Preferences were evaluated using a 
three-category chi-square analysis with 
the theoretical frequencies being N/3 
in each cell, For example, out of the 
40 trials 26 choices or more (p < .05) 
to the immediate side were required to 
place Ss in the immediate cell, The 
Same criterion was used for a delay 
Preference, All Ss failing to respond 
with fewer than 26 choices to either 
side were placed in a no-preference 
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TABLE 1 


NuMBER OF Ss PREFERRING IMMEDIATE OR 
VDS ow PRE- AND PostsHirT TRIALS 


Over 
Im- No 
Trials | mediate | Delayed | Prefer. | „Half 
Shocks | Shock? | “ence | tom. 
Preshift 
Obtained 7 3 10 12 
Theoretical] 6.66 | 6.66 | 6.66 | 10 
Postshift 
Obtained | 12 6 2 15 
Theoretical| 6.66 | 6.66 | 6.66 10 


ds Sombra of Ss significantly preferring immediate 
shock, 

b Number of Ss significantly preferring VDS. — 

* Number of Ss making 21 or more responses to im- 
mediate shock, 


category (Table 1). A x? analysis 
indicated preshift immediate preference 
(first 20 trials) was not significant but 
the x? for the 20 postshift trials was 
x? (2) = 7.511, p € .05. 

Results were also analyzed in terms 
of the number of Ss making 21 or more 
of the 40 trials to one side (Table 1). 
While on the first 20 trials more than 
half the Ss responded more frequently 
to the immediate side, the number was 
not significant. However, a significant 
number of Ss (15) made more than 
half their responses to the immediate 
side after reversal x? (1) = 4.5, $ 
« .05. 

These data agree with those of 
D'Amato and Gumenik (1960), but 
not with those of Cook and Barnes 
(1964). The latter found that the 
tendency to choose immediate shock 
appears in full strength initially but 
subsequently results in a U-shaped 
curve. While our Ss tended to choose 
the immediate condition, the function 
reported here is monotonically in- 
creasing. 

An interesting finding of the post- 
experimental interview concerns the 
idiosyncratic hypotheses of some Ss. 
Several Ss adopted a "gaming orien- 
tation" stating that the adoption of cer- 
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tain strategies minimized the intensity 
or reduced the number of shocks. Ob- 
viously such hypotheses reduced the 
sensitivity of the preference test. The 
following experiment attempted to 
overcome this problem. 


EXPERIMENT II 


In addition to retrospective verbal 
report, a structured forced-choice tech- 
nique was used in the second study and 
specific information about the nature of 
our research interests given each S. 


Method 


Twenty undergraduate students from the 
introductory psychology course served as Ss. 
Six were male and 14 female. 

Apparatus and procedure.—The same ap- 
paratus was used except for the probability 
lights. All other apparatus components and 
procedures were the same except for the 
following instructions. 


In this experiment we are interested in 
whether individuals prefer to know when 
shock is going to be given. We are going 
to give you a couple of practice trials so 
that you will have experience under both 
conditions. That is, for one response you 
will receive a shock immediately following 
pressing of the switch and for the other 
response the shock will be delayed. On 
the panel before you are two switches. 
Each time that I give you the command 
*press" I want you to press the designated 
switch firmly down and then release it 
immediately. When the left (right) switch 
is pressed the shock will always be im- 
mediate, while pressing the right (left) 
will result in delayed shock. 


Shock was presented on all 20 trials. 
Each S was instructed to press five times 
to one side and then five to the other, etc. 
The condition experienced first was counter- 
balanced over Ss. After 20 trials S' was told 
he would recive 10 more trials and asked 
what condition he preferred on these trials. 
However, these trials were never adminis- 
tered. After S stated his preference, he was 
given a postexperimental interview and 
released. All Ss were forced to select one 
of the two conditions. The following ques- 
tions were asked during the interview: 


Why did he prefer ——? How did he 
prepare? (This question was asked only 
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if S stated he prepared when asked 
the first question.) Did the shock seem 
as strong when you pressed one switch as 
when you pressed the other? If the 
answer was no, the Ss were asked to 
identify which shock was most intense. 


Each S was asked to rate the strength of 
his preference along a 5-point scale. 


Results and Discussion 


In answer to the question dealing 
with preferences, 19 Ss stated a pref- 
erence for immediate as opposed to 
VDS (p<.01). When asked why 
they preferred the immediate shock, 2 
Ss could not state a reason; 2 Ss 
stated that immediate shock was of 
lower intensity and less painful; 15 Ss 
verbalized in some fashion that the 
VDS made them more tense or 
anxious. Seven of these 15 Ss stated 
VDS was noticeably stronger, 

In summary we can conclude that 
Ss given immediate vs, VDS develop 
a marked preference for immediate 
shock. This finding is consistent 
with those obtained with infrahumans, 
However, it should be noted that the 
preference measured by verbal report 
In a structured situation is more 
marked, Apparently the use of a 
Structured task Tequiring a verbal re- 
port minimizes the adoption of inter- 
fering gaming Strategies, 


Experiment III 


BADIA, McBANE, SUTER, AND LEWIS 


which shock occurs. In contrast, un- 
certainty on the variably delayed side 
could be reduced by providing a warn- 
ing signal preceding shock. For ex- 
ample, if shock is scheduled to occur 
on a small percentage of trials, eg. 5 
out of 20 trials, little information re- 
Sarding the occurrence of shock is 
available for either the immediate or 
delayed condition. However, if a 
warning signal is presented before 
shock on those five VDS trials, precise 
information about shock onset would 
be available only on the delayed side. 
Therefore preparatory responses could 
be made most economically under the 
delay warning condition. In contrast, 
intermittent shock is unpredictable on 
the immediate side, and to insure that 
Preparatory responses occur on shock 
trials they have to occur on all trials. 
Thus, on the immediate side, inappro- 
priate preparation would occur on non- 
Shock trials. The prediction follows 
that the preference for immediate shock 
found in the second study should not be 
established under these latter condi- 
tions. Effects of the warning signal 
Can be further evaluated by using a 
control group given the identical condi- 
tions of intermittent shock but without 
the warning signal on the delayed side. 

preference for immediate shock 
would be predicted for this group, simi- 
lar to Ss of the second study. The 
final experiment deals with the condi- 
tions outlined above, 


Method 


Subjects.—Forty undergraduate students 
from the introductory psychology courses 
Served as Ss. Of these 16 were male and 
24 female. 

Apparatus and brocedure—The same ap- 
Paratus and procedure were used as in Exp. 
IL Instructions were the same except for 
the following passage given the experimental 
group. 


On the delayed shock side there will al- 
Ways be a light and tone before the shock 
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occurs. Shock will occur only on some 
trials and will occur an equal number of 
times for both immediate and delayed 
shock conditions. 


For the experimental group 20 Ss were 
given a total of 40 trials, half to the im- 
mediate and half to the VDS side but in the 
following order: 10 (1)—20 (D)—10 (1). 
The condition experienced first was counter- 
balanced over Ss. Since shock was admin- 
istered on only 25% of the trials, five 
unpredictable shocks occurred for each con- 
dition. The same pattern of scheduled 
shock was used for both conditions, On 
the VDS side a warning signal was pre- 
sented 4 sec. before shock and terminated 
with shock. At the end of 40 trials each S 
was asked what condition he preferred on 
the next 10 trials. However, these trials 
were never administered. Each S was 
given the same postexperimental interview of 
the second study. Identical treatment was 
given the 20 Ss in the control group except 
for the warning signal. 


Results and Discussion 


In line with the prediction, 18 of the 
20 Ss in the control condition stated a 
preference for immediate shock (p 
< .01), while only 10 Ss in the experi- 
mental condition developed a prefer- 
ence for the immediate condition. The 
remaining 10 Ss in the experimental 
group stated a preference for the de- 
layed shock side. It is worth noting 
that the mean rated strength of prefer- 
ence for the two groups is consistent 
with the results of both conditions. 
For Ss in the control condition prefer- 
ring immediate shock, the mean rated 
preference was 4.l, whereas for the 
experimental condition it was 34. 
Analysis of these differences indicated 
that the probability level did not attain 
a rigorous level of significance, ¢ (38) 
= 1.89, p < .10. Since preference rat- 
ings for one condition are relative to 
both conditions, the apparent weaker 
mean rating of the experimental group 
may have resulted from the informa- 
tion provided by the warning signal. 
It may be asked why a preference did 


not develop for the VDS since accurate 
information regarding shock onset oc- 
curred for this condition. A partial 
answer is provided by the postexperi- 
mental interview. Of the 10 Ss in the 
experimental group preferring immedi- 
ate shock, 8 of them indicated they felt 
tense and uncomfortable under the de- 
lay condition. However, 4 of these Ss 
also said they prepared on every trial 
even though shock occurred only on 
some trials. Apparently preparation, 
either physical or cognitive, requires 
little effort and since shock did occur 
immediately following the response, 
preparation could be timed well. The 
effect of this behavior would, of course, 
mitigate the value of the warning sig- 
nal. For the 10 Ss in the experimental 
group preferring the delayed warning 
condition, 8 indicated they wanted to 
know when shock was going to occur. 
Six of these Ss said they prepared for 
shock while 2 Ss “didn’t like to be sur- 
prised.” The remaining 2 Ss were 
unable to state why they preferred the 
warning. 

Alternative interpretations of the re- 
sults can be given, the most obvious 
being an anxiety approach. It could be 
argued that delayed shock in the first 
two experiments was more aversive 
than immediate shock because Ss had 
a longer duration of time to anticipate 
it. While this duration of anxiety in- 
terpretation is appealing and is sup- 
ported by the first two experiments, 
the last experiment fails to provide 
support. In the final study with shock 
occurring unpredictably 25% of the 
time, anticipation of delayed shock fol- 
lowing the response should have oc- 
cured equally for both the experimental 
and control groups. Only after a 
period of delay and just 4 sec. before 
its onset did shock become predictable 
for the experimental group. An 
anxiety interpretation would presum- 
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ably predict a preference for immediate 
shock for both groups since anticipa- 
tion over an interval of time is minimal 
under this condition. This did not 
occur, 
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ASSOCIATIVE INTRUSIONS IN SHORT-TERM RECALL * 


WAYNE A. WICKELGREN 
Massachusetts Institute of Technology 


30 MIT undergraduates listened to a list of 9 letters presented at the 
rate of 4 letters per second and then attempted to recall the letters in 
order. Some lists contained repeated letters, and some did not. The 
letters following repeated letters tended to be substituted for each 
other in recall, by comparison to the frequency of confusing letters in 
the same positions of lists without repeated letters. Such substitutions 
were called “associative intrusions,” and the associative-intrusion phe- 
nomenon was observed whether the repeated letters occurred at the be- 
ginning or the middle of the list, whether 1 or 2 items separated the 
repeated letters, and whether the items following the repeated items 
did or did not have a vowel phoneme in common. The results were 
interpreted as supporting an associative theory of short-term memory. 


Evidence for the importance of serial 
position-to-item associations in short- 
term serial learning has been obtained 
by Conrad (1959, 1960), who demon- 
strated that there is a significant ten- 
dency for an intrusion to be the item 
that occurred in the same position in a 
prior list. 

The most direct evidence for the 
existence of item-to-item associations in 
short-term serial learning was an inci- 
dental finding of a study comparing 
memory for lists with and without re- 
peated digits (Wickelgren, 1965b). In 
lists such as “8391957... ,” that 
have separated repeated digits, the 
digits following the repeated digits tend 
to be substituted for each other in re- 
call. In the example just cited, “1” 
would tend to be substituted for “5” in 
recall and vice versa. We shall refer 
to these types of errors as “associative 
intrusions.” 

Associative intrusions are most easily 
explained by assuming that associations 
are strengthened between the internal 
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Grant MH 08890-02, from the National 
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Service. Further aid was received from a 
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representatives of temporally adjacent 
items and that there is only one internal 
representative of an item regardless of 
how many times it is presented in a list. 
Competition of A-B and A-C associa- 
tions accounts for associative intru- 
sions, while items prior to A and serial 
position serve as differentiating cues 
that make correct responses much more 
frequent than associative intrusions. 

The primary purpose of the present 
study is to replicate the associative- 
intrusion phenomenon in the forward 
direction and determine if the effect is 
significant in the backward direction. 
Associative intrusions in the backward 
direction are confusions in recall of the 
items preceding repeated items. If Ss 
always recalled in a strict left-to-right 
order there would be no reason to ex- 
pect associative intrusions in the back- 
ward direction. However, there is no 
way to guarantee this order of recall 
by S, and in the present experiment no 
attempt was made to guarantee this 
order of writing the items on paper. 
Thus, it would not be surprising to 
find associative intrusions in the back- 
ward direction. 

The secondary purpose is to test the 
generality of the associative-intrusion 
phenomenon over a variety of experi- 
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mental conditions. The conditions 
are: (a) the position of the repeated 
items in the list (beginning or mid- 
dle) ; (b) the number of items separat- 
ing the repeated items (one or two) ; 
and (c) the phonemic similarity of the 
items following the repeated items 
(similar or not similar). 

According to the associative-memory 
explanation of the phenomenon, one 
would expect to find associative intru- 
sions under all of these conditions and, 
without a quantitative model, one could 
not predict that the effect would be 
greater under one of these conditions 
than under another. On the other 
hand, if the effect depends on strategies 
connected with the recognition of repe- 
tition, then the phenomenon should be 
Stronger for repeated items at the be- 
ginning, rather than the middle, of the 
list and stronger for repeated items 
Separated by one, rather than two, 
items. The reason for this is that the 


Substitution of one for the other. If 
this were so, then the explanation of 


Rather 
explanation in terms of the tendency to 
make a similar error, which combines 
With the presence of a nearby similar 
item, to greatly enhance the proba- 
bility of the nearby item intruding in 
recall. The previous study was not 
systematically confounded by this fac- 
tor. Also, digits show virtually no 
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systematic substitution errors in recall 
(Conrad, 1959; Wickelgren, 19652). 
Nevertheless, digits do have common 
phonemes, slight similarity effects 
could summate in a very nonlinear 
fashion with the presence of a nearby 
item, and the previous findings were 
based on a relatively small number of 
different lists. Thus, it would be in- 
teresting to compare the magnitude of 
the effects for phonemically different 
items following the repeated items. 
This is best achieved, not with digits, 
but with letters, where large phonemic 
similarity effects on intrusions are ob- 
tained (Conrad, 1964; Wickelgren, 
19652). 


METHOD 


Procedure. —The Ss listened to a list of 
nine letters presented at the rate of approxi- 
mately four letters per second and then had 
15 sec. in which to attempt to recall the let- 
ters in the correct order by filling in nine 
boxes on their answer sheet. The next trial 
followed immediately after the 15-sec. recall 
period. Each list contained two /ē/ letters 
(B, C, D, E, G, P, T, V, Z), two /&/ letters 
(F, L, M, N, S, X), two /ā/ letters (A, H, 
J, K), two /ū/ letters (U, Q, W), and one 
/i/ letter (I, Y). The experiment was re- 
corded on tape and lasted about 40 min. 

Design.—There were nine different types 
of lists, eight types with repeated items and 
one type with no repeated items. The cight 
types with repeated items were all the com- 
binations of two places for the repeated items 
(beginning or middle of the list), two 
degrees of Separation of the two occurrences 
of the repeated item (one or two intervening 
items), and two types of items following 

e two occurrences of the repeated item 
(phonemically similar or dissimilar items). 
Phonemically similar items were letters with 
à common vowel phoneme, /e/, /&/, /a/, or 
/ü/. When the repeated items were in the 
middle of the list and separated by one item, 
the repeated items were in Positions 4 and 6 
of the list. When the repeated items were 
in the middle of the list and separated by 
two items, the repeated items were in Posi- 
tions 3 and 6. When an item was repeated, 
it used up both Occurrences of its voweltype 
of letter, ie, if the repeated letter was "B," 
fhere was no other /ē/ letter in the list. 
The nine different types of lists, with a brief 
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TABLE 1 
REPETITION PATTERNS OF Lists USED IN EXPERIMENT 
Name Description Example 
AD All-different lists (no repeated letters) BLVKNUIQH 
(ijij)n | Repeated letters separated by one at inning of list phonemi- | DHDKQYMSW 
cally similar letters following repeated letters ` 
(ijik) | Repeated letters separated by one at beginning of list phonemi- SQSAVWJBI 
cally different letters following repeated letters 
(ijij)u | Repeated letters separated by one in middle of list phonemically | VUWASAFIT 
similar letters following repeated letters 
(ijik)u | Repeated letters separated by one in middle of list phonemically | HNWCSCAIU 
ifferent letters following repeated letters 
(üjlif)n | Repeated letters separated by two at beginning of list phonemi- | JSVJFGYQU 
cally similar letters following repeated letters 
(ijlik)p | Repeated letters separated by two at beginning of list phonemi- | KPWKLXQIT 
cally different letters following repeated letters 
(ijlij)y, | Repeated letters separated by two in middle of list phonemically WQFKGFHIP 
similar letters following repeated letters 
(ijlik) bate letters separated by two in middle of list phonemically NQTHUTSJY 
ifferent letters following repeated letters 
Note.—Sim = (jia + (iu + Gilifs + (Gi)w; Dit = (ika + (iiku + Gis + (libu; B Sim 
= (gi) + B; B Diff = (ijik)p + (lik)s; M Sim = Gi) + (ilij): M Diff = (ijik)u + Gilik)u; 
1 Sim is + (jiju; 1 Diff = (jik)s + (Hik)M; 2 Sim = (Glij)s + Gjli))w; 2 Diff = (ijlik)B + Gili). 


M 1 = (jij) + Giu; M 2 = (ili) + Gilil)w. 


name for each and an example of each, are 
shown in Table 1. Table 1 gives names for 
various combinations of these list types. 

The eight types of lists with repeated items 
occurred three times each in a block of 30 
trials (once with the repeated letter being an 
/ē/ letter, once being an /é/ letter, and once 
being an /à/ letter). Lists without re- 
peated items, all-different lists, occurred six 
times in a block of 30 lists. There were four 
blocks of 30 lists in the session, or 120 trials 
altogether. The 120 trials followed one 
after another with no extra time between 
blocks, so Ss had no knowledge of the block 
structure of the experiment. 

Subjects—The Ss were 30 Massachusetts 
Institute of Technology undergraduates tak- 
ing psychology courses. They participated 
in the experiment as part of their course 
requirements, 


RESULTS 


* Associative intrusions are only pre- 
dicted following or preceding correctly 


recalled repeated items. Thus, only 
cases where the repeated item was cor- 
rectly recalled were scored for intru- 
sions in the following or preceding po- 
sition. The frequency with which 
these intrusions were associative, as 
opposed to nonassociative (some other 
incorrect letter), is shown in Table 2. 
For example, slightly over 23% of the 
115 intrusions occurring after (cor- 
rectly recalled) repeated letters in 
(ijij)e were associative intrusions. 
That is, 23% of the intrusions were 1 
for j in the second position or j for j 
in the fourth position. 

To establish the associative intru- 
sion phenomenon, we need to deter- 
mine whether the frequency of associ- 
ative intrusions is greater than would 
be expected because of the presence of 
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TABLE 2 
RELATIVE FREQUENCY OF ASSOCIATIVE 


VS. NoNassociATIVE FORWARD AND 
BACKWARD INTRUSIONS 


oo 


Forward Instrusions 


Associa. Diferent 
List tive In: N | Control Differ- 
Type | trusions In- ence 
(%) trusions 
(%) 
(ie | 23+ | 115] 23 47} +0 
(ijik)p | 16 134 8 | 226] +8 
(iim | 24 Jams} s | 43 +19* 
GHb)w | 24 Jus) 13 | 426 +11* 
(ijlij)p | 33 118| 24 38| +9 
(ijlik)p | 23 1106] 6 | 193 Tie 
(ijlij)u | 16 91| 6 | 18|10 
QGHk)w| 9 99| 7 ]212| +2 
Sim 25 |435] 16 |146| +9% 
Diff 18  |454| 8 | 7471 +100 
BSim | 28 | 233] 54 85| +4 
B Dif | 19 [240] 7 |409 +12** 
M Sim | 21 200| 5 61 | +16* 
M Dif | 17 |214] 9 |338 +8 
1Sim | 24 |226| 14 90 | --10** 
lDif | 20 | 249] 10 | 352 +10* 
2Sim | 26  |209| 18 56] +8 
2 Dif | 16  |205| 7 | 395 +9** 
Backward Intrusions 
M1 LOE lat lata] 
M2 17 99 | 7 | 125 vo 
Pura « .05, 
5 $i 


the two letters in close proximity in the 
same list. To do this, it is necessary 
to Compare the frequency of associative 
intrusions to the frequency of confus- 
ing letters in the same Positions of lists 
containing no Tepeated letters 
different lists). The frequency of such 
intrusions in the control all-different 
lists (following correctly recalled prior 
or subsequent letters) is also shown in 
Table 2. The Control frequency for 
each type of list and each type of asso- 
ciative intrusion (forward or back- 
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ward) is obtained by scoring the all- 
different lists exactly as the list-type 
containing the repeated letters was 
scored. Also, in lists where the re- 
peated letters were followed by pho- 
nemically similar letters, only those 
control all-different lists were scored 
where the comparable intrusion errors 
were phonemically similar, When the 
associative intrusions were phonemi- 
cally different, only those control all- 
different lists were scored where the 
comparable intrusions were phonemi- 
cally different, 

In the backward direction no at- 
tempt was made to manipulate simi- 
larity Systematically, so cases where 
the prior items were similar or differ- 
ent are lumped together. “Similar” 
and “different” are always kept sepa- 
tate in the totals in Table 2 for intru- 
sions in the forward direction because 
intrusions tend to be similar to the 
Correct item, and the frequency with 
which similar items followed repeated 
items was greater than the frequency 
with which similar items appeared in 
the same positions of all-different lists. 
This was not true in the backward 
direction. 

The relative frequencies of associ- 
ative and control intrusions were de- 
termined for each S, and the signifi- 
cance level of the differences in relative 
frequency between associative and con- 
trol intrusions was determined by the 
Wilcoxon matched-pairs signed-ranks 
test. Since the direction of the differ- 
ence was definitely expected in ad- 
vance and all differences were in the 
expected (positive) direction, one- 
tailed tests were used. These signifi- 
cance levels are shown in Table 2 next 
to the difference in the average relative 
frequencies of associative and control 
intrusions for lists of each type. 

„In the forward direction, all eight 
list-types show a greater frequency of 
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associative intrusions than comparable 
confusions in all-different lists, and 
three of the differences are significant. 
In the backward direction, both list- 
types show a greater frequency of asso- 
ciative intrusions than comparable con- 
fusions in all-different lists, and one of 
these differences is significant. 

When the total number of intrusions 
is increased by combining conditions in 
the totals shown in Table 2, 7 out 
of 10 comparisons are significant and 
all are in the expected direction. The 
closest it is possible to come to an 
overall test of statistical significance 
is provided by the total Sim and the 
total Diff, both significant at beyond 
the .01 level. 

There do not appear to be any con- 
sistent differences in the magnitude of 
the associative intrusion phenomenon 
over the different conditions. Fur- 
thermore, there is no practice effect on 
the relative frequency of associative 
intrusions as measured by computing 
the difference between associative in- 
trusions and controls for similar and 
different conditions (Total Sim and 
Total Diff) separately for each of the 
four blocks in the experiment. For 
Blocks 1-4, respectively, the differ- 
ences in percent for Total Sim are: 
—1, +5, +5, +21 and for Total Diff 
are: +13, +16, +2, +9. 


Discussion 


The associative intrusion phenomenon 
has been replicated, and it appears to 
hold under all of the conditions of the 
Present experiment, These results are 
completely consistent with the theory that 
short-term associations are formed (more 
Properly, strengthened) between adjacent 
items in a serial list. This associative 
theory of short-term memory is illus- 
trated in Fig. 1, which shows the internal 
representatives of items as labeled circles 
and the internal representatives of tem- 
Poral order as arrows standing for the 
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Illustration of associative theory 
of memory for 83919 5 7. 


Fic. 1. 


direct forward associations between the 
internal representatives of items. Notice 
that although “9” is presented twice, 
there is only one internal representative 
of “9,” which is simply activated twice 
and which has, therefore, two direct for- 
ward associates. Remote associations 
and serial position-to-item associations 
have been omitted from Fig. 1. Thus, 
it is not correct to conclude that associ- 
ative intrusions should be as frequent as 
correct responses following correct recall 
of “9.” However, because of the repeti- 
tion of “9,” there should be a greater 
frequency of substituting “1” for “5” and 
“5” for “1” in this list than in a list 
where there was no repeated item. 

A nonassociative memory is one in 
which successive items are stored in an 
ordered set of cells (boxes, locations, 
registers, etc.) and an item can be stored 
in more than one cell. Such a non- 
associative memory is illustrated in Fig. 
2. The internal representative of an 
item is a pattern stored in a cell. The 
internal representative of temporal order 
is the fixed ordering of cells. It is par- 
ticularly important to note that, since any 
item can be stored in the “first” cell or 
the “second” cell, etc, a repeated item 
can be stored in more than one cell. In 
retrieval S reads out the contents of each 
cell, starting with the “first” cell and pro- 
ceeding in order to the end of the list. 
Errors are due to decay of the pattern in 
each cell and to errors in the order of 
storing in or reading out of cells. So 
long as the errors in storage or retrieval 


858 


EbEbEbEPELERD 


Fic. 2. Illustration of nonassociative theory 
of memory for 8391957. 


ordering of cells do not depend upon the 
contents of a cell, there is no reason to 
expect associative intrusions with such 
a monassociative short-term memory. 
Finding associative intrusions is rather 
strong evidence against such a non- 
associative theory of short-term memory. 

Nevertheless, it is logically possible for 
a nonassociative memory to produce asso- 
ciative intrusions, provided we make a 
number of additional assumptions, many 
of which have no plausibility on other 
grounds. For example, assume that 
read-out is nondestructive, that is, the 
pattern in a cell is not destroyed when 
its contents are scanned in recall, Sec- 
ond, assume that the act of writing the 
item on paper requires the scanning 
mechanism to lose the information as to 
which cell ("frst," "second," etc.) was 
last scanned. Third, assume that the in- 
formation as to the contents of the cell 
last scanned is not lost, but is available 
to the scanning mechanism as it tries to 
find the next cell to scan. What the sys- 
tem might do under these circumstances 
is to look for a cell with the same con- 
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tents as the one last scanned, then scan 
the "next" cell in the fixed order. 


Such 
a system will produce associative intru- 
sions, though additional ad hoc assump- 
tions are required in order to explain 
why the system does not make associ- 
ative intrusions as often as correct re- 
sponses following repeated items. 

All in all, the nonassociative explana- 
tions of associative intrusions are ex- 
tremely inelegant, while the associative 
explanation is simple and plausible. 
Thus, it seems likely that the prior item 
is an important cue in short-term mem- 
ory for serial lists and that there is only 
one internal representative of an item 
no matter how many times it is presented. 
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FUNCTIONS RELATING CHILDREN’S OBSERVING BEHAVIOR 


TO AMOUNT AND RECENCY OF STIMULUS 
FAMILIARIZATION ' 
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108 kindergarten children controlled the exposure time of a series of 
black and white figure drawings. Subsets of 18 stimuli were repeated 
in successive phases of the experiment so that the effects of amount of 
stimulus familiarity could be studied within phases. Following 5 fa- 
miliarization phases, all Ss received test phases at 0-day (5 min.), 
2-day, and 7-day intervals. Reciprocal observing time increased sig- 
nificantly (observing time decreased) as a function of increasing 
amount of familiarity in both the familiarization and test phases. Al- 
though the overall means for the test phases differed significantly, no 
interaction was found between time interval and amount of familiarity, 
indicating that the familiarity functions were essentially parallel for 


the time intervals studied. 


It has been demonstrated in a variety 
of situations that human Ss_ prefer 
novel as opposed to familiar stimuli 
(eg., Berlyne, 1958; Fantz, 1964). 
In two recent studies (Cantor & Cantor, 
1964a, 1964b), kindergarten children 
were familiarized with a set of pictures 
and were then allowed to control the 
duration of exposure of these “familiar” 
pictures as well as a set of “novel” pic- 
tures not seen during the familiariza- 
tion period. In both experiments, Ss 
spent significantly more time observing 
the novel as compared with the familiar 
pictures. The time interval between 
familiarization and testing was varied; 
and in both studies, the difference be- 


1This study was supported by Public 
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National Institute of Child Health and Hu- 
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National Institute of Mental Health. Thanks 
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Cooperation in providing Ss for this study: 
goes Community Schools, Sam Wiley, 

uperintendent; West Branch Community 
Schools, John Spiess, Superintendent. 


tween the familiar and novel conditions 
did not differ for a 0-day (5-min.) in- 
terval as compared with a 2-day inter- 
val. In the second study, it was also 
found that a large amount of familiari- 
zation produced a significantly greater 
difference between the novel and 
familiar conditions than did a small 
amount of familiarization. 

The purpose of the present experi- 
ment was to extend the earlier investi- 
gations of amount of familiarization 
and the time interval between familiari- 
zation and testing, using more levels of 
each variable. In order to increase the 
precision of the experiment, all vari- 
ables were studied on a within-Ss basis, 
except for the necessary counterbalanc- 
ing of stimuli. Kindergarten children 
were allowed to control the exposure 
time of a number of black and white 
figure drawings presented in five un- 
interrupted familiarization phases fol- 
lowed by three test phases at intervals 
of 0 days (5 min.), 2 days, and 7 days. 
The introduction of new stimuli in each 
phase resulted in an increasing number 
of levels of stimulus familiarity over 
the course of the experiment. The ob- 
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serving time scores were used to obtain 
familiarity functions for the various 
phases of the experiment. 


METHOD 


Apparatus—The apparatus used in this 
study was essentially the same one described 
in the two earlier studies (Cantor & Cantor, 
1964a, 1964b). The stimuli were photo- 
graphed on individual frames of 16-mm. film 
and were presented by means of a Dunning 
Animatic filmstrip projector. The S sat at 
a small table in front of a 7 X 9.5-in. pro- 
jection screen. By pushing on a rubber ball 
attached to a 4-in. metal rod, S advanced 
one frame of the film and in this way con- 
trolled the exposure time of pictures through- 
out the experiment. The duration of ex- 
posure of each picture was recorded to the 
nearest .1 sec. 

Design and stimuli—The stimuli were 18 
black and white drawings taken from the 
Welsh Figure Preference Test (Welsh 1959). 
These included primarily low-meaningful 
configurations and geometric patterns vary- 
ing in complexity? The pictures were 
divided into six sets of three pictures each. 
One stimulus set was introduced in the first 
familiarization phase and an additional set 
was added in each of the remaining four 
familiarization phases. Each stimulus set 
consequently represented an increasing 
amount of familiarity over successive phases 
in which it appeared; €g., the stimuli in the 
set introduced in Phase 1 were novel (F, 
amount of familiarity) in that phase, had 
been seen once before (Fi) in Phase 2, etc., 
and finally, had been seen four times before 
(F.) in Phase 5. Within a particular phase, 
each stimulus set represented a different 
amount of familiarity, The order of presen- 
tation of the stimuli within a phase was ran- 
domly determined within subsets consisting 
of one member of each familiarity level. 
The same sequence of familiarity values was 
used for all Ss, 


In the test phases, a sixth novel (Fo) set 
? A different sample of Welsh figures used 


inan earlier study (Cantor & Cantor, 1964b) 
was reproduced in that article. In the pres- 
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of three pictures was added to the five sets 
that represented five amounts of familiarity 
(Fi-Fs) at the end of the familiarization 
phases. One member of each stimulus set 
was assigned to one of three test phases 
that were presented at 0-day, 2-day, and 7- 
day intervals. Thus, six pictures, each 
representing a different amount of familiar- 
ity, were presented in each test phase. In 
an attempt to increase the number of obser- 
vations for each amount of familiarity in the 
test phases, the six stimuli were presented 
twice in successive random sequences within 
each test phase. 

Six groups were used in the familiarization 
phases for counterbalancing the stimulus sets 
in relation to amount of familiarity. Each 
of these six groups was further divided into 
three subgroups in order to counterbalance 
the assignment of stimuli to the three test 
phases. The use of these 18 counterbalancing 
subgroups insured that in each phase of the 
experiment, all 18 stimuli contributed to each 
of the familiarity levels, 

The pictures were photographed as black 
figures on a white background on individual 
frames of 16-mm. movie film with black 
frames occurring between pictures. Because 
the observing time scores tend to be un- 
usually long on the first stimulus presenta- 
tion, a novel Preliminary picture (used only 
for this purpose) was placed at the begin- 
ning of the familiarization sequence and be- 
for each test-phase sequence. The scores 
on these preliminary pictures were not in- 
cluded in the data. 

Procedure—The child was seated and 
taught to operate the response lever, using a 
series of four preliminary pictures that were 
provided only for this purpose. Before the 
familiarization phases began, S' was told, 
"You can look at each picture as long as 
you want to until I tell you that the pictures 
are all gone" The S then projected all 45 
Pictures making up the five familiarization 
phases, with no intervals occurring between 
successive phases. During the 5-min. inter- 
val between familiarization and the first test 
phase, S was asked to draw a picture of a 
man. The remaining test phases were given 
2 days and 7 days following familiarization. 

Subjects—The Ss were 108 5- and 6-yr.- 
old kindergarten children in the College 
Community and West Branch Community 
School systems near Iowa City, Iowa. Boys 
and girls were randomly assigned in equal 
numbers to each of the 18 counterbalancing 
subgroups. 
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RESULTS 


A reciprocal transformation was used 
because of the tendency for the observ- 
ing time scores to be positively skewed. 
The mean reciprocal observing time 
scores are plotted over amount of 
familiarity for each familiarization 
phase on the left side of Fig. 1. (The 
scores on the ordinate have been in- 
verted so that a high point on the 
graph represents a long observing 
time.) These means were computed 
for all counterbalancing subgroups 
combined, the basic score being each 
S’s mean of the reciprocal time scores 
for the three stimuli representing the 
particular amount of familiarity. It 
may be seen that observing time was a 
decreasing function of amount of famil- 
larity within each phase. Also, observ- 
ing time decreased over successive 
phases of familiarization. 

The test-phase means, plotted across 
amount of familiarity for each time in- 
terval, are shown on the right side of 
Fig. 1. In computing these means, the 
amounts of familiarity were combined 
in pairs, with the exception of the F, 
(novel) stimulus. The first six stimuli 
in each test phase represented one of 
the amounts of familiarity, F,-F;. The 
second six stimuli (repetitions of the 
first six) each represented one of the 
amounts, F,-F,. Thus, in the total set 
of 12 stimuli, there was one F, score, 
one F, score, and two scores for each 
of the other amounts, F,-F,. The 
test-phase means shown in Fig. 1 there- 
fore are based on different numbers of 
scores at the various points. The F, 
means are presented separately in order 
not to obscure the tendency found in 
both the familiarization and test phases 
for the familiarity functions to decrease 
sharply from the F, to the F, points. 

Reciprocal observing time scores 
were analyzed for each of Familiariza- 
tion Phases 2, 3, 4, and 5, using analy- 
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time (plotted with an inverted ordinate) for 
familiarization and test phases with counter- 
balancing groups combined (N = 108). 


ses of variance in which amount of 
familiarity was a within-Ss effect and 
counterbalancing a between-Ss effect. 
In each of the four analyses, the only 
significant effect was amount of famil- 
iarity, the obtained values for Phases A 
3, 4, and 5 being, respectively, F (1, 
102) 2627, p<.025; F (2, 204) 
= 21.29, p < .0001; F (3,306) = 12.78, 
p < .0001; F (4, 408) = 11.63, p< 
0001. Follow-up tests using pairs of 
familiarity means yielded significant F 
values (at the 5% level or higher) for 
the four F, vs. F, differences in Phases 
2, 3, 4, 5 and for the F, vs. F, differ- 
ence in Phase 3. 

The familiarization data were also 
analyzed separately at each of four 
familiarity levels, Fẹ F,, Fẹ» Fs, in 
order to test the change in reciprocal 


_ observing time over successive phases. 


In these analyses, familiarization phase 
was the within-Ss variable and stimu- 
lus counterbalancing the between-Ss 
variable. In each analysis, the only 
significant effect was familiarization 
phase, the obtained values for the 
familiarity amounts F, F,, Fa and F, 
being, respectively, F (4, 408) = 22.92, 
p < .0001; F (3, 306) = 1270, p< 
0001; F (2, 204) = 4.05, p < .025; 
F (1, 102) = 424, p < .05. 
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The test-phase data plotted in Fig. 1 
were analyzed using an analysis of 
variance in which the two hetween-Ss 
factors were familiarization-phase coun- 
terbalancing and test-phase counter- 
balancing, and the within-Ss factors 
were amount of familiarity (F,, EIS 
F,4, Fy) and test interval ( 0-day, 2- 
day, 7-day). A significant effect was 
obtained for amount of familiarity, F 
(3, 270) = 2527, p < .0001, the means 
for the F,, Fy, Band F,- amounts 
being .257, .290, 301, and .316, re- 
spectively. The only other significant 
effect was test interval, F (2, 180) 
= 14.51, p < .0001, the means for the 
0-day, 2-day, and 7-day intervals being 
320, .261, and 292, respectively, 
Tests for individual pairs of means 
were run for both of these variables, 
and the differences were significant at 
the 5% level or higher for all pairs of 
both familiarity amounts and test inter- 
vals. Thus, there was a significant 
decrease in observing time (increase in 
reciprocal observing time) as the 
amount of familiarity increased, and 
observing time was shortest for the 0- 
day interval and longest for the 2-day 
interval. Tt will be noted that there 
was no significant interaction between 
amount of familiarity and test interval, 
F (6, 540) = 1.72, $ > .10, so that the 
familiarity functions for the three inter- 


vals may be considered to be essentially 
parallel. 


Discusston 


The familiarization data are quite clear- 
cut in showing a decrease in observing 
time as a function of amount of familiar- 
ity. These results are consistent with 
earlier findings (Berlyne, 1958; Cantor 
& Cantor, 19643, 1964b) that showed 
longer observing times for novel as com- 
pared with familiarized stimuli, The 
familiarity functions appear to be nega- 
tively accelerated, the differences between 
the F, and F, amounts being relatively 
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large in each case. Thus, if reciprocal 
observing time is used as an index of in- 
terest value, the greatest loss in interest 
value occurs following the first exposure 
to the stimulus. 

The significant decrease in observing 
time over familiarization phases can prob- 
ably be attributed to a general boredom 
effect. Tt will be noted, however, that the 
familiarity functions were essentially 
parallel, indicating that their slope was 
not affected by a general increase in 
boredom. Further (nonstatistical) evi- 
dence consistent with the suggested effect 
of boredom is found in comparing the 
general levels of responding in the vari- 
ous test phases with those in the familiari- 
zation phases. The lowest observing 
times (highest reciprocal times) occurred 
in Phase 5, and the subsequent 5-min. 
test apparently dissipated a certain amount 
of boredom, since the observing time in 
the first test phase increased to a level 
comparable to that found in Familiariza- 
tion Phase 4. The high observing times 
following the 2-day interval suggest a 
large degree of recovery of the original 
interest value of the stimuli to approxi- 
mately the level shown in Familiarization 
Phase 2. The decrease in observing time 
from the 2-day to the 7-day test is some- 
what puzzling and perhaps reflects the 
children’s general loss of interest in the 
experiment. It should be noted in this 
connection that, in the test phases, delay 
interval is confounded with total amount 
of prior familiarization experience. The 
major interest, however, is in the slope 
of the familiarity functions for the vari- 
ous delay intervals. The finding of 
parallel functions strongly suggests that 
neither delay interval nor total amount of 
Prior familiarization influenced the slope 
of the functions. These results are con- 
sistent with those found for the 0-day and 
2-day intervals in the earlier studies in 
which delay interval was not confounded 
with amount of prior familiarization 
(Cantor & Cantor, 1964a, 1964b). 
Whether the slope of these familiarity 
functions is also stable over periods of 
weeks and months remains a question for 
further investigation. 
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CHANGES IN RESPONSE 
SHIFTS IN THE PIT 


LATENCY Fi OLLOWING 
CH OF A SIGNAL? 
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A set of 3 experiments, c S 
latencies to a tone following an adaptation series of 20 tones. The 


results indicate response time to increase toa 


of the test tone differed from the Pitch, or average pitch, of the pre- 


ceding series, 
series, 


following a variable-pitch than following a Constant-pitch series, 
iti i und to be minimal when the pitch 


Larger differences resulted in a 


Were greater 
In 


of the test signal coincided with the mean, as Compared to the mode 


of tones upon Tesponse latency (RL) 
to a test tone, 


groups received 
the same pitch 


tone equal to or higher than those of 
the adaptation 


the constant-pitch adaptation series was 
a variable-pitch series, In 
Exp. III, RLs were compared when 
test pitches equaled, Tespectively, the 
mean, modal, the midrange pitch of the 
adaptation Series, 

These three experiments are similar 
to a series recently reported by Hard- 
esty and Bevan (1965) in Which the 
independent variable was duration of 
the presentation interval, 
"m independent &roups received 
the adaptation Signals at intervals of 
i 20, or 40 Sec. and the test 
* The experimental work for this report 


The manuscript 
B. was in residence 


tone after either 5, 10, 20, 40, or 80 sec. 
When the test interval was equal to the 
adaptation interval, ARI, was effec- 
tively zero but increased as the test in- 
terval was either lengthened or short- 
ened relative to the adaptation interval. 
In their Exp. IT, variable-interval 
Schedules were substituted for the con- 
stant-interval schedules of Exp. I with 
means at 10, 20, or 40 sec. The test 
intervals were the same. Response 
latencies were shortest when the test 
interval coincided with the series means 
and increased as the test intervals dif- 
fered from these values. Indeed, the 
functions describing the relationship 
between ARL and test interval duration 
for Exp, IT did not differ significantly 
from their Companion curves of Exp, I. 
In Exp. IIT three groups received an 
adaptation Series constructed so as to 
have its mean, mode, and midrange at 
different values. Test intervals were 
Set at each of these magnitudes. Re- 
Sponse latencies were found to be short- 
est when the test interval equaled the 
mean. They were significantly longer, 
and did not differ for the mode and 
midrange durations, 

Hardesty and Bevan (1965) inter- 
Preted their data to mean that a mecha- 
nism similar to the adaptation level in 
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RESPONSE LATENCY TO SHIFT IN PITCH 


sensory judgment operated in deter- 
mining the duration of the latencies on 
the test trial. The Ss were inferred to 
have pooled successive durations during 
the adaptation series, deriving a trial- 
by-trial average, which became the best 
estimate of the interval on each succeed- 
ing trial. As the test interval deviated 
from this expectation, RLs were in- 
creased. In line with adaptation-level 
theory, the mean was identified as the 
best approximation of S’s bench mark 
for response. 

The present experiments were car- 
ried out to determine if variation in a 
substantive property of the signal— 
here pitch was arbitrarily selected as 
the independent variable—would exert 
the same influence upon test latency as 
changes in the temporal pattern of the 
stimulus program. 


EXPERIMENT I 
Method 


The design employed 12 independent 
groups: A control group in which the test 
signals had the same frequency as that of the 
invariant adaptation series and 11 experi- 
mental groups, each with a test signal higher 
than the series pitches. Test signals dif- 
fered in increments of 100 mels. 

Subjects.—Two-hundred and sixteen male 
undergraduates, divided randomly into groups 
of 18 Ss each, served as Ss. 

Apparatus and procedure.—The stimulus 
program for each group was recorded on 
high-fidelity tape and played for S through 
. high-fidelity earphones fitted into Willson 
sound barrier earmuffs. The tones were 
produced by a Hewlett-Packard audio-oscil- 
lator, with rise and decay times controlled 
by a Grason-Stadler electronic switch, Du- 
rations were timed by means of Hunter 
decade interval timers. 

Each S received an uninterrupted schedule 
of 21 4 sec. tones at intervals of 10 sec. The 
first 20 (the adaptation series) had a fre- 
quency of 295 cps and an intensity of 60 db. 
(SPL). The final (test) tone for the 
several groups was, respectively: 295, 400, 
500, 620, 800, 910, 1,000, 1,325, 1,425, 1,500, 
1,750, and 1,875 cps. Its intensity was 
matched subjectively to the loudness of the 
series tones. 
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A RESPONSE LATENCY IN MSEC. 
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PITCH OF THE TEST SIGNAL IN 100 MEL STEPS 

ABOVE THE PITCH OF THE ADAPTATION SERIES 
Fic. 1, Decrement in response latency as 
a function of the deviation in pitch of a test 
tone above the pitch of tones in a fixed-fre- 
quency adaptation series. (The control 
group, receiving the same frequency on both 
series and test trials, is plotted at 0 on the 
abscissa. The horizontal line at 0 represents 
the mean response latency of the series.) 


The Ss were tested individually. Each S 
was seated before a telegraph key and in- 
structed to press it each time a tone was 
heard. A Standard electric timer, activated 
with the occurrence of each signal, was 
stopped by S's response. Response latencies 
were read to 1/100 sec. and recorded manually 
by E. The first five trials were regarded by 
E as practice trials and not evaluated as data. 
A mean latency was computed for the re- 
maining 15 trials. The difference between 
this score and the latency on the test trial 
was then obtained and is the performance 
measure (ARL) presented in this and the 
two succeeding experiments. 


Results 


Figure 1 presents the results of Exp. 
I. The control group displayed a de- 
crease in latency, reflecting the in- 
fluence of practice or habituation. In 
contrast, ARL for the experimental 
groups is seen to have increased as the 
pitch of the test tone deviated from that 
of the series. This increase resembles 
that observed in the Hardesty and 
Bevan (1965) results. However, a 
maximum is reached at 300 mels and 
increases in pitch beyond this point 
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A RESPONSE LATENCY 
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ABOVE THE MEAN PITCHOF THE ADAPTATION SERIES 


Fic. 2. Decrement in response latency as 
a function of the deviation in pitch of a test 
tone above the average pitch in mels of a 
variable-frequency adaptation series. 


are accompanied by a change in ARL 
back toward the response level of the 
adaptation series, the envelope closing 
at 800 mels. This return to base line 
did not occur in the Hardesty-Bevan 
data. An analysis of variance indi- 
cated a significant difference among the 
several groups, F (11, 204) = 4.99, p 
< .01, and a trend test revealed signifi- 
cant linear, F, (1, 204) = 5.39, p< 
025, quadratic, Fa (1, 204) = 18.37, p 
<.01, and cubic F. (1, 204) = 27.45, 
b < 01 components, 


Experiment II 
Method 


Subjects —Two-hundred forty male under- 
graduates were random. assigned to 
groups of 20 Ss each, : id g 

Procedure —The variable-pitch adaptation 
series, replacing the constant interval series 
of Exp. I, had the following composition : 
two 200-cps, four 235-cps, eight 295-cps, four 
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330-cps, and two 385-cps tones. Pitches 
ranged 100 mels above and below 295 cps 
in 50-mel steps. Order of presentation was 
random. Test tones were those of Exp. I. 


Results 


Figure 2 resembles Fig. 1 in shape. 
A practice effect again is seen in the 
control group. The curvilinear rela- 
tionship between ARL and pitch dis- 
crepancy again obtains in the data. 
However, there are three interesting 
differences between the results of the 
two experiments. The apex of Fig. 2 
is approximately 700 mels above base 
line in contrast to the 300 mels of Fig. 
1. The envelope in Fig. 2 closes some- 
where near 1,100 mels in contrast to 
near 800 in Fig. 1. Finally, the range 
of differences in ARL in Fig. 2 clearly 
exceeds that of Fig. 1. While the 
apex of Fig. 1 represents a decrement 
of 35 msec., that of Fig. 2 is 62 msec.— 
almost twice the smaller value. An 
analysis of variance indicated signifi- 
cant differences in ARL among groups, 
F (11, 228) — 5.37, p<.01, and a 
trend test, significant linear, F, (1, 
228) = 21.28, p < 01, and quadratic 
Fa (1, 228) = 34.93, p < .01 compo- 
nents in the curve, 


Expertment III 


The results of the first experiment 
suggest that ARL reflects the operation 
of an internal standard that guides Ss 
expectation on the test trial. Those of 
the second suggest that this standard 
is derived through the pooling of suc- 
cessive signals in terms of the criterion 
Property—in this case, pitch. The 
purpose of Exp. III was to determine 
the nature of the internal referent. 
Adaptation-level theory, based on the 
Principle of averaging, identifies the 
mean as the best estimate of the refer- 
ence value. Other values represent 
plausible alternatives, A simple ex- 


RESPONSE LATENCY TO SHIFT IN PITCH 


pectancy theory, based on the perceived 
frequency structure of the sequence 
of stimulus events, suggests the most 
frequently occurring value—the mode 
—as the standard. The midrange, as 
the value halfway between the extremes 
and a way of minimizing error, is an- 
other reasonable choice. The strategy 
of Exp. III involved employing these 
three measures of central tendency as 
test values with the expectation that 
ARL would be smallest for the one 
which most closely approximated the 
internal referent. 


Method 


Subjects.—Sixty male undergraduates were 
assigned at random to three groups of 20 Ss 
each. 

Procedure.—The adaptation series for 
these groups consisted of ten 200-cps, four 
235-cps, two 295-cps, two 330-cps, and two 
385-cps tones, presented randomly. The 
modal, mean, and midrange test tones were, 
respectively, 300, 355, 400 mels. 


Results 


Inspection of Fig. 3 indicates the 
shortest response latency to be asso- 
ciated with the mean pitch of the adap- 
tation series, with those for the modal 
and midrange values being somewhat 
longer. A simple analysis of variance 
indicated a significant difference among 
groups, F (2, 59) = 4.00, p < .025. 
A series of ¢ tests revealed ARL for 
the test tone at the mean to be reliably 
Shorter than that for the mode but 
not for the midrange—mean vs. mode: 
t (38) = 276, p < .01; mean vs. mid- 
range: t= 1.28, p «.20; mode vs. 
midrange: t= 2.14, p< .05. In the 
Hardesty and Bevan (1965) data, 
ARL for the mean groups was signifi- 
cantly shorter than for both the mode 
and midrange groups, which did not 
differ from each other. Thus, in the 
present experiment, the expectation 
that the mean was the best estimate 
of the subjective standard is not un- 
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A RESPONSE LATENCY IN MSEC. 
o 


300 355 400 
(MODE) (MEAN) (MIDRANGE) 
TEST PITCH IN MELS 


Fic. 3. Response decrement on the test 
trial when the test interval was equivalent to 
the mode, mean, or midrange of the adapta- 
tion series. 


equivocally confirmed. Why ARL for 
the mean group was not reliably 
shorter than that for the midrange 
group at present is unclear. While 
the midrange pitch lies slightly closer 
to the series mean value on the pitch 
scale than does the modal pitch, this 
greater proximity seems not sufficient 
reason for the present results. Mean- 
while, it is impressive to note that in 
this stimulus program, as was also the 
case in the Hardesty and Bevan experi- 
ment, the modal value was presented 
on half the trials, while the midrange 
occurred in only 10% of the trials, and 
the mean not at all. The fact that the 
mean is associated with the most effi- 
cient response points to the power of 
the pooling model. This model recog- 
nizes that the organism can use as a 
behavioral standard a derived value 
that need not be part of its immediate 
past experience. 


Discussion 


Two aspects of the present data are of 
special interest: the shape of the functions 
in Fig. 1 and 2 and the difference in am- 
plitude between these two curves. Both 
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appear to bear upon the question of what 
happens when S is confronted by signals 
that reside outside the population relevant 
for pooling. A suggestion by Swets 
(1963) that irrevelevant signals may be 
reclassified as relevant by a qualitative 
shift in response strategies is germane 
at this point. The S, when presented 
test stimuli widely discrepant from his 
series norm or adaptation level, may 
have changed the definition of the ex- 
perimental task from one of identifying 
pitches to one of detecting tones, in which 
event Fig. 1 and 2 would be expected 
to apex and then shift more or less 
abruptly back to the level of the control 
condition. If average pitch is the best 
estimate of what the test pitch will be, 
discrepancy between adaptation and test 
signals will reduce S's confidence in this 
Tesponse strategy. If response latency 
varies in curvilinear fashion with confi- 
dence level, a test signal at the series 
norm will yield a minimal ARL. As 
they differ in intermediate degree, confu- 
sion should be present and ARL will in- 
crease. When the threshold of relevance 
is exceeded, the task strategy will change 
to one of responding to tones and ARL 
will drop to approximately zero again, 
The return to base line, especially in 
Fig. 1, is seen to be gradual, a condition 
not predicted from the shift-in-strategies 
hypothesis. This, however, may be in- 
terpreted as an artifact of grouped data. 
One may expect reasonably that terminal 
thresholds of relevance for individual Ss 
will vary about the group estimates and 
thus that, as the difference between the 
test stimulus and these estimates in- 
creases, the number of Ss per group who 
Show RLs reliably greater than their 
individual series norms will first increase 
and then decrease, As this frequency in- 
creases, the average value of ARL grad- 
ually will increase; as it decreases, mean 
ARL will gradually descend. Data sug- 
gest this interpretation, The number of 
Ss in each group whose RL on the test 
trial exceeded by more than two SDs their 
series mean gradually increased, then de- 
creased: in Exp. I from 1 for the control 
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group to 11 for the +300-mel group to 
zero for the +1,000-mel group; in Exp. II 
from 0 for the control to 12 for the +900- 
mel group to 2 for the +1,100-mel group. 
The shift-in-strategies explanation of 
Fig. 1 and 2 is reminiscent of Cart- 
wright’s (1941) theoretical discussion 
linking decision time to differentiation of 
the phenomenal field. According to this 
view, uncertainty in the categorization 
of stimuli may be characterized as a con- 
flict of field forces, with conflict being 
assumed to result in lengthened response 
latency. Discriminable categories of 
stimuli are identified as differentiated 
regions of the field. As stimuli vary 
from the standards of class membership, 
they are conceived of as moving from the 
center toward the boundary of these dif- 
ferentiated regions. Conflict—and thus 
response time—is predicted to be maximal 
for stimuli that are located at the bound- 
ary. Further decreases in similarity 
place stimuli outside the criterion region 
and lead to a decrease in response time. 
That the amplitudes of ARL in Fig. 2 
markedly exceed those of Fig. 1 is a 
matter of some curiosity. One may have 
expected Ss exposed to the variable-pitch 
Series, with greater variability of re- 
sponse, to have displayed greater toler- 
ance of ambiguity in the face of the test 
signal and therefore smaller ARLs. In- 
Stead, test-signal discrepancy on a back- 
ground of varied experience (series sig- 
nal range: 200 mels) prompted greater 
confusion than when a change in pitch 


was abruptly introduced with the test 
trial. 
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MAGNITUDE ESTIMATION OF THE TASTE OF SODIUM 


CHLORIDE AFTER ADAPTATION TO 
SODIUM CHLORIDE* 


DONALD H. McBURNEY ? 


Brown University 


Following adaptation to 1 of 4 different concentrations of NaCl, Ss 
rated the subjective intensity, without regard to quality, of NaCl solu- 
tions ranging in concentration from HOH to 1.0 M. The minimum 
subjective intensity occurred at the adapting concentration (AC). 
The slope of the psychophysical function was steeper close to the AC, 
and this effect was greater with stronger ACs. Below the AC the 
test solutions had a sour-bitter taste which increased as concentration 
decreased, reaching a maximum at 0 concentration, ie. it was great- 
est for distilled HOH. With correction for adaptation, the results fit 


a power function. 


Gustatory adaptation has been recog- 
nized and studied for over 100 yr., but 
investigators seem to have overlooked 
the adapting influences of the normal 
buccal environment. The saliva which 
constantly bathes the tongue provides 
a number of gustatory stimuli to which 
adaptation must occur. For example, 
the ambient concentration of one of 
these salivary constituents, sodium, af- 
fects the detection threshold for its 
related gustatory stimulus, sodium 
chloride. Specifically, adaptation of 
the tongue to water produces a very 
low NaCl threshold. Low concentra- 
tions of adapting NaCl raise the detec- 
tion threshold above this minimum, to 
a value just greater than the adapting 
concentration (AC). A plot of the 
threshold obtained after the tongue has 
rested in the mouth against the salivary 
sodium level shows that the normal 
_ 1 This report is based on a thesis submitted 
in partial fulfillment of the requirements 
for the PhD degree, Brown University. I 
thank Carl Pfaffmann for generous guidance 
throughout the course of this work. Trygg 
Engen gave much valuable advice. These 
results were reported at the 1963 Meeting of 
the Psychonomic Society. This research was 
supported by a United States Public Health 


Service Predoctoral Fellowship. 
2 Now at the University of Tennessee. 


threshold can be included on the func- 
tion relating threshold to AC (McBur- 
ney & Pfaffmann, 1963). 

The detection, or absolute threshold, 
however, is only a single point on a 
psychophysical continuum, the point 
at which the system reaches its lower 
limit. The present experiments pre- 
sent data on changes in perceived inten- 
sity of NaCl solutions with adaptation 
to differing concentrations of NaCl. 


EXPERIMENT I 
Method 


Apparatus—All solutions were kept ina 
water bath at 34? C. controlled by a Jumo 
thermoregulator with a rated sensitivity 
+ 001° C. A Teel submersible pump kept 
the bath well stirred. The Ss were not able 
to detect any variation in temperature be- 
tween solutions, even though one of the Ss 
was highly trained in temperature discrimi- 
nation. A flow system presented both the 
adapting solutions and test stimuli by gravity 
from within the bath, except for the standard 
which was presented by pipette. A large 
polyethylene bottle containing the adapting 
solution communicated with one input of a 
four-way stopcock (See Fig. 1). This modi- 
fied three-way Teflon stopcock could divert 
the stimulus solution to flush any residue of 
the preceding stimulus from the system up to 
the stopcock. The exit from the stopcock 
formed the final common tube for both the 
stimuli and the adapting solution, hence the 
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ADAPTATION FLUSH STIMULUS 


d 


AS - Adapting Solution 


S -Stimulus 
FCT- Final Common Tube 
F -Flush 


Fic. 1. Diagram of the four-way stopcock. 
(The outlet indicated by “F” was drilled in 
the Teflon plug.) 


system could be completely flushed of any 
residue of the preceding stimulus solution 
except for the few millimeters of the stop- 
cock itself. The stimulus solution followed 
the adapting solution in the final common 
tube without interruption. An interface 
existed between stimulus and adapting solu- 
tions in this tube. 

Subjects.—Ten Ss were used in Exp. I. 
To minimize problems in following instruc- 
tions all Ss were familiar with psychophysi- 
cal methodology. Only two Ss were aware 
of the purpose of the study. 

Solutions.—All solutions were made of 
distilled water, reagent grade sodium chloride 
or commercial sucrose. The standard was 
4.2% (012 M) Sucrose, estimated at 5 gusts 
(Beebe-Center, 1949; Lewis, 1948). Su- 
crose is not a salivary constituent and hence 
should not be subject to varying adaptation 
in the experiment. Four adapting solutions 
were used: HOH, 0.001 M, 0.01 M, and 0.1 
M NaCl. Eight stimuli were used: HOH 
and 0.001 M to 1.0 M NaCl in 4 log steps. 

Presentation of stimuli—To avoid cross 
adaptation, the standard was presented when 
the tongue was adapted to saliva and before 
any of the other experimental solutions were 
given. The S extended his tongue comfort- 
ably between the closed lips so that nothing 
would flow into or out of the mouth. The 
Standard Was presented by pipette and then 
was rinsed off by the adapting solution, The 

retracted his tongue and rested for ap- 
Hbri tely 20 sec. This procedure was re- 
peated as many times as $ wished (usuall 
2-3). The standard was not Wow seii 
during the session. The S 
tongue for the remainder of the trials. 
solutions irrigated the entire exposed dorsal 
tongue; this provided control Over adapta- 
tion, area of stimulation, and Prevented saliva 
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from coming into contact with the tongue, 
and thus acting as an adapting solution. 
Preliminary adaptation proceeded for 2 min, 
The adapting solution was then interrupted; 
the next stimulus flowed through the alter- 
nate exit for 1 sec. to flush any remaining 
droplets of the previous stimulus and adapta- 
tion was resumed. Then the stimulus was 
presented immediately following the adapting 
solution and flowed for approximately 3 sec, 
When S had responded, adaptation was 
resumed for at least 20 sec. before the next 
stimulus, The S wrote his response on a 
recording sheet that masked the previous 
responses, 

Psychophysical method.—The intensity of 
the stimuli so presented were judged by the 
method of direct magnitude estimation, The 
instructions were typical of magnitude-esti- 
mation experiments with the exception that 
S was specifically instructed to ignore qual- 
ity and judge intensity only. 

Procedure.—Each of the 10 Ss judged all 
eight stimuli once under each of the four 
ACs. Two ACs were presented in each of 
two sessions. The HOH and 0.1-M ACs 
or the 0.001-M and 0.01-M ACs were pre- 
sented together in the same session because 
of physical restrictions. Within these limits, 
order of AC was varied among Ss. The 
stimuli were presented in a different random 
order for each S and AC. 


Results 

Arithmetic means of the magnitude 
estimates are plotted logarithmically in 
Fig. 2. Each curve corresponds to 
one AC. The geometric mean is 
ordinarily preferred, but would require 
the arbitrary assignment of some num- 
ber to zero ratings. This was tried, 
but the arithmetic mean gave more 
regular results ; at the higher intensities 
values of the two different means are 
almost identical. 
Discussion 

The curves reach a minimum at the 
AC as would be expected (Abrahams, 
Krakauer, & Dallenbach, 1937 ; Hahn, 
1934; McBurney & Pfaffmann, 1963), 
but they do not reach zero. It is possible 
that Ss responded to some other quality 
than salty, perhaps tactile, but Bartoshuk * 


3L. M. Bartoshuk, personal communica- 
tion, 1965. 


T—— 
a m 
-O ———— SR 


TASTE PERCEPTION AFTER ADAPTATION 


Log Magnitude Estimate 
o 


H,0 0.00! 0.003 0.0! 


o 


^ 
Molar Conc. NaCl! 


871 


003 O.I 03 1.0 
n 


Fic. 2. Estimated magnitude of sodium chloride solutions after sodium chloride adapta- 


tion, (Each curve is for one AC. 
symbol on the abscissa.) 


has found that if there is no interruption 
in the adapting solution, almost all stimuli 
the same as the adapting solution are 
called zero. The 1-sec. pause in the rinse 
may allow the adaptation state of the 
tongue to change enough to produce a 
taste. 

The change in slope of the psychophysi- 
cal function with adaptation is similar to 
what occurs in vision (Stevens & Stevens, 
1963). In both these and Stevens’ data 
the curves not only have a different inter- 
cept on the concentration axis, the slope 
alters as well. However, with taste the 
curves meet at the higher intensities, 
whereas in vision they do not meet in 
the usual range of values. This means 
that the effect of gustatory adaptation is 
primarily on the intensity of solutions 
close to the AC. If the size of the jnd 
for taste is smaller around the AC, then 
adaptation will influence the location of 
the point of maximum sensitivity of the 
system, as is true with vision and audi- 
tion (Hedgecock, 1955; Keidel, Keidel, & 
Wigand, 1961). 

The rise in subjective intensity below 
the AC is another demonstration that 
"sub-threshold" salt solutions have a 
definite taste (Bartoshuk, McBurney, & 


The AC is indicated by the location of the appropriate 


Pfaffmann, 1964) and intensity. The re- 
sponse to the solutions below the AC (sub- 
adapting) is largest for zero concentra- 
tion (ie, HOH) and decreases mono- 
tonically up to the AC and is stronger 
with higher ACs. 

The only curve that can be fitted with a 
power function over the whole supra- 
adapting stimulus range is the 0.001-M 
AC curve; the others are power functions 
only at the higher intensities. | Note 
especially the 0.01-M AC curve, which 
should be a power function according to 
the findings of Stevens (1961a, 1961b) 
and Ekman (1961), since it most closely 
approximates the normal adaptation state 
of the tongue to saliva (0.005-0.01 Eq/L 
Na). The part of the curve that is a 
power function yields a slope of .41 com- 
puted from the arithmetic means (fitted 
by eye) rather than the 1.3 reported by 
Stevens or the 1.59 found by Ekman. 

Two reasons can be suggested for the 
difference in slope. First, the slope is 
known to be sensitive to the range of 
stimuli employed. Stevens does not 
report such details of his experiment; 
Ekman's highest concentration was 72%, 
slightly stronger than 0.1M. The second 
reason for a difference in slope may be 
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Fic. 3. Data from Fig. 2 after S — Sy cor- 
rection. (Note that the subadapting data 
are not included.) 


that in the present experiment the method 
provided stimulation of only the anterior 
region of the tongue while the other in- 
vestigators used the sip method, which 
stimulates the whole mouth. Note, how- 
ever, that the anterior region of the 
tongue is the most sensitive for salt. 
Although the curves (except for the 
0.001-M AC curve) are not simple 
power functions, of the form R=kS*, 
they do conform reasonably to a power 
function when S—S, is plotted against R, 
where S, is the so-called (empirically 
determined) threshold correction (Fig. 
3). The slope is affected by the choice 
of the constant and vice versa. Since 
the 0.001-M AC was a power function 
over the whole range and the others ap- 
pear to fit that slope at the upper end, 
the slope of the 0.001-M AC curve was 
chosen as a starting point and the con- 
Stants derived from there. Different 
slopes would have yielded a better fit for 
certain of the curves. Whereas Stevens 
commonly refers to S, as the absolute 
threshold for the particular continuum, 
Ekman (1959) has found that for a num- 
ber of continua the value of S, must be 
negative, unsatisfactory for the notion of 
an absolute threshold, He has suggested 
instead a function of the form R= 
+S". Although these are not mathe- 
matically equivalent, they appear to 
handle certain data equally well. The 
constant a represents the Sübjective mag- 


nitude present when the stimulus is zero, 
indicating a residual "sensory noise.” 
The S—S, correction seems better in the 
present case since the one constant will 
handle both the differences in adaptation 
and in minima. The resultant function 
is: R=47(S—S,).41 with S, being 
minus 0.0042 M for HOH AC; 0 for 
0.001-M AC; 0.0093 M for 0.01-M AC 
and 0.099 M for 0.1-M AC. These func- 
tions hold from S — S, to 1.0 M. 

These results indicate that adaptation 
is an important variable in determining 
both the intensity and quality of salt 
solutions of moderate and even sub- 
threshold intensity. State of adaptation 
must be considered in any experiment 
employing salt solutions as stimuli. 


EXPERIMENT II 


Since the psychophysical function 
rose sharply above and below the AC 
for 0.01-M and 0.1-M ACs, the shape 
of the function for concentrations near 
the AC needed to be determined. 


Method 


The method was the same as Exp. I with 
the following exceptions: 

Apparatus—Another stopcock was added 
between the point of exit of the tube carrying 
the adapting solution from the bath and the 
four-way stopcock in order to admit air into 
the system and bleed the adapting solution 

from the distal tubing. In this manner, a 
separation of approximately 13 sec. was pro- 
duced between the adapting and test solu- 
tions; this made easier S's task of dis- 
criminating between AC and the stimulus. 

Solutions—The AC was 0.01-M NaCl, 

and the stimuli ranged from 0.003-M to 0.03- 
M NaCl in 55 log M steps. The standard, 
0.003-M NaCl, was used because it should 
fall about the middle of the intensity range. 
It should serve also to demonstrate that a 
subadapting solution is not tasteless, but 
has a definite taste that can be used as a 
modulus for magnitude estimates, 

Instructions—The instructions were the 
same as in Exp. I except that the reference 
to the standard being sweet was deleted. 

i Procedure.—The series was presented 10 
times, in a different random order each time, 
to the three Ss. The standard was presented 
after the AC rather than after salivary adap- 
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Fic. 4 Estimated magnitude of sodium 
chloride solutions after 01 M sodium chlo- 
ride adaptation. (Each curve is for a single 
S. Each value has been multiplied by a 
constant so that the distance between the 
ratings of .003 M and .01 M are the same 
as the difference between these points in 
Fig. 2 and the same for each S. The 0.01 M 
adapted curve from Fig. 2 is shown for 
reference.) 


tation as in Exp. I because there was only 
one AC and thus adaptation state was 
constant throughout the experiment. 


Results and Discussion 


The arithmetic means of the individ- 
ual Ss’ data were computed and plotted 
logarithmically in Fig. 4, together with 
the 0.01-M AC curve presented in 
Exp. II. For comparison, the ratings 
were multiplied by a constant so that 
the distance between the rating of 0.01 
M and the rating of 0.003 M is the 
same for each S and also equal to the 
distance between the ratings of these 
two solutions in Exp. I. 

The curves begin to rise very close 
to the AC; the stimulus 4-log step 
away from the AC is clearly judged 
stronger than the AC for both sub- and 
supraadapting stimuli. The concept of 
threshold does not apply in this situa- 
tion because intensity of “taste” rather 
than identification of @ particular taste 
was requested, resulting in nonzero in- 
tensities for all solutions. 

The curves are consistent with the 


001-M AC curve of Exp. L despite 
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the differences in range and spacing of 
stimuli in the two experiments. The 
regularity of the data obtained from 
magnitude estimates with individual Ss 
and relatively few judgments per stim- 
ulus, even with these differences in 
procedure, suggests the economy of 
the method of direct magnitude estima- 
tion in obtaining psychophysical func- 
tions. 
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EFFECT OF INSTRUCTIONS AND PERSPECTIVE-DRAWING 
ABILITY ON PERCEPTUAL CONSTANCIES AND 
GEOMETRICAL ILLUSIONS * 


JULIA A. CARLSON ? 
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To test the hypothesis that geometrical illusions are examples of mis- 
placed constancy scaling, 2 groups of 64 Ss, high and low in drawing 
skill, were administered size constancy, shape constancy, Miiller-Lyer 
illusion, and Sander Parallelogram illusion tasks under 2 sets of in- 
structions. No differences were found on the basis of skill. A differ- 
ence as a result of instructions and an Instruction X Skill interaction 
were obtained for size and shape constancy. There was no correlation 
of an illusion with a constancy. The above findings weaken the 
proposition that illusions reflect misplaced constancy scaling. 


Two classes of perceptual phenom- 
ena, constancies and illusions, both 
continue to be of interest to students 
of visual perception. However, at- 
tempts to determine relationships 
among such phenomena have been sur- 
prisingly rare. 

A number of recent papers (Gregory, 
1962, 1963; Segall, Campbell, & Her- 
skovits, 1965; Sonoda, 1961; Tausch, 
1954; von Holst, 1957) have sug- 
gested the possibility that these two 
classes of phenomena are based on the 
same inference habits, i.e., that illusions 
are examples of misplaced constancy 
scaling. They have pointed out the 
similarity of certain two-dimensional 
geometrical illusions to perspective 
drawings and have noted that the 
typically overestimated portions of 
these figures are those that would be 
depicted as farther away if interpreted 


as representations of three-dimensional 
objects. 


1This paper is based on a dissertati 
submitted in partial fulfillment of the had 
ments for the PhD degree. The writer is 
indebted to Marshall H. Segall for his help. 
The statistical analyses were done using the 
facilities of the University of Iowa Computer 
Center. 

?Now at Educational Testi it 
Princeton, New Jersey. M penes 


If constancies and illusions are based 
on the same inference habits, then one 
would expect to find laws which hold 
for both. For example, differences as 
a result of different instructions have 
been reported for the constancies ( Carl- 
son, 1960, 1962; Carlson & Tassone, 
1963; Gilinsky, 1955; Jenkin & Hy- 
man, 1959; Singer, 1952). If S is 
instructed to report how the stimuli 
"look" to him, judgments are closer 
to visual angle than if S is asked to 
estimate the actual properties of the 
stimuli. If illusions are examples of 
misplaced constancy scaling, then com- 
parable differences as a result of in- 
structions should occur, only in the 
reverse direction. If S is asked to 
estimate the actual properties of an il- 
lusion stimulus, he should estimate 
visual angle, since all parts are known 
to be in the same plane. If asked to 
tell how the stimuli look, he might 
apply constancy scaling as though the 
illusion figures were two-dimensional 
Tepresentations of solid objects, and 
his responses would exhibit greater 
departure from visual angle. 

Also, one might expect to find dif- 
ferences in response to constancy and 
illusion tasks as a function of S's 
ability to draw. To draw in perspective 
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necessitates learning to discount the 
phenomenal appearance of objects to 
reproduce them as they are represented 
on the retina of the eye. This process 
reinforces habits conducive to a visual- 
angle response to constancies and a 
veridical or nonillusion response to the 
illusions. One would, thus, expect that 
Ss more skilled in drawing in perspec- 
tive would give responses closer to 
visual angle for both constancies and 
illusions. 

It is also reasonable to predict a 
greater difference between instructions 
for skilled Ss than for unskilled Ss 
since drawing skill might be related to 
a superior ability to shift sets on de- 
mand. On the other hand, it might be 
argued that the skilled Ss habitually 
disregard depth cues and tend to do so 
regardless of instructions, while the 
unskilled Ss, unpracticed in disregard- 
ing depth cues, do so only when specif- 
ically so instructed. In either case, one 
group would be more responsive to 
instructions than the other. 

Finally, if constancies and illusions 
are based on the same inference habits, 
they should correlate positively with 
each other, particularly when the re- 
spective tasks are administered under 


functionally equivalent instructions. 


METHOD 


In the present study the performance of 
two groups of Ss, one skilled and one un- 
skilled in perspective drawing, was com- 
pared on four tasks (size constancy, shape 
constancy, Müller-Lyer illusion, and Sander 
Parallelogram illusion) under two sets of 
instructions. 


Selection of Ss 


At the beginning of the semester, a per- 
spective-evaluation test was administered to 
the lecture sections of the introductory 
course in psychology at the University of 
Iowa. Six items were adapted from the 
Lewerenz (1927) Tests in Fundamental 
Abilities of Visual Art and the Knauber 
(1935) Art Ability Test. The Ss who made 


scores in approximately the upper and lower 
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25% of the distribution were considered 
eligible for the perspective-drawing test 
administered individually later in the 
semester. 

The perspective-drawing test was ad- 
ministered to 207 Ss. The Ss were asked to 
draw three objects so that their drawings 
would match, as closely as possible, a photo- 
graph of the objects taken from the same 
angle at which they viewed them. Viewed 
through a fixed monocular eye piece, the 
three objects, a cube, a triangle, and a 
rectangular solid, each 1 cu. in. in volume 
and orange in color, were mounted in a box 
lined with black cloth so that the contours 
of the interior of the box were not evident. 
The interior of the box was lighted with a 
150-w. bulb placed in the front left-hand 
corner of the box. 

As a basis for scoring S's drawing, a 
photograph of the objects was taken through 
the eye piece. The S's score was the sum 
of the absolute value of the difference be- 
tween six angles in his drawing and in the 
photograph. One-hundred and twenty-eight 
Ss, 64 from the upper one-third and 64 
from the lower one-third of the score dis- 
tributions on the perspective drawing test, 
were selected for the experiment. 


Apparatus 


All tasks were administered in a gray, 
windowless room 20 ft, 10 in. long, X 8 ft. 
wide X 9 ft, 
placed 5 ft., 9 in. from the front wall, was 
drawn between trials to conceal the equip- 
ment not in use at the time. The S, seated 
in a chair, faced the curtain with his eye 
level 45 in. from the floor and viewed the 
stimuli binocularly, his head secured by a 
chin rest. For all tasks the standard was 
placed on the side of S’s preferred eye, 
which had been determined by noting the eye 
used to view the objects in the perspective 
drawing test. 


of the triangle was positioned 40 cm. above 
the base of the background. 

The variable was similarly mounted against 
a flat black background 55 cm. X 80 cm. with 
a bottom margin of 40 cm. It consisted of a 
white equilateral triangle which was raised 
or lowered into a slot, thereby varying the 
altitude of the visible portion of the tri- 
angle. 


The altitude could vary from 0 to 
30 cm. 
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The standard was placed 18 ft. from S’s 
eyes. The variable was 7.5 ft. from S's 
eyes. The standard and variable were 
placed so that the inner edges of the black 
backgrounds were seen by S to overlap. 
The outer edge of the background on which 
the standard was mounted was obscured by 
the curtain. 

Shape constancy.—The standard consisted 
of an equilateral triangle 12 cm. in altitude, 
fashioned of white metal strips 5 mm. wide, 
mounted on a flat black background 55 cm. 
wide and 80 cm. high. It was slanted 60° 
back from vertical. 

The variable stimulus was similar to that 
described by Gottheil and Bitterman (1951). 
A triangle, fashioned of three white metal 
strips 5 mm. wide, appeared against a flat 
black background 55 cm. wide and 80 cm. 
high. The base of the triangle was fixed and 
measured 13.64 cm. The altitude of the 
triangle could be varied so that in appearance 
it resembled a tilted triangle. 

The S was seated 10 ft. away from 
standard and variable, which were placed 
side by side on a table. The standard was 
set on a platform so that the base of the 
standard triangle was on the same level as 
that of the variable. Standard and variable 
were set so that S could see both triangles 
clearly but could see neither outer edge of 
the black backgrounds on which they were 
mounted, 

Sander Parallelogram.—The Sander Par- 
allelogram appeared against a flat black 
background 70 cm. wide and 48 cm. high. 
It was formed of white metal strips 5 mm, 
wide. The smaller angle of the parallelogram 
was 55°, One-half of the parallelogram 
(the portion which formed the larger par- 

llelogram) was fixed in position. The 
diagonal of this portion, the standard, was 
12 cm. in length. The other half of the 
Parallelogram was mounted on a slide set 
back slightly from the front of the apparatus 
so that its length could be varied from 0 to 
17 cm. The variable diagonal, thus, could 
be varied from 0 to 24 cm. 

The S was seated 10 ft. away from the 
apparatus. Two similar units were em- 
D One had the movable portion on the 

t; 


Müller-Lyer illusion—The Müller-Lyer 
figure appeared in white against a flat black 
background 39 cm. wide and 25 cm. high. 
The obliques were 44 mm. long and formed 
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a 50* angle with the horizontal line. The 
segment with the outward-extending obliques, 
the standard, was fixed at 12 cm. long. The 
variable, the segment with the inward- 
extending obliques, moved in a slot and could 
be varied from 4 to 18 cm. 

The S was seated 10 ft. away from the 
apparatus. A base was fastened along both 
upper and lower edges of the apparatus so 
that the standard could be presented either 
on the right or on the left. The curtain was 
drawn so that $ could see the whole il- 
lusion figure but could see neither outer 
edge of the black background on which it 
was mounted. 


Procedure 


Each S received the four tasks in the 
following order: size constancy, Sander 
Parallelogram, shape constancy, Müller- 
Lyer. The S received six trials on each 
task under each of two instructions (a total 
Of 48 trials in all), using the method of 
limits. The first two trials of each task 
were practice trials, one ascending, one de- 
scending. Of the remaining four trials, two 
Were ascending and two descending. Half 
of the Ss received an ascending trial first 
and half a descending trial. The series of 
four tasks with six trials per task was 
repeated for each instruction condition, with 
all four tasks under a particular instruction 
completed before the four tasks under the 
Other instruction were administered. One- 
half the Ss received the "objective" instruc- 
tions first and the other half the "apparent" 
Instructions first. The instructions were as 
follows, with specific wording adapted for the 
specific task : 


Objective instructions, 

This is what you are to do this time: 
I will adjust the (shape, size, length) of 
ahis ssr, (the variable), and you are to 
tell me when it is the same (shape, size, 
length) as this other , . - (the standard). 
I want you to imagine picking up the one 
: - Und placing it on the other and to 
tell me when this , - is such as they 
would match exactly so that if each were 
measured with a (protractor, ruler) the 
measurements would be the same, Say 
stop” when you think the . . . actually are 
the same, and don’t hesitate to change a 
setting if you are not satisfied with it. 
Apparent instructions. 

This is. what you are to do this time: 
I will adjust the (shape, size, length) of 

+++ (the variable), and you are to 
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TABLE 1 TABLE 2 
Ms 1N MILLIMETERS AND SDs OF Ms IN MILLIMETERS AND SDs OF 
THE SETTINGS OF THE SIZE- THE SETTINGS OF THE SHAPE- 
CONSTANCY VARIABLE CONSTANCY VARIABLE 
First Second First Second 
Order of Order of 
Instructions Instructions 
SD M SD SD M SD 
High Skilled High Skilled 4 
bjective | 152.59 | 25.69 168.25 | 29.70 Objective | 106.06 | 23.92 111.75 | 22.50 
Apparent | 127.91 | 24.29 116.75 | 17.39 Apparent | 83.13 | 15.63 83.00 | 13.21 
Both 140.25 | 27.88 | 142.50 | 35.43 Both 94.59 | 23.24 | 97.38 | 23.39 
Low Skilled Low Skilled 
Objective | 145.25 | 23.90 162.34 | 21.81 Objective | 104.19 | 18.09 107.06 | 16.58 
Apparent | 131.31 | 20.96 | 123.91 | 19.81 Apparent | 89.75 | 15.87 | 88.41 | 15.99 
Both 138.28 | 23.54 | 143.13 | 28.35 Both 96.97 | 18.49 | 97.73 | 18.77 


M 
Note.—Veridical or “perfect constancy'' score = 134. 
N = 128, equally divided between skills and instruc- 
tional sequences. 
tell me when it looks the same in apparent 
(shape, size, length) as this other . . . 
(the standard). I don't care whether it 
actually is the same (shape, size, length). 
I just want to know how it appears. Say 
"stop" when you think the . . . look the 
same, and do not hesitate to change a set- 
ting if you are not satisfied with it. 
| 
| 


RESULTS 


Sige constancy. —Table 1 shows the 
mean settings of the variable for all 
Ss. The Instruction X Skill interaction, 
indicative of greater responsiveness to 
instructions on the part of the skilled 
Ss was significant at the .05 level. 
Over both skill groups, the difference 
between objective and apparent in- 
structions was significant beyond the 
01 level. The difference between 
means of the high-skilled vs. the low- 
skilled Ss was nonsignificant. Analysis 
of variance showed the sequence effect 
to be significant beyond the .01 level. 
When data from the first-administered 
instruction were analyzed separately, 
the Instruction X Skill interaction be- 
came nonsignificant. 

Shape constancy.—The mean settings 
for all Ss are shown in Table 2. The 
Instruction X Skill interaction was sig- 
nificant at the .01 level with the high- 


Note.—Veridical or “perfect constancy" score = 120. 
N = 128, equally divided between skills and instruc- 
tional sequences. 


skilled Ss more responsive to the in- 
structions than the low-skilled Ss. The 
difference in means for objective vs. 
apparent instructions over both skill 
groups and orders of administration 
was significant beyond the .01 level. 
Over both orders and instructions, the 
difference between means for high as 
opposed to the low-skilled Ss was not 
significant. The sequence effect was 
not significant. 

Sander Parallelogram.—The mean 
settings of the variable for all Ss are 


TABLE 3 


Ms 1N MILLIMETERS AND SDs OF 
THE SETTINGS OF THE SANDER- 
PARALLELOGRAM VARIABLE 


———————— 


First Second 
Order of 
Instructions 
SD M SD 
High Skilled 
Objective 156.09 | 18.61 | 150.75 | 21.17 
Apparent | 152.38 19.48 | 159.81 | 19.15 
Both 154.23 | 19.14 | 155.19 | 20.67 
Low Skilled 
Objective | 159.44 14.75 | 148.78 | 16.10 
Apparent 153.53 | 12.01 | 158.81 | 15.78 
Both 156.48 | 13.77 | 153.80 | 16.71 


Note—Veridical or  "nonillusion" «score = 120. 
N = 128, equally divided between skills and instruc- 
tional sequences. 
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TABLE 4 


Ms 1N MILLIMETERS AND SDs OF THE 
SETTINGS OF THE MULLER-LYER 
ILLUSION VARIABLE 


First Second 
Order of 
Instructions 
M SD M SD 
High Skilled 
bjective | 137.25 | 8.47 | 133.78 | 8.95 
Apparent | 138.28 | 7.36 | 144.91 | 8.86 
Both 137.77 | 7.95 | 139.34 | 10.50 
Low Skilled 
Objective | 138.69 | 10.13 | 135.09 | 12.39] 
Apparent | 138.25 | 7.83 | 143.91 | 9.01 
Both 138.47 | 9.05 | 139.50 | 11.69 


Note.—Veridical or ''nonillusion" score = 120, 
N = 128, equally divided between skills and instruc- 
tional sequence, 


shown in Table 3. The interaction of 
Instruction X Skill was not significant. 
Over both skill groups, the difference 
between means for the objective in- 
struction as opposed to the apparent in- 
struction was not significant, nor was 
the difference between means for the 
high-skilled and low-skilled Ss, The 
Sequence effect was significant at the 
05 level. 

Müller-Lyer illusion.— Table 4 shows 
the means for all Ss. The Instruction 
X Skill interaction was not significant, 
nor was the difference between means 
for the two skill groups. The differ- 
ence between means for the objective 
Vs. the apparent instructions was sig- 
nificant at the .01 level. The sequence 
effect was significant beyond the .01 
level. When the data from the first- 
administered instruction were con- 
sidered separately, the difference be- 
tween instructions was not significant. 

Correlations between tasks—Com- 
bining the data for both sequences, sig- 
nificant correlations occurred for size 
and shape constancy (.30 under ob- 
jective instructions and .36 under ap- 
parent instructions). There was no 
correlation between the two illusions, 
between any illusion and any constancy, 
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nor between different instructions ex- 
cept in the case of the correlations of a 
task under one instruction with the 
same task under the contrasting in- 
struction. These correlations were .26 
for size constancy, .50 for shape con- 
stancy, .65 for the Sander Parallelo- 
gram, and .36 for the Miiller-Lyer 
illusion. 


Discussion 


The assumptions regarding the rela- 
tionship of perceptual constancy and abil- 
ity to draw were, in general, supported 
in this study, but the extension of these 
assumptions to cover illusions was not 
strongly supported. Although the pre- 
diction regarding differences in mean 
judgments for high-skilled as opposed to 
low-skilled Ss was not confirmed for any 
of the tasks, group differences with re- 
gard to instructions were found for the 
Constancies, The data clearly favor the 
interpretation that the highly skilled Ss 
are better able to change sets on com- 
mand. Unfortunately, in view of the 
effort made to interpret illusions as 
misplaced constancy scaling, no signifi- 
cant Instruction X Skill interactions oc- 
curred on either illusion task. Thus, it 
appears that those who draw well are 
better able to Suppress or ignore con- 
stancy producing cues, but that this flexi- 
bility is not relevant to performance on 
the Sander Parallelogram and Müller- 
Lyer illusions. 

The prediction for the constancies that, 
under objective as compared to apparent 
instructions, the distant triangle will be 
estimated as larger and the tilted triangle 
estimated as taller was upheld, Com- 
parable findings have frequently been re- 
ported in the literature. However, the 
prediction that Objective instructions 
would produce a smaller illusion effect 
Was supported only for the Müller-Lyer 
illusion over both sequences. For the 
Sander Parallelogram and for the first- 
instruction data of the Miiller-Lyer illu- 
Sion, no significant differences between 
instructions were found. This may re- 
flect the fact that, in the illusion situa- 
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standard and variable are in 
Regardless of the in- 
struction given, it may be difficult for S 
to conceive of his task as anything but 
comparison of actual line length. The 
constancy situation, with standard and 
variable in different planes, is more con- 
ducive to a differentiation of these two 
instructions. 

That objective vs. apparent instructions 
are relevant to the illusion situations is 


tion, both 
the same plane. 


implied by the sequence effects which 
were found for three of the four tasks 


employed. For the two constancy tasks, 
the apparent-objective sequence of in- 
structions led to higher scores than the 
objective-apparent sequence. For both 
illusions, the sequence effects, both sig- 
nificant, were in the opposite direction. 
The fact that opposite sequence effects 
were found for the two classes of tasks 
may relate to the argument that apparent 
instructions for one class of tasks are 
functionally equivalent to objective in- 
structions for the other class of tasks and 
vice versa. Thus, the sequence effects 
found were actually uniform, provided one 
considers this postulated functional equiv- 
alence. Thus when the second instruction 
happens to be the one which produces 
larger scores, the fact that it comes 
second, and can be contrasted with a prior 
instruction, enhances its impact. Simi- 
larly, the instruction which produces 
smaller scores produces even smaller 
scores when it comes second than when it 
comes first. A differentiation of instruc- 
tions in the illusions does not, however, 
necessarily imply that illusions are con- 
stancy phenomena. The objective-ap- 
parent distinction may arise from other 
sources. 

The lack of correlation between con- 
stancies and illusions casts doubt on the 
interpretation of illusions as examples of 
misplaced constancy scaling. At least, if 
there is constancy scaling applied in the 
case of the illusions, there is no consistent 
tendency for Ss who apply a great deal of 
constancy scaling in the three-dimensional 
situation to apply itin the two-dimensional 
situation. And, if the Sander Parallelo- 
gram and Müller-Lyer illusions are in- 
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deed related to a single factor, this rela- 
tionship did not show up in a correlation 
between the two illusion tasks. In sum, 
the proposition that illusions reflect mis- 
placed constancy scaling was not given 
support by the data gathered in this study. 
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EFFECT OF RATE OF CHANGE IN PHYSICAL INTENSITY 
ON BISECTION AND FRACTIONATION JUDGMENTS 
OF BRIGHTNESS * 


ROBERT F. FAGOT, PAUL R. ESKILDSEN,? anp MANARD R. STEWART 
University of Oregon 


8 Ss made bisection and fractionation judgments of brightness under 
4 rates of change in luminance, 4 of the Ss bisecting Ist and the other 
4 fractionating Ist. The effect of rate of change was significant for 
both bisection and fractionation. The data indicated certain advantages 
of bisection compared to fractionation: Ist, fractionation judgments 
were influenced by prior experience with bisection (significant order 
effect), but bisection judgments were relatively independent of prior 
exposure to fractionation. Second, the variability of judgments was 
smaller for bisection than for fractionation. 


Several studies (Canter & Hirsch, 
1955; Parducci, 1956a, 1956b; Stevens 
& Galanter, 1957) have shown that 
category scales are not independent of 
the spacing or frequency of presenta- 
tion of the discrete stimuli used in such 
experiments, In addition, Pradhan and 
Hoffman (1963) obtained results show- 
ing that for lifted weights, magnitude 
estimates are dependent upon both the 
spacing and range of stimuli. For con- 
tinuous variation of stimuli where the 
method of adjustment or average error 
is used, there may be an analogous 
biasing effect; i.e., the dependent vari- 
ables in such experiments may be de- 
pendent on the rate of change in physi- 
cal intensity. Bisection and fractiona- 
tion are examples of scaling methods 
which require continuous variation of 
stimuli. Of considerable importance 
for bisection is the fact that the axiom 
systems for bisection structures assert 
the existence of a unique element which 
is interpreted as the bisecting stimulus. 
(See, e.g., the discussion by Luce & 
Galanter [1963, p. 259] of Pfanzagl's 
[1959] axioms for bisection Structures, 
in particular Axiom 1 on existence.) 


1This research was supported by Na- 
tional Science Foundation Grant G19210. 


? Now at San Francisco State College. 


If judgments of the bisecting stimulus 
are not independent of rate of change 
in physical intensity, then in general 
such a unique element will not exist. 
Clearly, the utility of bisection and 
fractionation scales is severely restricted 
if independence of rate of change can- 
not be established. 

Although continuous variation of 
stimuli has been employed in several 
studies (Gage, 1934; Gregg, 1951; 
Hanes, 19493, 1949b; Stevens & 
Galanter, 1957; Stevens & Poulton, 
1956), and Torgerson (1958; pp. 103- 
104) discussed the problem of rate of 
change, there have been no studies re- 
ported in which rate of change has been 
treated as an independent variable. Our 
general purpose was to study the effect 
of rate of change in physical intensity of 
light on brightness scaling for bisection 
and fractionation. A special apparatus 
was used which controlled for variation 
of intensity with time. As in the study 
by Pradhan and Hoffman (1963) we 
analyzed the data for individual Ss 
separately. 

METHOD 


Subjects—The Ss were two male graduate 
students, not in psychology, and six females, 
wives of university students. All Ss were 
naive with respect to fractionation, but one 
had had previous experience with bisection. 
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detail elsewhere (Eskildsen, 1963), provided 
two (for fractionation) or three (for bisec- 
tion) luminous white circular targets 13 mm. 
in diameter, place horizontally and separated of 
by 90 mm., center to center. The luminance 
(9) was continuously variable in each of the 
targets from .1 to 580 ftl. For fractionation, 
one target (to S’s right) was held constant 
in ġ as a standard, while S could vary the ¢ 
of the other target continuously over the 
entire stimulus range by means of a control 
switch. 
placed to the left of the variable target. The 
S's control switch activated a cam mechanism 
which varied the $ of the target at a rate 
which depended on the shape of the par- 
ticular cam being used. Variously shaped 
cams were available which provided means 
for controlling the functional relationship be- th 
tween $ and time. If the psychophysical law 
is assumed to be a power function of 
y — ag* +b, where y is psychological mag- 
nitude, a cam can be construct: 
= (1/a) (ct-b)™, where ¢ denotes time. 
For an assumed log law, ¢ = exp [Cet-b) /al 
is the appropriate stimulus-time relationship. 
Thus for any S whose psychophysical func- lo 
tion is a power law or a log law, a cam can 
always be constructed by the above pro- 
cedure such that psychological magnitude 

(V) is linear with time; ie, y= ct. 


BISECTION AND FRACTIONATION 
Apparatus.—The apparatus, described in 


In bisection, à third target was 


the form 


ed such that 


Four cams were constructed, one matching 


an assumed log function, and three matching 
assumed power functions with exponents 

0.5, 10, and L5. 
controls the rate of change will be denoted 
the "cam exponent" The log cam was as- 
signed cam exponent 0.0, since the power 
function converges to a log function as the 
et approaches zero (Fagot, 1963, P- 


The exponent k which 


Procedure—All Ss were tested under each 


of the four cam (rate of change) conditions, 
and by each scaling method (bisection and 


and the other four fractionated first. The Ss 
were assigned at random to one of the two 
orders, except that one S (who had had 
prior bisection experience) was assigned to 
the bisection-first group. In order to control 
for possible practice effects, each session was 
broken up into four blocks, each cam being 
used four times in succession in each block 
for a total of 16 observations per 
observations per cam, and 
per session. Each session lasted above 1 hr., 
and was repeated for 3 successive days. 

the three sessions, there were a 


block, 16 
64 observations 


total of 12 
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blocks, and hence 12 different sequences of 
cams, These 12 sequences were selected at 
random (without replacement) from the 41 
possible sequences of four cams. Under each 


the 64 conditions (two methods, four cams, 


eight Ss) there were a total of 48 observa- 
tions (recorded in footlamberts). 


A brief dark adaptation period of 10-15 


min. preceded each experimental session. The 
Ss were seated at a distance of 12 ft. from 
the targets in an ordinary classroom desk 
seat which had a control switch and button 
attached. When S had finished making his 


closed a shutter over the targets and sig- 
naled E that the observation was completed. 
After recording, E reset the cam device at a 
previously determined starting point and 
opened the shutter which signaled S to make 


e next observation. The starting point for 


each observation was chosen at random from 
30 points equally spaced on the time interval 


from maximum to minimum stimulus in- 
tensity. 


The terminal stimuli for bisection were 2 


and 580 ftl. The standard stimulus for frac- 
tionation was 580 ftl. For bisection, the fol- 


wing instructions were given: 


You will be presented with three lights. 
The light on the left will be somewhat 
dimmer than the light on the right. The 
light in the middle will be of a brightness 
somewhere in between the right and left 
hand lights. Your task will be to adjust 
the brightness of the middle light such that 
its brightness is halfway in distance be- 
tween the bright and dim lights. In other 
words, the interval of brightness between 
the dim and middle light should be equal 
to the interval of brightness between the 
middle and bright light. 


For fractionation the following instruc- 
tions were given: 

You will be presented with two lights. 
The light on the right will be of a fixed, 
constant brightness, while the light on the 
left will be variable in brightness. Your 
task will be to adjust the brightness of the 
left hand light such that its brightness is 
one-half that of the right hand light. 


All Ss were given the following further 
instructions : 

There are certain things which you 
should be cat ioned against doing. Do not 
make a judgment on the basis of the size 
of the lights (they tend to appear larger 
or smaller to some people as the intensity 
changes) nor on the distance of the lights 
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TABLE 1 
G AND k ror Each METHOD, CAM, AND S TÉ MS 
Order I (Bisection First) 
Bisection (BF) Fractionation (BF) 
S: 
: Cam Exponent Cam Exponent 
0.0 0.5 1.0 15 M 0.0 0.5 1.0 15 | M 
Ba 35.084 | 34.009 | 36.233 | 35.737 | 35.266 | 19.477 | 20.154 | 19.472 | 22.761 | 20.466 
k :000 —.000 015 012 .007 204 .206 .204 219 208 
JA 32 | 22.435 
-172 | 40.932 | 42.119 | 41.712 | 40.234 | 24.153 | 23.648 | 20.807 | 21.132 22435 
E set 046 053 051 041 218 217 .208 .209 3213 
OF 
9.980 | 52.059 | 52.524 | 54.712 | 52.320 | 5.276] 5201] 6.4 17| 7.549] 6.111 
f 5 .097 107 110 420 .108 447 147 «154 .160 AS 
SL 
G 55.170 | 60.567 | 58.612 | 59.426 | 58.444 | 4.865 4.131 | 4.847] 4.727] 4.643 
k 1122 447 139 142 .138 4445 .140 145 144 444 
M 
G 44,102 | 46.892 | 47.370 | 47.897 | 46.565 13.443 | 13.284 | 12.886 | 14.042 er 
k .059 075 .079 .081 .074 79 178 178 183 180 
Order II (Fractionation First) 
Bisection (FB) Fractionation (FB) 
GW 
G 28.826 | 37.611 | 35.391 | 34.524 34.008 | 44.661 | 58.697 | 59.145 | 67.370 | 57.168 
i —.042 025 015 .003 .000 270 303 .304 .322 .300 
J 
G 64.723 | 68.593 | 74.388 | 82.052 72.439 | 81.154 | 79.204 82.020 | 86.818 | 82.322 
E 166 4182 .205 234 497 .352 .348 .354 .365 355 
G 31.435 | 31.899 | 34.486 | 38.512 34.083 | 40.522 | 43.034 50.610 | 52.513 | 46.670 
^ % = 020] —.016| .003 | 1031 | —000| 260 266] .284| .289| 275 
G 25.845 | 25.590 | 25.282 24.414 | 25.283 | 19.682 19.666 | 21.722 | 19.618 | 20.172 
d —069 | —.072 | —.075 | —.084 | —‘o75 205} 205 2 .205| .207 
G 37.107 | 40.923 | 42.387 44,876 | 41.473 | 46.505 50.173 | 53.374 | 56.580 | 51.658 
k .009 030 .037 046 031 272 .281 .288 .295 .284 


Note. 


—The four groups are denoted BF, BF, FB,'and FB, 


(they tend to appear further away or 
Closer to some peop! 


changes) nor on the 


le as the intensity 


as defined implicitly in?the table. 


light away from that point, even if you 
move it right back to the same place. You 


basis of time. What 
I mean by that is, it will take the light a 
certain amount of time to go from the start- 
Ing point to the point where you want it 
to be. Since we will start the light at a 
different place for each judgment, you will 
just increase your error if you take into 
consideration how long it is taking. If by 
chance the light is started On a point which 
you think is just right, you must move the 


Will not be limited in the amount of time 
you take to make an adjustment or the 
number of times you increase or decrease 
the intensity of the light. 


RESULTS 


The data were analyzed for each scal- 
ing method Separately by means of a 
factorial design with Cams and Orders 


| 
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TABLE 2 


ANALYSIS OF VARIANCE Summary TABLE 
FOR BISECTION 


Source df MS F 
Between Ss 7 
Orders 1 207.494 <i 
Ss 6 1114.751 128.383 
Within 5s 24 
Cams 3 42.819 4.931* 
(Linear) 1 121.175 13.955** 
(Beyond Linear) 2 3.642 <i 
Orders X Cams 3 4.504 <1 
(Linear) 1 12.311 1.418 
(Beyond Linear) 2 601 «1 
Cams X Ss 18 8.683 


*p «.05. 
++p «.01. 
wk py < 001. 


completely crossed, and Ss nested 
within Orders. 

The dependent variables for bisec- 
tion was the bisecting stimulus ; and for 
fractionation, the ratio stimulus, ie., 
the stimulus judged by S to be one-half 
the brightness of the standard. The 
bisecting stimulus and the ratio stimu- 
lus were estimated by the geometric 
mean (G) of the 48 observations in the 
appropriate cell. The scaling method 
was not included as a factor in a larger 
factorial design, since the task in each 
method was dissimilar, and hence a 
comparison of bisecting and ratio stim- 
uli is not meaningful. Use of the ex- 
ponent as the dependent variable would 
have made the inclusion of Methods in 


TABLE 3 


ANALYSIS OF VARIANCE SUMMARY TABLE 
FOR FRACTIONATION 


Source af MS F 
pan Pt rn ad 
Between Ss 7 
Orders i | 11701.366 | 7-795% 
Ss & | 1501.081 | 138.745*** 
Within Ss 24 
Cams 3 40719 | 3.764% 
(Linear) 1 120.392 | 11.128** 
(Beyond Linear) 2 .883 «1 
Orders X Cams 3 34.768 3.214* 
(inset) 1 101.735 |  9.403** 
Beyond Linear | 2 1.284 «1 
Cams X Ss 18 10.819 
* p «.05. 
Mp <01 
"ep « 00! 


G ( foot- lamberts) 
m 
o 


0.0 0.5 10 15 
CAM EXPONENT (k) 


Fic. 1. Bisecting and ratio stimuli as a 
function of cam exponent. 


the factorial design meaningful, but the 
Cams effect would then have been con- 
founded with the power law assump- 
tion. A mixed model was assumed, 
with Cams and Orders fixed effects, 
and Ss a random effect. 

All observations were recorded in 
footlamberts. Table 1 gives the geo- 
metric mean (G) and the exponent (k) 
of an assumed power law for each of 
the 64 conditions. The estimates in 
each of the 64 cells were based on 48 
observations. The exponents were 
estimated by the method outlined by 
Fagot (1963). An iterative solution 
to the equation 


G = [ie re LU 


where $z = 2 ftl, ox = 580 ftl., and 
G is the geometric mean of the 48 ob- 
servations under the given condition, 
gave estimates of the exponents k for 
bisection. For fractionation, the ex- 
ponent under each condition was esti- 
mated directly from the following equa- 
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BF 
- F 

FB 
40 

FB 
30 BF 


8 


COEFFICIENT OF VARIATION (v) 
S 


00 05 LO [E] 
CAM EXPONENT (k) 


Fic. 2. Coefficient of variation as a function 
of cam exponent. 


tion: 


X log 4 
^ig(G e) 7 


where $y = 580 ftl. was the standard 
stimulus. 

Tables 2 and 3 give analysis of vari- 
ance summary tables for bisection and 
fractionation, respectively, with G as 
dependent variable. For bisection, Ss 
and Cams were significant, but not 
Orders nor the Ox C interaction. 
However, for fractionation, all effects 
were significant (the C x S interaction, 
of course, could not be tested with this 
design). Figure 1 shows the bisecting 
stimulus and the ratio stimulus as a 
function of the cam exponent, for each 
order averaged over Ss within order. 
Analysis by orthogonal polynomials in- 
dicated that for both bisection and frac- 


3 Use of k as dependent variable in place 
of G gives exactly the same significance re- 
Sults reported in Tables 2 and 3 (ie, the 
same factors are significant and at the same 
levels). 


tionation, the overall trend was not sig- 
nificant beyond the linear component. 
However, for fractionation, the O x C 
interaction is significant, and only the 
fractionation-first group showed a linear 
trend. 

To compare the two scaling methods 
with respect to variability of observa- 
tions, the coefficient of variation (V) 
was used as the dependent variable. 
This choice was made, rather than the 
standard deviation, to avoid possible 
dependence of the measure of variation 
on stimulus magnitude, since the bi- 
secting stimulus and ratio stimulus dif- 
fered appreciably in magnitude. Figure 
2 shows a plot of V as a function of 
cam exponent for bisection and frac- 
tionation, for each order averaged over 
Ss within order. The graph indicates 
that, independent of order and cam, bi- 
section judgments tend to be less vari- 
able than fractionation judgments. 


Discussion 


There are several points of interest re- 
garding the set of exponents shown in 
Table 1. Note that for bisection some of 
the exponents are negative. As Fagot 
(1963, p. 147-148) has shown, the power 
function (with an additive constant) con- 
verges to a log function as the exponent 
approaches zero, A negative exponent 
indicates that the bisecting stimulus is 

Ow the geometric mean and the psy- 
chophysical function is even more nega- 
tively accelerated than a log function. 
One of the Ss (SD) had a negative ex- 
ponent under all four conditions, and 
three other Ss (BA, GW, and KN) had 
negative exponents under at least one 
condition. The mean exponent for the 
eight Ss for bisection was .05, and the 
data imply that the individual brightness 
functions for three Ss (BA, GW, and 
KN) are approximately logarithmic. 
Note also that for every S except SL the 
bisection exponent is considerably lower 
than the fractionation exponent, fairly 
conclusive evidence that for brightness 
the psychophysical function for bisection 
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is definitely more negatively accelerated 
than for fractionation. 

Of special interest is the effect of order 
on fractionation judgments. That bisect- 
ing first has a marked effect on fractiona- 
tion judgments is shown in Fig. 1, in 
which the ratio stimulus shows a marked 
linear trend with cam exponent under 
fractionation-first conditions, but shows 
no trend (i.e., no cam effect) when frac- 
tionation is preceded by bisection. Even 
more striking is the discrepancy in heights 
of the two mean curves for the fractiona- 
tion groups as contrasted to the bisection 
groups. Inspection of Table 1 indicates 
that this latter difference is mainly due 
to two Ss (OF and SL) of the four Ss 
in the BF group, these two Ss having 
fractionation exponents very close to their 
bisection exponents. The hypothesis that 
these two Ss actually bisected (the stan- 
dard and the threshold or a hypothetical 
zero stimulus) under fractionation in- 
structions is consistent with the data. 

The results summarized in Tables 2 and 
3 indicate that rate of change in lumi- 
nance is a significant source of variability 
in bisection and fractionation scaling. 
However, the monotonic trends exhibited 
in Fig. 1 are difficult to explain, especially 
in view of the absence of any trend in 
the BF group. The significant cam effect 
does raise a problem in the selection of a 
rate of change for a particular experi- 
ment. In some cases—for example, when 
the purpose of the experiment is simply 
to obtain a rough estimate of the ex- 
ponent of the power law—it will not 
matter, since the changes in exponent are 
not large in absolute value. When more 
precision is required, and a single rate of 
change is desired, it seems reasonable to 
select the rate of change which appears 
to S to be constant with time. This im- 
mediately suggests the use of the cam 
apparatus as a scaling method; i.e., the 
cam exponent which provides a uniform 
rate of change is taken to be the ex- 
ponent of S’s assumed power law (see 
Eskildsen, 1963). This method reverses 
the usual scaling procedure: Here we 
first construct the psychophysical func- 
tion, then infer the scale from this func- 


tion. However, the method has the dis- 
advantage of the prior assumption of the 
form of the psychophysical function. 
Nevertheless, it would be of interest to 
compare such an estimate of k with 
estimates obtained in other scaling ex- 
periments, such as bisection and frac- 
tionation, where all three methods. in- 
volve continuous variation in stimuli. 

In summary, these data suggest certain 
advantages of bisection compared to frac- 
tionation. First, the variability of bisec- 
tion judgments was smaller than that of 
fractionation (see Fig. 2). Second, the 
O x C interaction was significant for frac- 
tionation, but not for bisection. Although 
the effect of cams was a significant source 
of variation in both cases, the regularity 
of the cam dependence (absence of inter- 
action) is a marked advantage for that 
method. Finally, the significant order ef- 
fect suggests that for some Ss at least, 
bisection was in some sense a more mean- 
ingful or “dominant” task, since it “in- 
truded” on fractionation, but not vice 
versa. 
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ADAPTATION TO DISPLACED AND DELAYED VISUAL 
FEEDBACK FROM THE HAND* 
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position of his nonvisible right hand. The motion of the trace was 
equivalent to the movements of the hand, which were restricted to a 
frontal plane. The bar could be optically displaced by 20 diopter 


3.0 sec. 


no delay is completely eliminated under all delay intervals, including 


the minimum of 0.3 sec. 


In recent years investigators have 
experimented with devices which in- 


troduce a time delay between the oc 


currence of an event and its registra- 
Normally, as 5 moves 
his hand, motion of its retinal image 
follows with a negligible lag caused by 
How- 
ever, the new devices have been used 
to introduce delays in visual feedback 
ranging from a fraction of a second up 
A few authors development of sensorimotor coordina- 


tion at the eye. 


the transmission time of light. 


to several seconds. 


have reported severe loss of control 
in self-paced, visually guided move- 
ments (Kalmus, Fry, & Denes, 1960; 


Smith, McCrary, & Smith, 1960) a 
a consequence of delay. 


loss of control. 


under delay. 


Although the appropriate strategy 


iThis research has been supported by 
the National Institute of Mental Health 
(Grant M-7642), the National Aeronautics 
and Space Administration (Grant NsG-496), 
Force Office of 
Scientific Research (AF-AFOSR 354-63), 


the United States Air 


and the Rockefeller Foundation. 


However, 
extensive experiments of Ferrell and 
Sheridan (Ferrell, 1965; Sheridan & 
Ferrell, 1963) have shown that Ss will 
usually adopt a simple strategy of 
“move and wait” and suffer no such 
Use of this strategy 
allows time to be traded for accuracy in 
the performance of manipulative tasks 


apparently precludes loss of visual 
control of manipulation under delay, 
. we have reason to suspect that the 
delay should have an effect on at least 
one aspect of eye-hand coordination. 
A series of experimental studies has 
demonstrated the role of movement- 
produced visual feedback in adaptation 
to various types of sensory rearrange- 
ment (Held & Freedman, 1963; Pick 
& Hay, 1965) as well as in the early 


tion (Held & Hein, 1963). Because 
of spatial and kinematic relations, these 
feedback signals are correlated with 
s output signals to the musculature re- 
sponsible for the movements that pro- 
duce the change in retinal stimulation. 
We have advanced the idea that this 
correlated information is basic for 
adaptation to optical displacement pro- 
duced by wedge prisms and to other 
forms of rearrangement. Confirming 
this notion are the findings that when 
S views his hand optically displaced 
and with the normal relation between 
innervation of limb muscle and actual 
movement of the limb upset by varying 
external forces, there is a marked re- 
duction of adaptation to the displace- 
ment (Abplanalp & Held, 1965; 
Efstathiou, 1963). 

Delay complicates the normal relation 
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between spatial dispositions of the hand 
and its retinal image. Under delay the 
position of the retinal image at any 
given time is related to the past position 
of the hand as a function of the delay 
interval and the path and velocity of 
movement of the hand. If, as we sus- 
pect, the nervous system cannot factor 
out the effects of these variables the 
result should be equivalent to decor- 
relation of output and feedback signals. 
Consequently, adaptation to a constant 
visual displacement, such as is pro- 
duced by prism goggles, should be 
reduced. Comparison of the magni- 
tudes of adaptation to displacement 
under normal, no delay conditions with 
those produced under delay should pro- 
vide a sensitive test of the effect of 
delay on the visuomotor control system. 


" 


METHOD 


Apparatus —During exposure S, seated on 
a stool, firmly grasped a knob at the end of 
a control stick, 17.5 in. long, whose other 
end was fixed to a gimbal. Movement of 
S’s hand was then restricted to the surface of 
a sphere of 17.5 in. radius tangent to his 
frontal parallel plane at a distance of 14 in. 
from his eyes. The movements were further 
limited to right-left and up-down components 
of displacements in this plane of 15 and 
1l in, respectively, and rotations around 
an axis along the length of the control stick, 
Movements of the stick actuated three po- 
tentiometers of which one Was responsive 
to rotation and the other two to the two 


the hand. (See Appendix for a more com- 
plete discription). The potentiometers mod- 
ulated three voltages which could be fed 
either directly to an 
recording heads of a tape transport system. 


Speed of tape transport was constant. The 
outputs of the playback heads were then fed 
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direction to those of the hand which ac- 
tuated the system. The scope trace was 
optically projected onto a ground-glass screen 
so that its magnified displacements were 
equivalent to those of the hand. The S$ 
viewed the screen in a fully reflecting mirror 
in an otherwise dark room. The position 
and orientation of the mirror was such as to 
make the virtual image of the trace remain 
superimposed on the invisible hand under con- 
ditions of no-delay and no constant visual 
displacement. For convenience, wedge prisms 
were used to displace the image of the hand 
although mechanical or electronic means 
might have been used. 

Before and after exposure, S' was tested in 
à separate apparatus (Held & Gottlieb, 1958). 
He marked, with his right hand, the virtual 
images of two targets viewed binocularly in 
a fully reflecting mirror that obscured his 
hand. The targets appeared to lie on a sur- 
face whose location approximated that in 
which his hand moved during exposure. 
"Throughout the experiment, S’s head was 
held by a bite board. 

Subjects—Male, college undergraduates 
Served as Ss. All were naive as to the ob- 
ject of this experiment. 

Procedure-—Each of 24 Ss was first run 
in two sessions without time delay. Half of 
the Ss were exposed with base left (BL) 
prisms first followed by base right (BR) 
Prisms on the next day; the other half were 
given the reversed sequence. The sequence of 
events within each session was as follows: 
(a) Preexposure marking: S alternately 
marked the apparent position of each of two 
visual targets for a total of 20 marks, with- 
drawing his hand between markings. (b) 
Exposure: S grasped the knob of the ap- 
Paratus firmly and moved it from left to 
right and back at 21 C/min to the beat of a 
metronome. He was instructed to watch the 
Oscilloscope trace as he viewed it through a 
Pair of 20 diopter prisms oriented either BR 
or BL, depending on the session. (c) Post- 
exposure marking: After 2, 6, 14, and 30 
min. of cumulated duration of exposure to 
the prisms, $ was tested in a manner identi- 
cal to that of the preexposure marking. 
From the 24 Ss tested, we selected 6 who 
showed significant and approximately equal 
adaptation under both BR and BL exposures. 
Experiments with each of these 6 Ss were 
then conducted in 12 further sessions. 

, The procedure of each of the next 10 ses- 
Sions was essentially identical to that of the 
first 2 except for the fact that in each session 
S's visual feedback from his hand movements 


r^ 
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was subjected to a constant time lag whose 
magnitude was varied from session to ses- 
sion. The delays used were 0.3, 0.5, 0.9, 
17, and 3.3 sec. The 10 sessions using 
combinations of each of the five delays with 
each of the two prism orientations were 
presented to Ss in varied sequences. The 
final 2 sessions were identical to the first 2 
in which no delay was introduced. 


RESULTS 


Pre- to postexposure shifts for each 
exposure duration were computed for 
all delays. The S’s mean shifts, aver- 
aged across the two target locations, 
were computed for all six magnitudes 
of time lag (0.0-3.3 sec.). The shifts 
for BR and BL conditions were also 
averaged to yield a single score for each 
condition of delay. A preliminary anal- 
ysis of variance had shown that S’s 
shifts during the first two no-delay 
sessions did not differ significantly from 
those obtained in the last two no-delay 
sessions; therefore, the averaged shifts 
of these four sessions were used for 
further analysis. The S’s mean shifts 
are presented in Fig. 1. With time 
lag in the feedback loop, none of these 
mean shifts was significantly different 
from zero by one-tailed t tests at the 
05 level. However, with no delay the 
mean shifts after 2, 6, and 14 min. of 
cumulative exposure were significant 
beyond the .01 level of confidence and 
after 30 min. beyond the .001 level. 

‘An analysis of variance of the mean 
shifts for all magnitudes of delay indi- 
cated that the magnitude of delay and 
cumulative exposure main effects were 
highly significant (p < 001). This in- 
dicated that, first, the introduction of 
time delay differentially affected adap- 
tation to displacement, and second, that 
adaptation to displacement was in- 
fluenced by the amount of time that S 
was exposed to prisms. 

A second analysis of variance in 
which the shifts obtained under no-de- 
lay sessions were omitted indicated that 
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the amount of delay main effect, failed 
to reach significance at the .05 level 
implying that amount of delay, ranging 
from 0.3 to 3.3 sec., did not differ- 
entially affect the amount of shift re- 
sulitng from exposure. The cumulative 
exposure main effect was significant 
(p < .001). However, as mentioned 
above none of the mean shifts reached 
significance under delay conditions. 
The general trend of the curves for de- 
layed feedback (Fig. 1) was not con- 
sistently in the adaptive direction. 
Since the interaction. between magni- 
tude of delay and cumulative exposure 
was not significant, the trends of the 
curves for the several delays do not 
differ significantly. Consequently, al- 
though cumulative exposure entails sig- 
nificant change, the variation is not of 
the type that we identify with adapta- 
tion. 

An analysis of variance of the scatter 
(standard deviation in right-left dimen- 
sion) in markings of individual target 
points showed no significant changes 
over the period of exposure. 


CONCLUSIONS 


The results indicate that the correlating 
mechanism, believed to account for adap- 
tation, cannot handle a feedback signal 
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delayed by as little as 0.3 sec. Such de- of delays of less than 0.3 sec. now under 
layed signals, then, produce one of the study. Such delays are of the order of 
effects that we have ascribed to decor- magnitude of transmission times in the 
relation, Of great interest are the effects central nervous system. 


APPENDIX 
Electronics of Delayed Feedback A Pparatus 


The electronic portion of the apparatus can be described as follows: S’s hand held 
a control stick which operated a gimbal arrangement consisting of two linear voltage 
divider potentiometers for X and Y directions of translational movement and a sine- 
cosine potentiometer for rotation (Fig. 2). The two linear potentiometers were fed 
a balanced dc voltage of + 150 v. Any deviation of the control stick from the center 
position resulted in a voltage corresponding in magnitude and polarity to the mag- 
nitude and direction of the stick movement. The Sine-cosine potentiometer was fed 
a 60 cps ac voltage of about 7 v. RMS which was used to provide a straight-line 
trace, rotatable on the oscilloscope screen. The Outputs of the sine-cosine potentiom- 
eter consisted of two voltages, corresponding in magnitude to the sine and cosine of 
the rotational displacement of the control stick from a zero position. 

The sin © signal was added to X and cos © to Y thereby superimposing rotational 
on translational information. The output of each adder consisted of a dc voltage 
varying in polarity and magnitude with a 60 cps ac voltage varying in magnitude 
superimposed upon it. Each composite signal was then fed into an analog data 
recorder where it modulated a 7,500 cps carrier and 
tape. Each signal was picked up on a multichannel playback head placed a short 
distance from the recording head, thus introducing a delay into the system. This 


and inversely Proportional to the speed of the tape. The tape speed remained constant 
at 74 in/sec for the entire experiment. The played-back signal was demodulated and 
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At the same time, another channel of the tape recorder was operated with shorted 
input to produce a signal corresponding to noise and drift generated by the tape re- 
corder. This error signal was subtracted from the two information channels, thus 
eliminating any error introduced into the information by the tape recorder. The sig- 
nals were then amplified 10 times and applied to the vertical and horizontal channels 


of the oscilloscope. 


The adders, subtractors, and amplifiers were made using standard analog computer 
operational amplifiers with external resistor networks to obtain the proper functions. 
The modulator/tape recorder/demodulator unit was a Mnemotron Model 200 with a 
special adjustable idler roller to give several different amounts of delay. 
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CORRELATION BETWEEN VISUAL AND KINESTHETIC 
SPATIAL AFTEREFFECTS 


A. A. LANDAUER! G. SINGER, 
University of Sydney 
AND R. H. DAY 
Monash University 


A kinesthetic and visual aftereffect in judgment of horizontality was 
found for 202 Ss in paired balanced trials using the method of adjust- 
ment. Adjustment times were also measured. The correlation be- 
tween the 2 aftereffects was reduced but nevertheless significant after 
allowance had been made by partial correlation for adjustment times in 
the 2 tasks. This finding is contrary to that of earlier experiments in 
which allowance was not made for adjustment times. 


The cortical satiation theory of 
spatial aftereffects (Kohler & Wallach, 
1944) has been extended by others to 
include a notion of differential satia- 
bility between normal and certain psy- 
chopathological conditions (Klein & 
Krech, 1952; Wertheimer, 1955) and 
between individuals with different 
personality characteristics (Eysenck, 
1955). Spatial aftereffects following 
prolonged stimulation have been used, 
therefore, to compare cortical modifia- 
bility between these various behavioral 
States following prolonged stimulation. 
An implication of differential satiation 
is a positive correlation between the 
magnitude of spatial aftereffects in 
different sensory modalities, If, for 
example, a certain psychopathological 
condition is associated with restricted 
cortical satiability, then it would be 
expected that spatial aftereffects would 
be smaller in magnitude than for normal 
Ss in both vision and kinesthesis. So 
far investigations have failed to confirm 
significant positive correlations between 
visual and kinesthetic aftereffects 
(Gardner, 1961; McEwen & Rodger, 
1960; Spitz & Lipman, 1960). 

Since both visual kinesthetic spatial 
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aftereffects dissipate exponentially over 
time (Hammer, 1949; Oyama, 1953; 
Singer & Day, 1965) and, further, are 
most frequently investigated using the 
method of adjustment involving a time- 
delay aftercessation of stimulation, 
their magnitude will vary as a func- 
tion of the time S takes to complete the 
adjustment. Lack of correlation be- 
tween aftereffects in two modalities 
could, therefore, be attributable to dif- 
ferences in adjustment times. This 
would be more likely to occur when 
the two tasks demand different degrees 
of fineness and accuracy so that adjust- 
ments in one takes substantially more 
time than adjustments in the other. 
Alternatively positive correlation could 
be due to similarities in adjustment 
time. An S making quick adjustments 
would show greater aftereffects than 
one making slow adjustments. In 
only one of the studies mentioned above 
(McEwen & Rodger, 1960) was ad- 
justment time controlled and so far no 
account has been taken of the contribu- 
tion of this factor to the relationship 
between aftereffects in different 
modalities. 

The purpose of this experiment was 
to establish the correlation between a 
kinesthetic and visual spatial after- 
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TABLE 1 


MEANS AND SDs OF AFTEREFFECTS 
(DEGREES) AND ADJUSTMENT 
Times (SEC.) 


Variables M SD 
Visual Aftereffect 2.16 | 1.19 
Kinesthetic Aftereffect 4.72 | 2.38 
Visual Adjustment Time 14.04 | 15.62 


Kinesthetic Adjustment Time | 31.86 | 22.24 


effect for essentially similar tasks in- 
volving visual and kinesthetic judg- 
ments of horizontality following ap- 
propriate stimulation by a slanted 
stimulus object. Adjustment times 
have been taken into account by means 
of partial-correlation procedures. 


METHOD 


Subjects —There were 92 male and 110 
female Ss with a mean age of 19.9 yr. re- 
cruited from introductory classes in psy- 
chology. 

Apparatus —The apparatus already de- 
scribed in detail (Day & Singer, 1964) 
consisted of a pivoted circular target with 
luminescent bars for inducing and measur- 
ing the visual aftereffect and a pivoted 
wooden bar for the kinesthetic aftereffect. 
Both these could be adjusted by S to the 
apparent horizontal by means of manually 
operated rotary controls. 

Procedure.—The vi: target was fixated 
at its center for 90 sec. while slanted 15° 
after which S. adjusted it to appear hori- 
zontal. The bar was similarly adjusted 
after side to side movement of the extended 


TABLE 2 


INTERCORRELATIONS AMONG THE Four 
VARIABLES IN TABLE 1 


; . | Visual | Kinesthetic 
Kinesthetic | Adjustment| Adjustment 
Aftereffect Time Time 


Visual 
Aftereffect 

Kinesthetic 
Aftereffect 

Visual Ad- 
justment 
Time 


—489 | —173 


—411 


.303** 
—.304** 


.668*** 


arm and hand across its 15? slanted edge 
while S was blindfolded. Both scales were 
modified from earlier experiments to permit 
readings to the nearest 0.1°. 

Each S underwent four trials (two visual 
and two kinesthetic) consisting of 90 sec. 
stimulation and a posttest. The order in 
which the pair of visual trials and kinesthetic 
trials occurred alternated from S to S. Ad- 
justment times from cessation of stimula- 
tion to completion of adjustment were 
timed with a stopwatch. 

The aftereffect was determined using a 
method developed by Wenderoth (1964)? 


balanced trials. 
experiment pairs of trials were balanced 
when the inducing object or figure was 
slanted from the horizontal in one direction 
on the first trial of a pair and in the opposite 
direction to the same extent on the second 
trial. Although this procedure provides 
some control over cumulative effects which 
might have occurred if stimulation were 
always in the same direction, incomplete 
dissipation of an aftereffect from one trial 
would be reflected in the pretest setting of 
the second. With balanced trials this would 
result in a spurious inflation of the mean 
aftereffect. Wenderoth has shown that the 
mean difference between the two pretests 
when subtracted from the mean aftereffect 
from two balanced trials eliminates this in- 
flation and showed algebraically that the 
same value is arrived at by taking half the 
algebraic difference between two posttest 
settings of the test object or figure. 


RESULTS AND DISCUSSION 


In Table 1 are shown the means and 
standard deviations of aftereffects and 
adjustment times and in Table 2 the 
product-moment correlations between 
the four variables. These data indicate 
that there is a significant correlation 
(.303) between the two aftereffects. 

The correlation (.668) between ad- 
justment times for the two tasks, how- 
ever, suggests that the relationship be- 
tween the aftereffects may be due in 
part to this variable. When the effect 
due to adjustment times is corrected 

2This procedure for deriving an after- 
effect was used by Gibson and Bergman 
(1959) but fhe rationale was not given. 
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by partial-correlation procedures (Mc- 
Nemar, 1962), the correlation between 
kinesthetic and visual effects is reduced 
to .262 (p< 01), which shows that 
there is a significant correlation be- 
tween the aftereffects in the two modal- 
ities which is not entirely due to simi- 
larities in adjustment time. 

These findings based on a substantial 
group of Ss again draw attention to 
the importance of adjustment time as 
a factor in measuring the spatial after- 
effect using the adjustment method 
and lend support to the notion of a 
general underlying process. The lack 
of correlation between aftereffect in the 
two modalities in earlier studies may 
be attributable to differences in ad- 
justment times due to task differences. 
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EFFECTS OF STIMULUS UNCERTAINTY AND S-R 
COMPATIBILITY ON SPEED OF DIGIT CODING* 


LOUIS 


D. COSTA? MORTON HORWITZ, Ax» HERBERT G. VAUGHAN, Js. 


Albert Einstein College of Medicine 


1 of 8 digit-digit coding 
tainty (Us) and 4 levels of 
different groups of 10 Ss 


tasks varying in 2 levels of stimulus uncer- 
S-R compatibility were administered to 8 
each. Response speed was found to vary as 


a function of S-R compatibility and this effect increased as Us was 


raised from 1 to 


3 bits. S-R compatibility 
which sets of stimuli were jdentical and mode of 


effects in these tasks, in 
motor response ‘was 


held constant, were ascribed to the availability of logical S-R transla- 


tion rules as a function of 


Attempts to explain alterations in 
choice reaction time (CRT) in terms of 
variations in amount of stimulus uncer- 
tanty (Us) (Hick, 1952; Hyman, 
1953) have proved at least partially un- 
successful because of failure to consider 
other relevant variables. Among such 
is the degree of natural or learned rela- 
tion between individual pairs of stimuli 
and responses (Fitts & Switzer, 1962). 

In situations requiring directional 
motor responses to complex visual dis- 
plays, Fitts and associates have demon- 
strated a reduction in CRT when the 
response required to each stimulus is 
one which would be highly likely to 
occur even without experimental in- 
struction. They showed an increase in 
CRT for S-R pairs involving novel or 
unlikely S-R combinations. This vari- 
ation of CRT as a function of the re- 
lation between S and R was defined 
as the construct ibili 
(Fitts & Deininger, 
Seeger, 1953). High S-R compatibil- 
ity may be produced by a natural physi- 
ological relationship between stimuli 
and responses as in the case of the ex- 
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preexperimental experience. 


periment by Leonard (1959) in which 
Ss were required to respond with the 
same finger that was receiving tactile 
stimulation, Preexperimental over- 
learning of S-R pairs as in tasks which 
require naming of visually or aurally 
presented digits or letters also produces 
little increase in CRT as a function of 
U, and consequently shows the effect 
of S-R compatibility in reducing CRT 
in high Us conditions (Brainard, Irby, 
Fits, & Alluisi, 1962; Mowbray, 
1960). 

Several investigators have studied 
intraexperimental alterations in. CRT 
as a function of practice and Us. While 
Mowbray and Rhoades (1959) found 
that with extensive practice CRT be- 
comes independent of U, for Us <2 
bits, Hellyer (1963) demonstrated a 
significant Us effect on CRT after 5,040 
trials on a relatively unusual light- 
naming task. 

It is apparent then that the relation 
within individual stimulus and response 

irs helps to determine CRT even 
the number of such pairs is held 


when ; 
constant. Factors affecting the S-R 
relation may be intrinsic to the struc- 


ture and function of the organism Or 
may reflect the past history of the or- 
ganism with respect to previous »- 
translations required of him. 

In the present study we examined à 
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Fic. 1, S-R codes at each level of U, and 
S-R compatibility. 


Series of four digit-coding tasks re- 
quiring different cognitive operations 
in order to generate an appropriate R. 
fora given S. The tasks varied along 
two related continua: (a) the amount 
of preexperimental familiarity with 
the actual S-R ensembles and (b) the 
availability of a discoverable coding 
tule or concept which would enable 
S to generate the correct S-R associa- 
tions without rote learning. Tasks 
which maximized preexperimental fa- 
miliarity and easily discoverable cod- 
ing rules were construed as being high 
in S-R Compatibility. "The four tasks 
So developed were presented with both 
two or eight S.R alternatives, thus 
varying level of U, and producing eight 
discrete tasks. The four sets of stimuli 
at each level of U, were identical en- 
sembles of 1-digit numbers, The re- 
sponses appropriate to each stimulus 
were written 1-digit numbers, hence 
the mode of Tesponse remained con- 
Stant for all conditions. 
Sources of variation, then, were the 
number of S-R pairs in each ensemble, 
and the relation between each S-R 
pair. Naive Ss were used. Each re- 
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ceived 10 trials on only one of the 
eight tasks. 

We hypothesized an increase in re- 
sponse speed under conditions of higher 
S-R compatibility and an interaction 
between U, and S-R compatibility ef- 
fects such that the latter would be 
maximized at the higher level of U, 


METHOD 


Tasks.—The tasks employed were similar 
to the digit-symbol subtest of the WAIS 
except that both stimulus and response en- 
sembles contained single-digit numbers and 
the number of S-R pairs on each task was 
either two or eight. The keys for all tasks 
are shown in Fig. 1. The numbers in the 
top row are the stimuli and the bottom row 
indicates the correct responses. The four 
levels of S-R compatibility in order of 
hypothesized response speed involved: (a) 
Copying—the stimulus number and response 
number were identical. (b) Subtracting— 
the response required was always one num- 
ber less than the stimulus paired with it. 
Thus, once the nature of the operation re- 
quired was learned S could correctly respond 
by producing a response to each stimulus 
that had been strengthened by extensive pre- 
experimental experience, that of subtracting 
from a given l-digit number. (c) Choosing 
the opposite—the S and R ensembles were 
arranged in ascending and descending series, 
respectively, such that for the two-alterna- 
tive task the two S-R pairs were opposites 
(ie, 3,9 and 9,3) and for the eight-alterna- 
tive task, eight such pairs of opposites were 
Produced (see Fig. 1). In this task it was 
Possible to generate a rule which would en- 
able correct response coding without refer- 
ence to the key, but the individual S-R 
pairings did not involve the exercising of a 
habit which was Strengthened by extensive 
Preexperimental experience. That is, while 
Ss undoubtedly had had Practice in produc- 
ing opposites with a variety of stimuli, the 
specific stimuli considered opposite to each 
Other in this task had to be learned during 
the task itself. (d) Random pairing—no 
special relation existed between each S-R 
pair and thus each pair had to be learned as 
à separate unit. 

Pretest of Sheed of digit writing.—The 
speed at which digits between 1 and 9 could 
be written was determined by pretesting 20 
control Ss who Participated in no further 
experimentation. Each S ran nine trials, 30 
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TABLE 1 


MEAN RESPONSE RATE FOR WRITING SINGLE 
Drorrs (n = 20) AND AVERAGE RATE FOR 
ENSEMBLE oF Droits IN EACH Task 


Digit M Task M 

1 105.2 2 copy 60.55 
2 59.2 2 subtract | 58.75 
3 59.7 | 2 opposite | 60.55 
4 53.2 | 2 random 58.20 
5 49.3 
6 632 | 8copy 58.78 
T 664 | Ssubtract | 64.25 
8 57.8 | 8opposite 58.78 
9 61.4 | 8random 64.25 

Grand 

M 63.9 


sec. each, in which his task was to write a 
single digit repetitively as rapidly as possible 
in the same boxes used in the experiment 
proper. Each S ran one trial for each digit, 
the order balanced across Ss. The data are 
shown in Table 1. The pairs of response 
digits selected for the “two-alternative” 
tasks, in the experiment proper, closely ap- 
proximated the grand mean response rate 
for all 9 digits. For the tasks with eight 
alternatives, the substitution of the digit 1 
for the digit 9 in the random and subtract 
tasks made these tasks slightly easier in 
terms of motor response than the copy or 
opposite tasks. 

Subjects.—Eighty Queens College ele- 
mentary psychology students, 40 males and 
40 females, were each assigned to one of 
eight code groups. Groups were balanced 
for sex but otherwise assignment was random. 
All Ss were right-handed. 

Procedure.—The Ss were tested in groups 
of about 25 students. Each S was given a 
booklet containing a fact instruction sheet 
and 10 code series on separate pages with 
between them. Each 


sponses and contained a key at 
the page like that shown in Fig. 1. 
each group all 10 code series and keys were 
identical. 

The Ss read the instruction sheet which 
contained a key and a practice series of eight 
responses. They were instructed to complete 
each code pair in order without skipping 
pairs. No indication of the principle, if any, 
by which the appropriate response for each 
stimulus could be deduced was given. 

Aíter all Ss had completed the instruc- 
tion sheet the signal was given to turn the 
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page and start Trial 1. Forty seconds were 
allowed for each trial containing a series of 
100 items. At the completion of each trial 
Ss turned the page to an opaque sheet and 
They then turned the 
page on command and worked Trial 2. The 
sequence was repeated until 10 trials had 
been run. 


RESULTS 


The number of correct responses for 
each S on each trial were recorded. 
Errors occurred with a frequency of 
less than 1% and were disregarded. 
Only 1 of the 80 Ss completed all 100 
items and then on only 3 of 10 trials. 

The data were subjected to an anal- 
ysis of variance in which U, and code 
types were fixed effects and repeated 
measurements (trials) were made on 
Ss nested within the eight groups pro- 
duced by combinations of the fixed 
effects. 

The main effects, Us, code types, and 
trials, all showed significant variation 
(p<.01) as did the interaction be- 
tween U, and code types, and the triple 
interaction. The mean number of cor- 
rect responses on each trial as a func- 
tion of both Us and code type are shown 
in Fig. 2. 

It is clear that when code types are 
combined there is significantly faster 
response rate for the two-alternative 
task than for the eight. Differences 
among code types were determined by 
Scheffé test of multiple comparisons 
(Scheffé, 1959, Ch. 3) which demon- 
strated that the copy condition pro- 
duced significantly faster responses than 
all other coding conditions. The sub- 
tract condition produced significantly 
faster responses than the opposite and 
random conditions which were not 
reliably different from each other. 

The hypothesis that S-R compati- 
bility effects would vary as a function 
of U, was confirmed by the significant 
U, X Code Type interaction. Figure 2 
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and Scheffé's test demonstrate that, 
while the eight-copy condition was 
superior (5 « 01) to the two-copy 
condition, with increasing incompati- 
bility there was a greater drop off in 
Tesponse rate for the eight-alternative 
conditions than for corresponding two- 
alternative conditions, 

All groups improved over trials with 
Trial 2 producing significantly greater 
Tesponse rates than Trial 1. After 
Trial 2 there was no Significant im- 
provement in adjacent trials, but sig- 
nificant increment in response rate was 
shown by comparing means of trials 
which were on the average three trials 
apart. 

The significant triple interaction 
indicated differential learning rates as a 
function of the interaction of Us and 
code type. Inspection of Fig. 2 sug- 
Bests that this effect is produced pri- 


marily by a great increase in response 
speed in early trials for the eight-copy 
and eight-subtract conditions. 

The self-paced coding tasks utilized 
in this experiment differ from many of 
the tasks used by other investigators 
to study dimensions of U, and S-R 
compatibility. Stimuli are not pre- 
sented discretely and it is impossible 
to analyze individual response times 
to individual stimuli, What is meas- 
ured is response rate for a specified 
Period of time. This measure was used 
to produce a mean CRT: the time in- 
terval (40 sec.) divided by the number 
of correct responses. Informational 
measures, in bits transmitted per sec- 
ond, were computed by multiplying the 
number of correct responses by 3 (for 
an eight-alternative task) or by 1 (for 
a two-alternative task) and dividing by 

The mean CRT over 10 trials and 
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TABLE 2 


Mean CRT (MskC.) AND INFORMATION 
‘TRANSMITTED OVER 10 ‘TRIALS 
ror EACH GROUP 


Two-Alternative Eight-Alternative 


Code Type 
CRT Bits/Sec | CRT Bits/Sec 
Copy 547 | 1.825 528| 5.682 
Subtract 549 1.821 739 | 4.060 
Opposite 597 1.675 | 1136] 2.041 
Random | 609 | 1.642 1197 | 2.506 


the average information transmitted per 
second over 10 trials are shown in 
Table 2. 


Discussion 


Although the format of the tasks in the 
present experiment is not the usual CRT 
format, the results obtained are quite 
similar to those produced by Fitts and 
Deininger (1954) ina conventional eight- 
alternative CRT task. The latter authors 
produced S-R compatibility effects by 
varying the nature of the stimuli, spatial 
and symbolic codes, and the nature of the 
S-R translation rules required to produce 
the appropriate response (moving a 
stylus in one of eight directions). The 
range of variation on the first day’s trials 
on their tasks was from 421 to 1,339 
msec, which is similar to the range of 
528-1,197 msec. obtained in the present 
investigation. They report a rate of 
transmission of information of from 1.8 
to 7.3 bits per second which is roughly 
similar to the range of 2.5-5.7 bits re- 
ported here. Thus, although widely dif- 
ferent modes of stimulus presentation and 
response recording were used, and the 
ensembles of stimuli and responses as well 
as the rules of S-R translation differed, 
the results obtained in both experiments 
were of about the same order of magni- 
tude. 

In this experiment S-R compatibility 
effects were demonstrated on a series of 
tasks in which no variations in stimulus 
modality, spatial configuration, or mode 
of response were present. All significant 
intertask variation was produced by alter- 
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ing either U, or the S-R translation rules 
for connecting individual stimuli and re- 
sponses. 

The significant F ratio for U, must be 
interpreted with caution. While it is true 
that the overall response rate for eight- 
alternative tasks is slower than that for 
two-alternative tasks, this effect was pro- 
duced by the arbitrary selection of a set 
of tasks which would require a range of 
cognitive operations and thus produce a 
range of S-R compatibility effects. As 
can be ascertained by reference to the re- 
sults for the two-copy and eight-copy 
tasks, it is clearly not necessary for all 
eight-alternative tasks to produce slower 
response rates than two-alternative tasks. 
There is, one would infer, from compari- 
son of the copy tasks, no decrement in 
response rate produced either as a result 
of having to perceive eight vs. two stimuli 
or having to select one of eight vs. one of 
two responses. Forrin and Morin (1963) 
also report no difference in CRT on a two 
vs. four alternative number-matching task. 
Thus, the decrement in response rate at 
the higher level of Us results from the 
fact that at this level S-R compatibility 
factors are producing a greater effect. 
It is not merely increasing Us that pro- 
duces lower response rate but increasing 
U, in tasks with low S-R compatibility. 

A priori analysis of the four different 
coding tasks correctly suggested the order 
of response rate with one exception. It 
was assumed that the opposite condition 
would produce faster responses than the 
random condition whereas no significant 
difference was observed between them. 
This might have occurred because of the 
greater ease of motor performance of 
the eight random series (see Table 1). 

In general, however, the data demon- 
strate that in cognitive tasks as well as 
perceptual-motor tasks there exists an 
interaction between S-R relatedness and 
U,. At some levels of S-R compatibility 
or of task difficulty, additional stimulus 
and response alternatives can be added 
without affecting CRT. Under conditions 
of low compatibility or high test com- 
plexity increased U, begins to affect CRT 
performance by producing a correspond- 
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ing increase in CRT. As in our experi- 
ment (Table 2) the rate of information 
transmitted per second continues to rise 
but fails to keep pace with the increase 
in U,. 
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TEMPORAL INTEGRATION: THE RELATIVE UTILITY OF 
IMMEDIATE VERSUS DELAYED REWARD 
AND PUNISHMENT * 


K. EDWARD RENNER 


University of Illinois 


2 experiments demonstrated that an immediate shock-food combination 
was preferred over nonshock-nonfood, but nonshock-nonfood was pre- 
ferred over a delayed presentation of the same shock-food combination. 
This data showed that an immediate shock-food combination was posi- 


tive in net value, but when delayed, negative in net value. 


It was 


suggested that the temporal gradients of reward and punishment reflect 
the relative utility of immediate and delayed outcomes which must be 
combined (integrated) into a net value. 


A temporal gradient of reward which 
is steeper than a temporal gradient of 
punishment can be derived by selecting 
appropriate incentive values, and an 
immediate shock-food combination will 
be preferred over the delayed presenta- 
tion of the same combination ( Renner, 
1966a). These findings suggested that 
an immediate shock-food combination 
was positive in net value, but when 
delayed, negative in net value. The 
present experiments tested this sug- 
gestion by pairing each combination 
with nonshock-nonreward and obtain- 
ing preference data. Also, the prefer- 
ence for immediate over delayed shock- 
food was replicated and the effect of 
signaling the occurrence of the delayed 
shock-food combination was observed. 


METHOD 


Apparatus and subjects—A complete de- 
scription of the apparatus is available else- 
where (Renner, 1964a, 1966a, 1966b). 
Briefly, a start box was located in the center 
which provided S with entrance to a delay 
chamber on its right or left side; a goal box 
was at each end. In Exp. I, Ss were two 
groups of eight each of 84 day old, naive, fe- 
male, Long-Evans rats, and in Exp. II, four 


1 This work was supported by Grant No. 
MH 11633 from the United States Public 
Health Service, National Institute of Mental 
Health. 


groups of 10 each of 73 day old, naive, female, 
Sprague-Dawley rats. 

Procedure.—In Exp. I, Ss were placed on 
a liquid food diet and reduced to 75% of 
their body weight by adjusting the number 
of minutes each S was permitted to eat in 
the goal box where it received all of its 
daily ration. In Exp. II, Ss were on 
powdered food and were permitted to eat for 
2 min. in the goal box; the remainder of the 
daily ration necessary to hold weight at 75% 
was given in the home cage. 

The apparatus was arranged with a black 
and white striped delay chamber and a black 
goal box on the food side; the delay chamber 
on the opposite side (i.e., nonshock-nonfood) 
was white. All Ss received one free, and 1 hr. 
later, one forced trial per day. In Exp. I, 
Ss were run 6 days per week, and in Exp. 
II, five days per week. For the first 2 wk. 
of Exp. I, and the first week of Exp. II, 
Ss were trained for a position response, half 
of each group to the right and half to the 
left; shock was then introduced in the delay 
chamber on the food side. During Exp. I, 
the shock was .4 ma. for .75-sec. duration; 
for Exp. II, .5-sec. duration. The Ss were 
removed from the nonshock-nonfood side 
after 15 sec. Both experiments lasted 5 wk. 

In both experiments, Group 1 received 
shock-food after 1-sec. delay, and Group 2, 
after 30-sec. delay. Group 3 of Exp. II was 
similar to Group 2 except that a 6-w. light 
cue in the roof of the delay chamber was 
presented 2 sec. before the delayed shock-food 
combination was delivered. Week 1 was 
omitted for Group 4 of Exp. II; these Ss 
were given a choice between the immediate 
and the delayed shock-food combination over 
the final 4 wk. of training. 
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Fic. 1. Preference of Ss for shock-food 
vs. nonshock-nonfood. (In both Exp. I and 
II, Group 1 received immediate and Group 2 
received delayed shock-food. For Group 3, 
of Exp. II, the delayed shock-food combina- 
tion was signaled by a light cue. Shock was 
introduced after the second week in Exp. I 
and after the first week in Exp. II. There 
Were six free and six forced trials per week 
in Exp. I, and five free and five forced trials 
per week in Exp. IL) 


Resutrs AND Discussion 


Preference —The percentage of free- 
trial responses, by Group 4 of Exp. II, 
to the immediate Shock-food over the 
4 wk. of preference training were 5096, 
60%, 64%, 74%, and their SDs were 
30.14%, 13.32%, 29.52%, and 32.38%, 
respectively. This increase in the per- 
centage of responses to the immediate 
over the delayed side from the first to 
the last week yielded a + (9) = 1.857, 
b < 05, one-tailed. The preference for 
the immediate shock-food side occurred 
for 8 of the 10 animals, These data 
confirm the previous Teport (Renner, 
1966a) of a final preference Barca 
ate over delayed shock-food. 

Desire and aversion—The prefer- 
ence data for both experiments are ij. 
lustrated in Fig.l. It may be seen for 
Groups 1 and 2 in both experiments 
that there was an initial preference to- 
ward the food side, which shifted to- 


ward the nonshock-nonfood side after 
the shock was introduced. In Exp. I, 
the 1-sec. delay group was making sig- 
nificantly more choices to the shock- 
food side (vs. the nonshock-nonfood 
side) by Week 5 than were the 30-sec. 
delay group with ¢ (14) = 1.817, p< 
-05, one-tailed. For Exp. II, an analy- 
sis of variance of the three groups (see 
Fig. 1) yielded a significant between- 
groups difference, with F (2, 27) = 
6.15, p < .01, due in part to the fact 
that Group 1 preferred the immediate 
shock-food side over nonshock-nonfood 
more frequently than Group 2, t ( 18) 
—2917p5« -01, one-tailed. As in the 
previous experiment, the incentive 
values were not perfectly adjusted and 
demonstrate only the relative positive 
value of an immediate and the negative 
value of a delayed shock-food combi- 
nation. 

Group 1 of Exp. II was divided into 
two subgroups made up of five Ss that 
preferred the shock-food side and 
five Ss that Preferred the nonshock- 
nonfood alternative ; the choice latencies 
of these Ss yield a significant interac- 
tion term, with F (1, 19) = 16.24, p 
< .005. This interaction reflects the 
fact that Ss Preferring the shock-food 
side went faster to that side (X = 
53.52, SD = 2177) than did Ss pre- 
ferring the nonshock-nonfood side (X 
= 37.79, SD = 24.17) and the rela- 


14.67, and X = 5391, SD = 2033, re- 


group, preferred the nonshock-nonfood 
alternative. Over the final 3 wk., there 
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TABLE 1 


Ms AND SDs or MEDIAN RECIPROCAL CHOICE Larena ( 100/925) TO THE 
Two SIDES OVER THE Fina 3 WK.: Exp. 


Group and Side 


Group 
Shock-Food 
Nonshock-Nonfood 
Group 3 
Shock- Food 
Nonshock-Nonfood 


SD x SD 
13.84 43.53 13.85 
24.61 65.23 39.75 
31.65 58.02 24.51 
35.54 82.91 33.38 


an interaction term of F (2, 18) — 
4.07, p < .05. The latencies in Exp. 
I were similar in direction, but the dif- 
ferences were not significant. 

Signaled and unsignaled shock.—The 
present data do not deal directly with 
the additional problem that a delayed 
shock, because it is unsignaled, may be 
more aversive than an immediate shock 
(Knapp, Kause, & Perkins, 1959). 
The performance of Group 3 indicates 
the preference for nonshock-nonfood 
was not altered by signaling the de- 
layed shock-food. The final preference 
data shows a significant difference be- 
tween Group 1 and Group 3, ¢ (18) = 
2.667, p < .01, one-tailed (see Fig. 1), 
and the latencies of Group 3 are similar 
to those of Group 2, as may be seen in 
Tablel. There are progressively faster 
latencies to the nonshock-nonfood side 
over trials while the latencies to the 
shock-food side decline slightly, with F 
(2, 18) = 7.87, p «.005. Although 
these data are consistent with a gradient 
notion of delay of punishment (Renner, 
1964a, 1964b, 1966a) it should be noted 


that Group 3 does not provide direct 
evidence either on the relative aversive- 
ness of the immediate vs. delayed 
presentation of the shock used in the 
present experiments, or that the light 
cue signaling the shock, was, in fact, 
an effective technique for eliminating 
a possible increase in aversiveness due 
to the delay itself. 
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EFFECT OF MEAN AND VARIABILITY OF EVENT RUN 
LENGTH ON TWO-CHOICE LEARNING? 


BLASE GAMBINO Ax» JEROME L. MYERS 


University of Massachusetts 


In a 2X3 factorial design, 6 groups of 24 Ss predicted whether E 
would call out a 1 or a 2 on each of 540 trials. The independent 
variables were mean length of event runs (4.5 and 7.5) and variability 
of run length (low, medium, and high). The major findings were (a) 
more Ss predicted that runs would end too soon (early errors) in the 
low mean groups than in the high mean groups, (b) the effect of in- 
creasing mean was to increase the number of Consecutive events 
which occur before the response Probability curve begins to decline, 
(c) given the occurrence of the last event in the longest run for each 
group, predictions that the run would continue (late errors) increased 
as variability increased, and (d) repetition responses, predictions of 
the event which occurred on the preceding trial, increased as variability 
increased. It was concluded that variability of event runs is a major 
factor in the occurrence of negative and positive recency effects. 


In the usual two-choice probability- 
learning situation, it has been found 
that the probability of predicting an 
event, Ej, decreases after several Es 
have occurred in succession. This 
negative recency effect, first reported 
by Jarvik (1951), has proved difficult 
to explain (Estes, 1964). However, 
two recent studies (Derks, 1963: Good- 
now, Rubenstein, & Lubin, 1960) have 
demonstrated the importance of the 
average length of a run of events as a 
factor in any explanation of the nega- 
tive recency effect, The Goodnow et al. 
(1960) results showed that negative 
Tecency decreased as mean run length 
increased. Derks (1963) found nega- 
tive recency using short runs, and posi- 
live recency with long runs and extra 
long runs. 

More recently, Jones and Myers 
(1966) suggested that a more impor- 
tant factor is the variability of run 


1This research was supported by funds 
from National Institutes of Health Grant 
MH-03803-04 and National Science Founda- 
tion Grant GS-386. The authors wish to 
thank the Research Computing Center, Uni- 
versity of Massachusetts, for assistance in 
analyzing data, 


length occurrence, They randomized 
event frequencies over either 20 (RSB) 
or 300 trials (RLB). Negative recency 
was found in the RSB group and 
positive recency in the RLB group. 
Jones and Myers noted that increasing 
the size of the trial block led to an 
increase in the mean length of event 
runs and a more marked increase in 
the variability of run-length occurrence. 
No systematic interpretation of the 
relative contribution of these two vari- 
ables can be drawn since the experi- 
mental procedure led to a confounding 
of these effects. Previous studies, e.g., 
Goodnow et al. (1960) and Derks 
(1963), are also not helpful in assessing 
the relative effects of mean and vari- 
ability since their procedures were not 
designed to manipulate these variables 
independently. The present study was 
therefore designed to separate the ef- 
fects of mean run length and run 
Variability by orthogonally manipulat- 
ing these variables, 


METHOD 


Procedure.—On each of 540 trials, Ss were 
asked to Predict on IBM score sheets 
whether E would call out alora2 The 
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Fic. 1. Response and event probability as a function of run length. 


E signaled the beginning of each trial by 
calling out the item number on the score 
sheet where S's prediction was to be marked. 
The Ss were provided with two blank strips 
which they used to cover their predictions 
as the session moved along. The Ss were 
given approximately 3-4 sec. to make each 
prediction. 

Design.—Each of 144 Ss was randomly 
assigned to one of six experimental groups 
which differed with respect to the sequence 
of events which they were required to pre- 
dict. Each sequence consisted of 540 trials 
with each event occurring equally often. 
The sequences differed with respect to mean 
length of event runs (4.5 and 7.5) and 
variability of length of event runs (low, 
medium, high). Table 1 presents the run 
lengths used, and frequency of occurrence 
of each run for the six experimental groups. 
The method used in constructing the sequence 
for the low mean-low variability group (LL) 
will serve to illustrate how each sequence 
was generated. Thirty runs of length 4 
and 30 runs of length 5 were needed to total 
270 trials for one event, eg. Ers. A card 
was then labeled as 4 or 5 to represent each 
of the 60 run lengths used. This set of 60 
cards and an identical set representing the 


alternative event, E:'s, formed two separate 
decks. Each deck was independently shuffled 
to form two random orders, and runs, as 
represented by the cards, were drawn by 
alternating between the two decks until all 
the cards had been drawn. The other se- 
quences were constructed in a similar manner 
using the run lengths shown in Table 1. 
Subjects—Twelve males and 12 females 
were assigned to each condition to make a 
total of 24 Ss. The Ss were drawn randomly 
from the introductory psychology course at 


TABLE 1 


RuN LENGTHS USED AND 
FREQUENCY OF RUNS 


Group*| Run Lengths Used kie Aa 
p 4,5 30 
LM 3, 4,5, 6 15 
LH 2, 3, 4, 5, 6,7 10 
HL TS 18 
HM 6,7,8,9 9 
HH 5, 6, 7, 8, 9, 10 6 


*L = Low, M = Medium, H = High. First letter 
refers to mean, second letter refers to variability. 
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TABLE 2 
RESPONSE MEASURES OBTAINED FOR EACH EXPERIMENTAL GROUP 


LL LM LH HL HM HH 
04.4 02.7 03.8 01.3 02.5 01.8 
(pes id 12.2 22.8 39.8 15.2 36.4 $2.7 
Repetition Responses 43.2 69.8 80.9 46.7 70.7 82.0 
Redefined Repetition Responses 58.7 75.0 83.0 62.5 74,5 83.5 


the University of Massachusetts and served 
for grade points. 


RzsurTs 


Figure 1 presents average event 
and response probabilities, conditional- 
ized on run length. Since the results 
clearly were not a function of whether 
the preceding run of events were E,’s 
or £,’s, this distinction has been omit- 
ted. As an example of how the points 
in Fig. 1 were obtained, consider a run 
length of 4 for the LT. group. Since 
half of the run lengths are 4's and half 
5’s, the actual probability that a run 
will continue (Le, the event prob- 
ability), given that four consecutive 
Ey’s have just occurred, is .5. To ob- 
tain the Tesponse probabilities for 4 
run length of 4, the number of runs of 
that length, including those contained 
in runs of length 5, were divided into 
the number of times that S predicted 
that the ffth consecutive E, would 
appear. The response Probabilities 
Were originally calculated for each block 
of 90 trials, However, since the differ- 
ences between trial blocks were slight 
and the curves appeared to stabilize by 
the end of the first block of trials, the 
response Probability curves in Fig. 1 


blocks. The values for zero Tun length 
Tepresent the Probability of Predicting 
E, given that a Tun of Ejs (i 75i) 
of any length had just ended. 

he response and event curves may 
be described as roughly parallel to each 
other; however, in the early part of the 


curve, response probability undershoots 
event probability and this is more pro- 
nounced at low mean than at high, At 
the point where the event probability 
curve falls below 100%, this trend is 
Teversed, and the response probability 
curve now overshoots the event prob- 
ability curve. The amount of over- 
shooting becomes greater as variability 
increases and to a lesser extent as mean 
increases. 

For all six groups, when the event 
probability curve is at 100%, the op- 
timal response Strategy is obvious; Ss 
should always predict that the run 
segment. 

optimal 
Strategy was defined as an error. In 


to perseverate during that por- 
tion of the run when perseveration was 
the correct Tesponse. "The first row 
(early errors) in Table 2 shows a small 
but highly significant, F (1, 138) = 
1097, p< 01, difference ; the low 
mean groups exhibit less tendency to 
Perseverate and therefore commit more 
early errors, 


on each run curve is always to predict 
the alternative event, 


on the curve, any response probability 
Sreater than zero reflects late errors. 


experimenta] variables on these errors, 
an analysis of variance was performed 


b 


RUN LENGTH IN PROBABILITY LEARNING 


on the probability of predicting that the 
run would continue given that Ss had 
seen the last event of the longest run 
for that condition. The second row in 
Table 2 (late errors) reveals significant 
differences, F (2, 138) =13.12, p< 
01, as a function of variability; as 
variability increases, so do late errors. 

The results of previous investigations 
have suggested that mean run length is 
a major factor in determining negative 
recency (e.g., Derks, 1963; Goodnow 
etal,1960). Figure 1 shows that one 
effect of increasing mean run length is 
to increase the number of consecutive 
events which occur before the response 
probability curve begins to decline. 
Whether the rate of decline is also af- 
fected by mean run length is not im- 
mediately clear from Fig. 1, although 
variability of run length does seem to 
be a factor. These effects were meas- 
ured by performing an analysis of 
variance on the probability of repetition 
responses, ie., of predicting the pre- 
ceding event for that portion of the 
curve in which event probability is 
less than 1.0. Table 2 shows that 
repetition responses increase signifi- 
cantly, F (2, 138) — 107.87, p < .001, 
as variability increases, but not as mean 
increases (F < 1.0). Since late errors 
are included as repetition responses, 
the latter were examined with late 
errors removed. The last row of Table 
2 shows that repetition responses are 
greater when late errors are deleted, 
and this is especially so for low vari- 
ability. There is still no effect due to 
mean and the redefined repetition re- 
sponses still increase as variability in- 
creases, although the effect is slightly 
reduced. 


DiscussioN 


A reasonable assumption about the ef- 
fect of mean run length would be that Ss 
could keep count of run length more 
easily at low mean than at high. This 
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assumption is contradicted by the fact 
that significantly more early errors occur 
in the low mean groups. However, pre- 
diction of the alternative event is rein- 
forced each time a new run occurs. 
Therefore, since Ss in the low mean 
groups receive a greater number of re- 
inforcements than Ss in the high mean 
groups (Table 1), the low mean Ss fail 
to perseverate more than the high mean 
Ss even though such behavior is inap- 
propriate during the early portion of the 
run curve, 

The finding that as variability of run 
length increases, perseverative behavior 
also increases, as evidenced by less nega- 
itve recency and more late errors, sug- 
gests an explanation of certain incon- 
sistencies in the probability-learning lit- 
erature. Edwards (1961) has pointed 
out that previous investigators (e.g. 
Anderson, 1960; Jarvik, 1951; Nicks, 
1959) reported negative recency while he 
obtained positive recency. Edwards, how- 
ever, randomized event frequencies over 
1,000 trials while previous investigators 
used 400 trials or less. Jones and Myers 
(1966) have pointed out that random- 
izing over large trial blocks results in a 
slightly larger mean run length and a 
much larger variability of run length. In 
view of the results of the present study, 
it would appear that a major cause of the 
difference between the results reported by 
Edwards and those investigators whc 
used fewer trials is the difference in 
variability of run length produced by 
randomizing over different numbers of 
trials. 

The effect of variability of run length 
can be explained in terms of the com- 
plexity of the task that Ss face. As it 
becomes more difficult to predict ac- 
curately, ie. as variability, and there- 
fore uncertainty, increases, Ss may in- 
creasingly rely on the last event they 
have seen and may therefore exhibit 
more repetition responses. 

A more satisfying approach to the 
data would be a general model of choice 
behavior, one of whose consequences 
would be the prediction that persevera- 
tive responding increases as a function of 
run length variability. Restle (1961) 
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has developed a model which has this 
property and is also capable of describing 
negative recency. The model assumes 
that the prediction following a run of 
length j depends on how frequently that 
run was continued in the past, In addi- 
tion, Restle assumed that longer runs are 
more “salient” in Ss’ memory, and there- 
fore deserve a greater weight. The 
model does not describe the data very 
accurately; the predicted probabilities con- 
sistently undershoot the observed proba- 
bilities. This failure of the model might 
be remedied by taking into account the 
reinforcing effect of the last event, Thus, 
Some integration of the Restle model in 
which the continuation of the run is the 
reinforcer, and the traditional linear 
models (eg, Bush & Mosteller, 1955; 
Estes & Straughan, 1954), in which the 
last event is the reinforcer, may describe 
the underlying Processes reflected by Ss’ 
behavior, 
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A COMPARISON OF TWO METHODS OF EVENT RANDOMIZATION 
IN PROBABILITY LEARNING + 


MARI R. JONES anp JEROME L. MYERS 
University of Massachusetts 


12 groups of 16 Ss each were run in a 2-choice probability-learning 
situation. The groups differed with respect to the combinations of 
amount of money risked (0, 5¢, 254), the level of m (.60, .75), and 
the structure of event sequence (randomized in long or short trial 
blocks). The major findings were that Ss had a higher probability 
of predicting the more frequent event, a lower probability of being 
correct, and exhibited less negative recency when events were ran- 
domized in long blocks than when events were randomized in short 
blocks. These results were discussed in terms of the distributions of 


lengths of runs of events. 


Studies of probability learning have often 
differed with respect to the method of gener- 
ating the sequence of events that S is re- 
quired to predict. Several Es have random- 
ized in short blocks (RSB), requiring, for 
example, that m (probability of the more 
frequent event) be the same for each 10, or 
20, trial block (eg, Siegel & Goldstein, 
1959). Other Es have randomized the event 
sequence over long blocks (RLB), for ex- 
ample, 250 trials or more (e.g, Derks, 1962; 
Edwards, 1961). The purpose of this ex- 
periment is a direct comparison of the RSB 
and RLB procedures. In addition, three 
incentive levels were compared under each 
randomization procedure. 

Method.—Each S was assigned to 1 of 12 
combinations of risk (0d, 5¢, 256), m (.60, 
75), and sequences (RSB, RLB). The 0¢ 
group won or lost a chip on each trial, the 5¢ 
group won or lost a nickel, and the 25¢ group 
won or lost a quarter; thus, feedback was 
equivalent for all Ss in terms of information 
received. Sequences were generated with 
the procedure used by Nicks (1959) in which 
the expected number of event runs of a given 
length are calculated for a particular 7 level 
and » (number of trials over which the event 
occurrences are to be randomized). Se- 
quences in which the run distribution was 
derived for n — 20 were RSB sequences, and 
those based on n of 300 were RLB sequences. 

Each trial lasted 10 sec. During a 2-sec. 
interval following a .5-sec. buzz, S predicted 
either E; or Es by depressing the appropriate 
_2 This research was supported by funds from Na- 
tional Institutes of Health Grant MH-03803-04 and 
National Science Foundation Grant GS-386. The 
authors wish to thank Blase Gambino for his com- 
ments on the manuscript, and the Research Computing 


Center, University of Massachusetts, for assistance in 
analyzing data, 


toggle switch. The event light remained on 
for 3 sec. All Ss were run for 300 trials in 
groups of four. 

Results.—An analysis of variance was per- 
formed on P(Ax), the proportion of predic- 
tions of E;, the more frequent event, in each 
50-trial block. Since no effects involving 
risk approached significance, the value of 
P(41:) in Table 1 are averaged over the 
three risk levels. P(Ax) is higher under 
the RLB procedure than under the RSB 
procedure and at v of .75 than at m of .60. 
The effect of increases over trials. The 
Sequence X Trial interaction is also signifi- 
cant; sequence effects are greatest during the 
middle trial blocks. All of these effects as 
well as the interaction of m, sequences, and 
trials are significant at the .01 level. 

Significantly more correct responses were 
made at v of .75 than at m of 60 (p<.01), 
and under the RSB procedure than under the 
RLB procedure (p.01). Sequence and m 
both interact with trials (5 < .01). 

An index of the degree to which S's ac- 
curacy is related to the pattern of events 
may be obtained by adjusting for the fre- 
quency of responses and events, If the se- 
quence of responses is independent of the 
sequence of events, the expected number of 
correct responses would be 


50[P (4:)7 + (1— P(CA:)) (1— 7)], 


where 50 is the trial block size. Subtracting 
this value from the observed number of 
correct responses results in the D scores of 
Table 1, which provide an index of the extent 
to which Ss have learned something about 
the patterns of events. The D scores ob- 
tained under the RLB procedure are gen- 
erally close to zero, indicating that Ss in 
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TABLE 1 


PROBABILITY OF Ai RESPONSES AND D SCORES FOR EACH COMBINATION OF 
SEQUENCE, z, AND TRIAL BLOCK 


Trial Block 
Measure | x | Sequence 
1 2 3 5 6 
415 RLB 0.680 0.847 0.861 0.917 0.911 0.900 
P(A) 45 RSB 0.627 0.763 0.830 0.834 0.844 0.861 
60 RLB 0.521 0,655 0.690 0.705 0.707 0.668 
60 RSB 0.523 0.588 0.587 .660 0.639 0.697 
E] RLB 0.542 —0.823 0.542 0.646 —1.396 —1.031 
D Scores A5 RSB 0.646 1.167 3.031 2.635 4.385 1.698 
«60 RLB —0.150 —0.210 —1.067 —0.300 0.983 2.342 
60 RSB 1.317 2.058 3.233 4.792 2.850 0.158 


these groups are no more accurate than one 
would predict assuming the independence of 
responses and events. The RSB scores are 
significantly higher ($ < 01), indicating that 
these Ss have learned to pattern their re- 
sponses in a manner which results in better 
than chance performance. Neither + nor 
risk is a significant variable in this analysis. 
The quadratic components of trials and Se- 
quences X Trials are both significant (p 
« 01), apparently due to the RSB data in 
which the D scores increase for the first two 
thirds of the experiment and then decline. 

For each 100-trial block, Fig. 1 presents 
the average probability of the prediction of 
E. given that the prediction has been im- 
mediately preceded by a run of consecutive 
Evs (solid line), as well as the probability 
that E does follow a run of consecutive Ev's 
(dashed line). In general, the RSB re- 
sponse and event functions are more nearly 
parallel than are the RLB curves In almost 
all instances, the occurrence of the first event 
in a run results in a clear increase in the 
probability of predicting that event; the only 
exceptions occur in the last two blocks of As 
data in the .75, RSB condition. Negative 
recency, a decrease in response probability as 
run length increases, is quite marked in the 
two RSB groups. The .60, RLB curves also 
exhibit negative recency, though decidedly 
less than do the RSB curves. The .75, RLB 
curves exhibit even less decline; the As 
curves in this condition actually exhibit 
marked positive recency, an increase in re- 
sponse probability as run length increases. 

Discussion—The results of this experi- 
ment demonstrate that choice behavior is 
strongly influenced by the distribution of 
runs of homogeneous events. Randomiza- 
tion over larger blocks yields higher P(A:) 
values, lower frequency of correct responses, 
and less negative recency. These results can 
be accounted for by the fact that the mean 
and variability of run lengths is higher for 
the RLB than for the RSB procedure. The 


higher mean would lead Ss in the RLB group 
to expect longer runs of events, and they 
would therefore be less prone to predict the 
alternative event as a run increased in length. 
The higher variability of run lengths would 
result in more stimulus uncertainty for Ss 
in the RLB condition. Consequently, they 
have a greater tendency than Ss in the RSB 
condition to rely upon a simple perseverative 
strategy. Thus, differences in mean and 
variability both operate to produce less nega- 
tive recency and higher P(A) values under 
the RLB condition. The greater stimulus 
uncertainty which accompanies greater vari- 
ability of run lengths would also seem to 
account for the fact that Ss in the RLB 
condition were less accurate than Ss in the 
RSB condition. 

The finding that risk does not produce sig- 
nificant differences in either P(A:) or re- 
sponse accuracy is in agreement with results 
reported by many investigators using a be- 
tween-S paradigm (e.g., Derks, 1962; Myers, 
Fort, Katz, & Suydam, 1963; Schnorr, Lip- 
kin, & Myers, 1966). The absence of a 
significant Risk X Sequence interaction sug- 
gests that the conflicting results on incentive 
effects in probability learning cannot be ex- 
plained merely in terms of differences. in 
sequence randomization procedures. 
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MAGNITUDE OF THE DOUBLET EFFECT AS A FUNCTION OF LOCATION 
IN A VERBAL MAZE1 


R. TERRY PIVIK 
University of Wyoming 
AND CHARLES P. THOMPSON 
Kansas State University 


RICHARD T. HEINE 
University of Michigan 


160 Ss were divided into 7 experimental (a single doublet located at 
Positions 2 & 3, 4 & 5, 6 & 7,8 & 9, 10 & 11, 12 & 13, and 14 & 15, 
respectively) and 1 control (no doublet) conditions. Each S learned 
a l6-unit, 8-choice verbal maze. The results demonstrate that the 
magnitude of the doublet effect increases and then decreases as the dif- 
ficulty of the portion of the maze in which the doublet is located 


increases. 


Introduction of a doublet (two successive 
identical responses) into a verbal maze pro- 
duces an increase in errors at the second 
position of the doublet (Thompson, Voss, & 
Brogden, 1957). This effect is produced by 
S's difficulty in identifying the doublet 
(Thompson et al, 1957). One might reason- 
ably assume that difficulty of doublet identi- 
fication would vary with the difficulty of the 


locations were used and that the difficulty of 
the maze was increased by using eight, rather 


served as Ss, All S. 


A Tepeated-measures design with seven 
experimental groups and one control group 
was used. Each of the 160 Ss was assigned 
randomly to one of the eight groups. Each 

was required to learn a 16-unit, 8-choice 
verbal maze, The eight choices were the 
The mazes 


ferent control mazes Were used with seven 
experimental mazes constructed from each 
of the control mazes by inserting a single 


1 This investigation was conducted at the University 
of Wyoming and was support, 
University of Wyon on as 


part b: 
from the 
Council oming Graduate keent 
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doublet at Positions 2 and 3, 4 and 5, 6 and 
7, 8 and 9, 10 and 11, 12 and 13, and 14 and 
153 respectively. The control maze was 
modified as little as possible when the doublet 
was inserted. 

The Ss were given the instructions used 
by Thompson (1965). After any questions 
pertaining to the instructions were answered, 
learning trials were begun. Acquisition 
trials continued until S reached the criterion 
of one errorless trial, The intertrial interval 
was held at 30 sec, Each response given at 
each position on each trial and time for each 
trial were recorded for each S. 

Results and discussion.—Analyses of vari- 
ance performed on tota] trials, total errors, 
and total time to criterion showed no signifi- 
cant differences between groups. This is in 
agreement with the results of previous 
studies on the doublet, 

Following the lead of Ernst et al. (1960), 
a serial-position analysis was performed on 
the transformed (%E—9%C) errors data. 
(The control group percent means for each 
position were Subtracted from the percent 
errors of each experimental S' at the com- 
Since variations in the 


respectively, 


A Significant Doublet Location X Serial 
Position. within Doublet interaction, F (6, 


a 
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TABLE 1 


MEANS FOR THE SECOND DOUBLET POSITION BY 
LOCATION TOGETHER WITH THE RESULTS OF THE 
TUKEY TEST FOR MULTIPLE COMPARISONS 
PERFORMED ON THESE MEANS 


Location (Serial Position) 


s [ula] s[s|7 |» 


%E-%C Ms| -1.4 .7 17 28 48 59 7.0 


Note.—Ms underlined by the same line do not differ 
significantly. 


133) — 6.64, p<.01, demonstrated that the 
magnitude of the doublet effect varies as a 
function of doublet location. A subsequent 
Tukey test for multiple comparisons ? 
(Scheffé, 1959) showed that the doublet first 
position %E — %C means do not differ sig- 
nificantly from zero (control group level). 
Hence only the second position means, to- 
gether with the results of the Tukey test, 
are given in Table 1. (It should be noted 
that, since the first position means do not 
differ significantly, a significant doublet loca- 
tion main effect, F (6, 133) =3.63, p<.01, 
also reflects the variation in the second posi- 
tion means.) 

The %E—%C errors at the second 
doublet position as a function of doublet loca- 
tion in the verbal maze are plotted in Fig. 1. 
This graph shows clearly the variation in the 
magnitude of the doublet effect as a function 
of the difficulty of the portion of the maze 
in which the doublet is located. The marked 
decrease in magnitude of the doublet effect 
at the most difficult portion of the maze 
(Position 11) is puzzling. However, inspec- 
tion of the data revealed that all experi- 
mental mazes with a doublet in that location 
showed a reduction in the magnitude of the 
doublet effect relative to the experimental 
maze (constructed from the same control 
maze) with a doublet located at Positions 8 
and 9, This suggests that the reduction in 
magnitude of the doublet effect at Position 11 
is not simply a sampling or maze construc- 

LEY i: 

P URL. USD imis pravel tore 
significant. Figure 1 quite clearly shows the contribu- 
tion of the quadratic component but the contributions 
of the quintic and sextic components are more difficult 
to interpret. Since the trend test did not clearly point 
out the dramatic reduction in the magnitude of the 
doublet effect at Position 11, the authors chose to use 
the multiple comparisons test which demonstrates that 


the magnitude of the doublet effect is ificantl: 
duced from Position 9 to Position 11. eas 


WEAN XE-XC ERPORD. 


D s T > D " w 
SERIAL LOCATION or THE SECOND DOUBLET Ponton 


Fic, 1. %E—%C errors at the second doublet 
position as a function of the location of the doublet in 
the verbal maze. 


tion error. Therefore, assuming that the 
order 3, 15, 5, 13, 7, 9, 11 probably quite 
accurately reflects the order of difficulty of 
positions in the mazes, the results suggest 
that the magnitude of the doublet effect in- 
creases and then decreases as the difficulty of 
the portion of the maze in which the doublet 
is located increases. 

As noted previously, the doublet effect 
results from S’s difficulty in identifying the 
doublet. Other investigators (e.g, Brown, 
Rubin, & Volkuwitz, 1965) have offered data 
suggesting that Ss use additional (or alterna- 
tive) cues when learning the most difficult 
portion of a serial list. Therefore, one might 
hypothesize that the decrease in magnitude 
of the doublet effect in the most difficult por- 
tion of the maze results from S’s search for 
additional cues which leads to more rapid 
identification of the doublet. In view of this 
hypothesis, the results of this study suggest 
that additional research on the doublet, utiliz- 
ing designs in which difficulty of doublet 
identification and difficulty of the verbal maze 
are systematically manipulated, would be use- 
ful in determining the manner in which Ss 
use additional cues in the most difficult por- 
tions of a serial list. 
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A FAILURE TO REPLICATE THE INHIBITORY EFFECTS OF A SECOND 
STIMULUS FOLLOWING THE PRIMARY STIMULUS TO REACT 


JAMES H. KOPLIN, ROBERT FOX, AND FRANK DOZIER 1 
Vanderbilt University 


stimulus, de t 
The Ist employed procedures similar 


the 2nd tested the influence of 2 procedural variables, 


Helson and Steger (1962) report data 
which indicate that simple reaction time 
(RT) to a visual stimulus (S1) was signifi- 
cantly increased by the onset of a second 
visual signal (Ss) which followed the first 
Over a set of 18 delay intervals ranging from 
10 to 180 msec. The retroactive effect was 
an inverted U-shaped function of the delay 
interval (AT) with maximum retardation of 
RT at AT values around 100 msec, Al- 
though Ss were instructed to react to S; and 
had no explicit set regarding Ss, the second 
signal apparently could intercept the reaction 
of S, at comparatively late stages in its 
development, 

Lappin and Eriksen (1964), using five 
AT values (30-120 msec.), failed to obtain 
Several changes in 
their Procedure could account for the conflict- 


T second experiment 

tested specific hypotheses about (a) the role 

of fixation and (b) the effect of uncertainty 
Si. 


Ace of the black Background was s fq 
A transluminated 15-min. red fixation cross 


1F. Dozier was a recipient of an NSF/URP stipend 
while participating in this st: Gran: 
to Vanderbilt University. "4% Grant No, GE 407 
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A bank of interval timers controlled all intra- 
trial events, The S, and S» (upon onset) 


remained on until reset, A 1,000-cycle tone . 


of .1 sec. duration served as the ready signal, 
The RTs were measured by a Hunter 
klockcounter, 

The 10 Ss randomly assigned to the ex- 
perimental group each received 20 trials 
under each AT condition (0-180 msec. in 
20-msec. steps) plus 20 control trials when 
only a single stimulus was presented. These 

trials were divided into Íour blocks of 
55 trials each. Five trials of each AT con- 
dition and 5 control trials were randomized 
within each block. The left or right position 
of S; and the six foreperiod values (from .5 
to 3.0 sec.) were randomly assigned within 


and Steger (1962, P. 202). In essence, § 
was told to fixate the S: lamp designated on 
each trial and to react to its onset. Twenty 
Practice trials were given. During the ex- 
periment trials with RTs greater than 400 
msec. were repeated, 

Results —The mean RTs are presented in 
Table 1. analysis of variance of these 
data indicate no significant differences be- 


n y For the purpose of 
direct comparison, the data of the experi- 


) None of the relevant sources of 
variance approached significance, 
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TABLE 1 


MEAN RTs or THE EXPERIMENTAL GROUP (SINGLE AND DOUBLE STIMULI) AND FOR A 
COMPARABLE CONTROL GROUP (SINGLE STIMULI ONLY) 


A T Values 
Group 
Single 0 20 40 60 80 100 120 140 160 180 
Experimental 242.4 245.2 | 242.0 | 240.0 | 240.6 | 245.2 | 242.4 | 243.2 | 245.9 | 242.7 | 242.6 
Control 237.0 
N =10 


an examination of the data for each experi- 
mental S yielded no pattern suggesting an 
inhibitory effect. 

Although the variance in Exp. I was 
greater than that reported by Helson and 
Steger, comparisons (e.g, SDs of each S's 
responses) with other recent RT experi- 
ments indicate that our data have comparable 
variability. While there appears to be no 
explanation for the small variance obtained 
in the original experiment, it seems unlikely 
that variance differences could be responsible 
for the failure to replicate the inhibitory 
effect of Sa 


Experiment II 

The display used by Helson and Steger 
contained a fixation point, yet there was no 
specific reference to this point in the pub- 
lished instructions. This implicit ambiguity 
plus the unexplained onset of S; may have 
produced a state of uncertainty for S regard- 
ing the exact nature of his task. It is im- 
possible to clearly specify the effect of this 
state on RT, but there is evidence to indicate 
uncertainty regarding the possible occurrence 
of either of two stimulus events will produce 
increased RT (Kristofferson, 1965). Ex- 
periment II was performed in order to clarify 
the potential role of a fixation point and un- 
certainty as partial mediators of Helson and 
Steger's results. 

Method.—Five Ss were run for 960 trials 
each on the same apparatus. The trials were 
arranged in 16 blocks of 60 trials each. Each 
block contained 10 trials at each of five AT 
values (30-150 msec. in 30-msec. steps) plus 
10 single-stimulus trials. Half the trials 
in each block were informed (S was told 
that the left or right stimulus would function 
as S, as in Exp. I) and half the trials were 
uncertain (S was told that the probability 
of either stimulus appearing first on each 
trial was 5). The position of S: and the six 
foreperiod values was randomized with the 
restriction that each occur an equal number 
of times in the schedule. 


The instructions were the same as Exp. I 
except that stress was placed on the import- 
ance of fixating the cross at the beginning 
of each trial. This instruction was reviewed 
at the beginning of each of the four daily 
sessions. Each session began with a series 
of 20 practice trials. 

Results—The means for each of the treat- 
ment combinations are presented in Table 2. 
An A (16 Blocks) X B (Informed-Uncer- 
tainty) X C (5 AT Intervals plus Control) 
analysis of variance indicated Blocks, F (15, 
60) = 3.56, p < .001, as a significant source of 
variance. The only other significant factor 
was the Informed-Uncertainty condition, F 
(1, 4) 29.14, p<.05. The importance of 
this variable in accounting for the total vari- 
ance is shown by an estimate of statistical 
association (w*=.79). The Blocks X AT 
Intervals interaction reached borderline sig- 
nificance, F (75, 300) — 1.31, p<.06. A 
plot of this interaction did not disclose any 
trends worth additional attention. Separate 
analyses of variance were calculated for each 
of the five Ss. The AT Intervals variable 
was not significant in all cases, mean F (5, 
180) —1.38. The  Informed-Uncertainty 
variable was significant for four of the five 
Ss, F (1, 180) =7.62-73.80, .001 € p <.01. 

These data indicate uncertainty does in- 
crease RT. However, the increase is con- 
stant and independent of the temporal onset 
of Sa 


Discussion 
No evidence for an inhibitory effect of Ss 
is supplied by the data from Exp. I and Exp. 


TABLE 2 
Mean RTs For Exp. II 


AT Values 


Cond. 


Single] 30 60 90 | 120 | 150 


Informed _ | 209.5 | 206.1 | 208.7 | 209.5 | 206.2 | 208.2 
Uninformed | 221.2 | 215.4 | 218.6 | 220.8 | 218.5 | 219.4 
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IL Apparently the Ss effect occurs only 
under rather specific conditions. If so, cer- 
tain procedural differences or stimulus condi- 
tion differences between these experiments 
and Helson and Steger's experiment may be 
relevant. The major procedural difference 
common to the unsuccessful replications has 
been the use of an automatically controlled 
nonverbal ready signal. Stimulus changes 
have included the omission of the fixation 
cross (Lappin & Eriksen, 1964) or emphasis 
on the fixation cross (Exp. II), the inclu- 
sion of an uninformed condition (Exp. II), 
and variations in luminance level and area 
of Si and Sa targets. Failure of metacontrast 
masking to effect simple RT (Fehrer & 
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Raab, 1962) would be consistent with the 
notion of specificity of the Sa effect. 
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FIGURAL AFTEREFFECTS AS A FUNCTION OF HUE 


F. T. CRAWFORD anp ROGER L. KLINGAMAN 
Florida State University 


A previous experiment showed that chromatic stimuli generated 
figural aftereffects but failed to demonstrate differential effects due to 
differential hues. In this study 20 Ss observed gray, orange, and blue 


inspection figures equated for Munsell value and chroma. 


Observa- 


tions were made with a viewing box housing an illuminant C source. 
The results indicated that the samples produced aftereffects which were 


significantly different from 


A previous investigation of the displace- 
ment of figural aftereffects as a function of 
hue (Crawford & Kubala, 1964) reported 
that two chromatic stimuli produced greater 
displacements than an achromatic stimulus, 
but no significant difference between after- 
effects due to chromatic stimuli. In that 
study the brightnesses of an orange, a blue, 
anda gray were equated by psychophysically 
determined heterochromatic brightness 
matches using an illuminant C source. The 
relative reflectances of the stimuli were 
reasonably similar but the orange stimulus 
was somewhat more saturated than the blue 
and gray. The present experiment attempted 
to improve upon the former by using stimuli 
of equated Munsell value and chroma, a 
greater number of Ss, and a modified test 
apparatus, 

Method.—The Ss of the experiment were 
18 volunteers that were either psychology 
majors or were enrolled in elementary psy- 
chology classes. The Ss were selected be- 
cause of their ability to observe figural after- 


each other, 
increasing from gray to orange to blue. 


with the order of magnitude 


effects with minimal practice. The investi- 
gators also served as Ss for the experiment, 
Eight additional Ss were discarded because 
of an inability to see the effect using achro- 
matic stimuli, or for interference due to after- 
images. A ninth S was discarded because of 
color blindness. 

Data were collected using the viewing box 
previously described (Crawford & Kubala, 
1964). The viewing box was L shaped with 
a peephole at the front end for monocular 
viewing. The viewing distance was 4 ft. 
The upright portion of the viewing box 
housed a MacBeth illuminant C source. It 
illuminated a 9 X 9 in, visual field. Two par- 
titions within the apparatus had apertures cut 
30 that they coincided with the visual field. 
The partitions also served to reduce the scat- 
tering of light within the box. The inspec- 
tion and test figures were of the same pat- 
tern previously employed (Kóhler & Wal- 
lach, 1944, Fig, 41; also to be seen in 
Graham, 1965, Fig. 19.7) but were enlarged 
to permit measurement of greater movement 
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of the test figures. The inspection figures 
were rectangles measuring 2X14 in. faced 
with matte finish Munsell papers cemented 
to sheets of white matte board. The Munsell 
notations for the inspection figures were N/, 
B 5/6, and YR 5/6. The distance between 
the rectangles of the inspection figure on the 
viewer's right was 43 in. The horizontal 
distance between the rectangles was 1 in. 

"The end wall of the apparatus, and viewing 
surface for the test figures, was an aluminum 
plate, polished and painted a flat white in- 
distinguishable in texture and reflectance 
from the matte board sheets. The test fig- 
ures were l-in. squares of aluminum polished 
and painted in the same manner as the end 
wall. The edges of the test figures were 
painted flat black. The test figures were 
mounted on steel pins extending through slots 
jn the end wall. Behind the end wall the 
pins were fitted into a geared mounting which 
permitted the vertical pairs to move in oppo- 
sition to each other, ie. toward or away 
from each other. Each of the vertical pairs 
could be manipulated independently of the 
other by rotation of controls having cable 
and pulley operation upon the geared shafts. 
The distance between the test figures hori- 
zontally, and vertically when set to be equi- 
distant, was 2 in. The vertical distance be- 
tween the test figures on the viewer’s left 
were fixed at 2 in. in the present experiment. 
A disk calibrated in 5° intervals indicated a 
total movement of .1250 in. for each 360° 
rotation. 

The Ss were informed of the general 
nature of figural aftereffects using the Gibson 
(1933) curved-line effect as an illustration. 
Instructions on how to manipulate the appa- 
ratus were also given. The Ss were not 
informed that they were about to examine 
inspection figures of achromatic and chro- 
matic stimuli. This precaution was taken 
as an attempt to avoid the effects of set 
(Gaze & Dodwell, 1965). The S discarded 
for color blindness was discovered to be color 
blind when first encountering the blue inspec- 
tion figure. He reported that the blue stim- 
ulus produced a much greater effect than the 
“green,” (ie, the gray) stimulus. Testing 
was done in a dimly illuminated room follow- 
ing a brief period of dark adaptation. Each 
S received 10 preliminary trials on the gray 
inspection figures. The Ss were then given 
10 trials on each of the gray, blue, and orange 
inspection figures. The trials for a given 
color were presented in succession but order 
effects of the colors were randomized among 
Ss. The effect of afterimages upon the 
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TABLE 1 


ANALYSIS OF VARIANCE FOR MAGNITUDE OF 
DISPLACEMENT IN DEGREES 


Source af F 
Color (C) 2 | 29.25* 
Ss (S) 19 | 8041** 
CS 38 6.45** 
Within Replicates (Mean Square) 540 | (6,927.20) 
Total 599 
*p «.05 
**p c .0 


performance of Ss was minimal in nearly all 
cases. Data from two Ss were discarded 
because of an inability to adapt to the after- 
images. Although most Ss had no difficulty 
with afterimages, the afterimage compliment 
of the orange stimulus was the most pro- 
nounced and distracting. This afterimage 
compliment closely resembled the blue in- 
spection stimulus. 

On each trial S observed the inspection 
figure for 60 sec. The E then turned out the 
illuminant C source within the apparatus, 
withdrew the inspection figures, and turned 
on the source again. The S then adjusted 
the movable squares until they appeared equal 
in vertical distance to the corresponding 
square on the left. The extent of the test- 
figure movement in degrees was recorded. 
There was an approximate 1-min. interval 
between trials on a given set of inspection 
figures. Rest intervals were provided be- 
tweeen each series of 10 trials. 

Results and discussion.—The analysis of 
variance for magnitude of displacement in 
degrees is shown in Table 1. Both the main 
effects and the interaction were significant. 
Thus, there was a consistently significant re- 
sponse to the color variable, but there were 
great differences in the magnitude of effect 
observed by Ss, and the order of magnitude 
of effect for color was not always the same 
among Ss. The Neuman-Keuls procedure 
for individual comparisons (Winer, 1962) 
showed that the blue, orange, and gray 
samples produced effects that are signifi- 
cantly different from each other. In the 
previous study no significant difference had 
been found between the blue and orange 
samples. In both studies the total displace- 
ment of the chromatic and achromatic stim- 
uli were similar. In the first study the 
mean absolute displacements were: gray, 
091 in.; orange, .105 in.; and, blue, .103 in. 
In the present study the mean absolute dis- 
placements were: gray, 110 in.; orange, 
122 in.; and, blue, .132 in. These displace- 
ments corresponded to 7 min. 53 sec., 8 min. 
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44 sec., and 9 min. 27 sec. of visual angle for 
the gray, orange, and blue samples, respec- 
tively. The differences in minutes of arc 
between the samples are of sufficient magni- 
tude to be attributable to the differential 
effects of hue (Graham, 1965). The dis- 
placements observed in the present experi- 
ment are substantially greater than those 
reported by Hammer (1949), who reported 
displacements of 1-2 min. of visual angle for 
a viewing distance of 14 sec. The disparity 
in the magnitude of the aftereffects is pre- 
sumably due to the different inspection and 
test figures employed, and the difference in 
method used for measuring the extent of the 
aftereffects, Previous studies reporting no 
differences in aftereffects due to hue (eg., 
Hochberg, Day, & Hardy, 1960; Hochberg & 
Triebel, 1955) have employed uncontrolled 
levels of illumination and have been based 
upon frequency data. In view of the small 
differences in visual angle between hues 
found in the present experiment, together 
with the large overall displacements, it is 
not surprising that other studies have re- 
ported negative results, 
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The results of the present experiment are 
consistent with the conclusion that hue is 
capable of generating aftereffects independent 
of brightness differences, and in addition, 
that there may be differential effects as a 
function of differential hue. 
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. NEW MINIATURIZED 
3-CHANNEL TACHISTOSCOPE 
FOR 35mm TRANSPARENCIES 


MODEL GB-AUTO 


The Model GB Auto-Tach consists of a standard three channel tachisto- 
scope control system and a miniature three channel optical system which is 
designed for transillumination of 35mm transparencies. A beam splitter 
viewing head with adjustable interocular separation is provided. 


Two of the three fields are provided with automatic stimulus changers which 
operate in approximately one second and locate the slide holders with a 
precision of 0.02 inches. The third field (i.e., "Blank") is ordinarily used for 
pre- or post-exposure adaptation or fixation and is equipped with a manual 
(drop-in) holder. Additional drop-in filter holders are provided in each 
channel. 


The high brightness of transillumination systems and the high speed medium 
capacity changers (36 slide holder standard—100 slide drum available) make 
this the most convenient commercially available tachistoscope. 


PRICE $5,400 


Write for full, detailed specifications 


@ SCIENTIFIC PROTOTYPE 


615 W. 131st Street New York 10027 
Telephone (212) 368-6000 


Reed relay programming and control equipment, cages, feeders, manipulanda, cumulative and event 
recorders, soundproof chambers, power supplies, audio equipment, special shock sources, closed circuit 
T.V. equipment, 2, 3 and 6 channel tachistoscopes, solid-state programming equipment, and special 
devices and systems. 


A NEW 
A.C. MOTOR DRIVEN 


Dispenser 


FOR 


Monkey Pellets 
Marbles 
M & M Candies 


FEATURES 


* INSTANT DELIVERY OF PELLET—NO LATENCY 


* LOW D.C. CURRENT DRAIN 

* HIGH RELIABILITY 

.* LOW NOISE LEVEL : 
* DURABLE-LONG LIFE CONSTRUCTION 


* CAN BE MODIFIED TO DELIVER ONE OR TWO 
PELLETS ON COMMAND 


* WRITE OR PHONE FOR DETAILS NOW 
« Ralph Gerbranda Company 


Scientific I 
8 Beck Road tentific Instruments 


Tel. 648-6415 
Arlington, Mass. 02174 
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